No Cover Image

Journal article 659 views 170 downloads

Inferring distributions from observed mRNA and protein copy counts in genetic circuits

Komlan Atitey, Pavel Loskot Orcid Logo, Paul Rees Orcid Logo

Biomedical Physics & Engineering Express

Swansea University Authors: Pavel Loskot Orcid Logo, Paul Rees Orcid Logo

Abstract

Defining distributions of molecule counts produced in the cell can elucidate stochastic dynamics of the underlying biological circuits. For genetic circuits, only a few distributions of messenger RNA and protein counts were reported in literature, so the task is to decide which of these candidate di...

Full description

Published in: Biomedical Physics & Engineering Express
ISSN: 2057-1976
Published: 2018
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa45470
Tags: Add Tag
No Tags, Be the first to tag this record!
first_indexed 2018-11-06T20:16:27Z
last_indexed 2019-09-23T14:09:22Z
id cronfa45470
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2019-09-23T04:33:34.5565739</datestamp><bib-version>v2</bib-version><id>45470</id><entry>2018-11-06</entry><title>Inferring distributions from observed mRNA and protein copy counts in genetic circuits</title><swanseaauthors><author><sid>bc7cba9ef306864239b9348c3aea4c3e</sid><ORCID>0000-0002-2773-2186</ORCID><firstname>Pavel</firstname><surname>Loskot</surname><name>Pavel Loskot</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>537a2fe031a796a3bde99679ee8c24f5</sid><ORCID>0000-0002-7715-6914</ORCID><firstname>Paul</firstname><surname>Rees</surname><name>Paul Rees</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2018-11-06</date><deptcode>EEN</deptcode><abstract>Defining distributions of molecule counts produced in the cell can elucidate stochastic dynamics of the underlying biological circuits. For genetic circuits, only a few distributions of messenger RNA and protein counts were reported in literature, so the task is to decide which of these candidate distributions best fit the observed data. In this paper, we present a statistical method to infer distributions of mRNA and protein counts from observed data. The main advantage of this method is that it does not require any prior assumptions or knowledge about underlying chemical reactions. In particular, a given distribution is fitted to the observed copy counts using a histogram with optimized bin sizes in order to reduce the fitting error. The goodness of fit is evaluated by Kolmogorov-Smirnov and chi-square statistical tests to accept or reject the hypothesis that observed molecule counts were generated from given distribution. The distribution fitting also yields the values of distribution parameters, or they can be estimated using the Bayes theorem. These parameters appear to be themselves random processes. The presented statistical framework for analyzing the observed mRNA and protein copy counts is illustrated for a simulated model of lac genetic circuit in Escherichia coli. For reaction rates assumed in the model, the results in literature predict that mRNA and protein counts at steady-state are gamma distributed. Our analysis shows that both mRNA and protein in the lac circuit model can be considered gamma distributed in at least 70% of times from the initial state until steady-state. The shape and scale parameters of observed gamma distributions are also gamma distributed, giving rise to double stochastic processes. More importantly, as shown previously, the distribution parameters are functions of transcription and translation rates, so presented statistical framework can be used to estimate or optimize reaction rates in biochemical systems.</abstract><type>Journal Article</type><journal>Biomedical Physics &amp; Engineering Express</journal><publisher/><issnElectronic>2057-1976</issnElectronic><keywords/><publishedDay>31</publishedDay><publishedMonth>12</publishedMonth><publishedYear>2018</publishedYear><publishedDate>2018-12-31</publishedDate><doi>10.1088/2057-1976/aaef5c</doi><url/><notes/><college>COLLEGE NANME</college><department>Engineering</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>EEN</DepartmentCode><institution>Swansea University</institution><apcterm/><lastEdited>2019-09-23T04:33:34.5565739</lastEdited><Created>2018-11-06T15:05:34.4695950</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Engineering and Applied Sciences - Biomedical Engineering</level></path><authors><author><firstname>Komlan</firstname><surname>Atitey</surname><order>1</order></author><author><firstname>Pavel</firstname><surname>Loskot</surname><orcid>0000-0002-2773-2186</orcid><order>2</order></author><author><firstname>Paul</firstname><surname>Rees</surname><orcid>0000-0002-7715-6914</orcid><order>3</order></author></authors><documents><document><filename>0045470-06112018150805.pdf</filename><originalFilename>atitey2018(2).pdf</originalFilename><uploaded>2018-11-06T15:08:05.4100000</uploaded><type>Output</type><contentLength>1198878</contentLength><contentType>application/pdf</contentType><version>Accepted Manuscript</version><cronfaStatus>true</cronfaStatus><embargoDate>2019-11-08T00:00:00.0000000</embargoDate><copyrightCorrect>true</copyrightCorrect><language>eng</language></document></documents><OutputDurs/></rfc1807>
spelling 2019-09-23T04:33:34.5565739 v2 45470 2018-11-06 Inferring distributions from observed mRNA and protein copy counts in genetic circuits bc7cba9ef306864239b9348c3aea4c3e 0000-0002-2773-2186 Pavel Loskot Pavel Loskot true false 537a2fe031a796a3bde99679ee8c24f5 0000-0002-7715-6914 Paul Rees Paul Rees true false 2018-11-06 EEN Defining distributions of molecule counts produced in the cell can elucidate stochastic dynamics of the underlying biological circuits. For genetic circuits, only a few distributions of messenger RNA and protein counts were reported in literature, so the task is to decide which of these candidate distributions best fit the observed data. In this paper, we present a statistical method to infer distributions of mRNA and protein counts from observed data. The main advantage of this method is that it does not require any prior assumptions or knowledge about underlying chemical reactions. In particular, a given distribution is fitted to the observed copy counts using a histogram with optimized bin sizes in order to reduce the fitting error. The goodness of fit is evaluated by Kolmogorov-Smirnov and chi-square statistical tests to accept or reject the hypothesis that observed molecule counts were generated from given distribution. The distribution fitting also yields the values of distribution parameters, or they can be estimated using the Bayes theorem. These parameters appear to be themselves random processes. The presented statistical framework for analyzing the observed mRNA and protein copy counts is illustrated for a simulated model of lac genetic circuit in Escherichia coli. For reaction rates assumed in the model, the results in literature predict that mRNA and protein counts at steady-state are gamma distributed. Our analysis shows that both mRNA and protein in the lac circuit model can be considered gamma distributed in at least 70% of times from the initial state until steady-state. The shape and scale parameters of observed gamma distributions are also gamma distributed, giving rise to double stochastic processes. More importantly, as shown previously, the distribution parameters are functions of transcription and translation rates, so presented statistical framework can be used to estimate or optimize reaction rates in biochemical systems. Journal Article Biomedical Physics & Engineering Express 2057-1976 31 12 2018 2018-12-31 10.1088/2057-1976/aaef5c COLLEGE NANME Engineering COLLEGE CODE EEN Swansea University 2019-09-23T04:33:34.5565739 2018-11-06T15:05:34.4695950 Faculty of Science and Engineering School of Engineering and Applied Sciences - Biomedical Engineering Komlan Atitey 1 Pavel Loskot 0000-0002-2773-2186 2 Paul Rees 0000-0002-7715-6914 3 0045470-06112018150805.pdf atitey2018(2).pdf 2018-11-06T15:08:05.4100000 Output 1198878 application/pdf Accepted Manuscript true 2019-11-08T00:00:00.0000000 true eng
title Inferring distributions from observed mRNA and protein copy counts in genetic circuits
spellingShingle Inferring distributions from observed mRNA and protein copy counts in genetic circuits
Pavel Loskot
Paul Rees
title_short Inferring distributions from observed mRNA and protein copy counts in genetic circuits
title_full Inferring distributions from observed mRNA and protein copy counts in genetic circuits
title_fullStr Inferring distributions from observed mRNA and protein copy counts in genetic circuits
title_full_unstemmed Inferring distributions from observed mRNA and protein copy counts in genetic circuits
title_sort Inferring distributions from observed mRNA and protein copy counts in genetic circuits
author_id_str_mv bc7cba9ef306864239b9348c3aea4c3e
537a2fe031a796a3bde99679ee8c24f5
author_id_fullname_str_mv bc7cba9ef306864239b9348c3aea4c3e_***_Pavel Loskot
537a2fe031a796a3bde99679ee8c24f5_***_Paul Rees
author Pavel Loskot
Paul Rees
author2 Komlan Atitey
Pavel Loskot
Paul Rees
format Journal article
container_title Biomedical Physics & Engineering Express
publishDate 2018
institution Swansea University
issn 2057-1976
doi_str_mv 10.1088/2057-1976/aaef5c
college_str Faculty of Science and Engineering
hierarchytype
hierarchy_top_id facultyofscienceandengineering
hierarchy_top_title Faculty of Science and Engineering
hierarchy_parent_id facultyofscienceandengineering
hierarchy_parent_title Faculty of Science and Engineering
department_str School of Engineering and Applied Sciences - Biomedical Engineering{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Engineering and Applied Sciences - Biomedical Engineering
document_store_str 1
active_str 0
description Defining distributions of molecule counts produced in the cell can elucidate stochastic dynamics of the underlying biological circuits. For genetic circuits, only a few distributions of messenger RNA and protein counts were reported in literature, so the task is to decide which of these candidate distributions best fit the observed data. In this paper, we present a statistical method to infer distributions of mRNA and protein counts from observed data. The main advantage of this method is that it does not require any prior assumptions or knowledge about underlying chemical reactions. In particular, a given distribution is fitted to the observed copy counts using a histogram with optimized bin sizes in order to reduce the fitting error. The goodness of fit is evaluated by Kolmogorov-Smirnov and chi-square statistical tests to accept or reject the hypothesis that observed molecule counts were generated from given distribution. The distribution fitting also yields the values of distribution parameters, or they can be estimated using the Bayes theorem. These parameters appear to be themselves random processes. The presented statistical framework for analyzing the observed mRNA and protein copy counts is illustrated for a simulated model of lac genetic circuit in Escherichia coli. For reaction rates assumed in the model, the results in literature predict that mRNA and protein counts at steady-state are gamma distributed. Our analysis shows that both mRNA and protein in the lac circuit model can be considered gamma distributed in at least 70% of times from the initial state until steady-state. The shape and scale parameters of observed gamma distributions are also gamma distributed, giving rise to double stochastic processes. More importantly, as shown previously, the distribution parameters are functions of transcription and translation rates, so presented statistical framework can be used to estimate or optimize reaction rates in biochemical systems.
published_date 2018-12-31T03:57:16Z
_version_ 1763752895394611200
score 11.012924