{"@context":"http://iiif.io/api/presentation/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/manifest.json","@type":"sc:Manifest","label":"On miRNA-mRNA network extraction and ultra-fast nucleotide barcodes clustering algorithm","metadata":[{"label":"dc.description.sponsorship","value":"This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree."},{"label":"dc.format","value":"Monograph"},{"label":"dc.format.medium","value":"Electronic Resource"},{"label":"dc.identifier.uri","value":"http://hdl.handle.net/11401/77410"},{"label":"dc.language.iso","value":"en_US"},{"label":"dc.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.abstract","value":"This thesis consists of two topics: (1) discovery of microRNA/mRNA regulatory networks on essential thrombocytosis (ET), and (2) a novel ultrafast clustering algorithm to count nucleotide barcode and amplicon reads with errors. The objective of the first study is to discover miRNA-mRNA regulatory networks related to ET, a chronic myeloproliferative disorder with an unregulated surplus of platelets. Complications of ET include stroke, heart attack, and formation of blood clots. While the genetic basis of ET has been studied to some extent, no direct diagnostic test is available to date. In this study, we aim to identify novel ET-related miRNA-mRNA regulatory networks through comparisons of transcriptomes between healthy control and ET patients. Four network discovery algorithms have been employed, including (a) Pearson correlation network, (b) sparse supervised canonical correlation analysis (sparse sCCA), (c) sparse partial correlation network analysis (SPACE), and, (d) (sparse) Bayesian network analysis \u2013 all through a combination of data-driven and knowledge-based analyses. 
The result predicts a close relationship between 8 miRNAs (including miR-9, miR-490-5p, miR-490-3p, miR-182, miR-34a, miR-196b, miR-34b*, miR-181a-2*) and a 9-mRNA set (including CAV2, LAPTM4B, TIMP1, PKIG, WASF1, MMP1, ERVH-4, NME4, HSD17B12). The majority of the identified variables have been linked to hematologic function by a sizable number of studies. Furthermore, it is observed that the selected mRNAs are highly relevant to ET disease. The study will shed light on understanding the etiology of ET. The objective of the second study is to develop an ultrafast and accurate clustering algorithm and software to detect barcodes, certain DNA sequences, and their abundances from raw next-generation barcode sequencing (bar-seq) data. Although bar-seq use has been quickly growing, the computational pipelines for its analyses have not been well developed. Available methods are slow and often result in over-clustering artifacts that group distinct barcodes together. Here, we developed a software package called Bartender, which employs a divide-and-conquer strategy for fast implementation and a modified two-sample proportion test for cluster merging. Additionally, Bartender includes a \u201cmultiple time point\u201d mode that matches barcode clusters between different clustering runs for seamless handling of time course data. 
For both simulated and real data, Bartender clusters millions of unique barcodes in a few minutes at high accuracy (>99.9%), and is ~100-fold faster than previous methods."},{"label":"dcterms.available","value":"2017-09-20T16:52:38Z"},{"label":"dcterms.contributor","value":"Gao, Yi"},{"label":"dcterms.creator","value":"Zhao, Lu"},{"label":"dcterms.dateAccepted","value":"2017-09-20T16:52:38Z"},{"label":"dcterms.dateSubmitted","value":"2017-09-20T16:52:38Z"},{"label":"dcterms.description","value":"Department of Applied Mathematics and Statistics"},{"label":"dcterms.extent","value":"103 pg."},{"label":"dcterms.format","value":"Monograph"},{"label":"dcterms.identifier","value":"http://hdl.handle.net/11401/77410"},{"label":"dcterms.issued","value":"2016-12-01"},{"label":"dcterms.language","value":"en_US"},{"label":"dcterms.provenance","value":"Made available in DSpace on 2017-09-20T16:52:38Z (GMT). No. of bitstreams: 1\nZhao_grad.sunysb_0771E_12926.pdf: 4117443 bytes, checksum: a7cb1f93ba8961fd864c0547ae4366da (MD5)\n Previous issue date: 1"},{"label":"dcterms.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.subject","value":"barcode, bar-seq, clustering, ET, miRNA-mRNA regulatory network, Sparse modeling"},{"label":"dcterms.title","value":"On miRNA-mRNA network extraction and ultra-fast nucleotide barcodes clustering algorithm"},{"label":"dcterms.type","value":"Dissertation"},{"label":"dc.type","value":"Dissertation"}],"description":"This manifest was generated dynamically","viewingDirection":"left-to-right","sequences":[{"@type":"sc:Sequence","canvases":[{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json","@type":"sc:Canvas","label":"Page 
1","height":1650,"width":1275,"images":[{"@type":"oa:Annotation","motivation":"sc:painting","resource":{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/55%2F09%2F16%2F55091632010765009625076898391966635376/full/full/0/default.jpg","@type":"dctypes:Image","format":"image/jpeg","height":1650,"width":1275,"service":{"@context":"http://iiif.io/api/image/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/55%2F09%2F16%2F55091632010765009625076898391966635376","profile":"http://iiif.io/api/image/2/level2.json"}},"on":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json"}]}]}]}