{"@context":"http://iiif.io/api/presentation/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/manifest.json","@type":"sc:Manifest","label":"A Conditional Likelihood Based Model for Differential Expression Analysis for Paired RNA-seq Data","metadata":[{"label":"dc.description.sponsorship","value":"This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree"},{"label":"dc.format","value":"Monograph"},{"label":"dc.format.medium","value":"Electronic Resource"},{"label":"dc.identifier.uri","value":"http://hdl.handle.net/11401/78213"},{"label":"dc.language.iso","value":"en_US"},{"label":"dcterms.abstract","value":"Next generation sequencing (NGS) technology provides an attractive platform for genomic study. RNA-seq employs NGS technology to sequence and quantify RNA content in samples and reveal their gene expression profiles. In RNA-seq studies, one important objective is to identify the gene expression difference between two experimental conditions (e.g. control vs. treatment), which is known as differential expression (DE) analysis. Various statistical methods, such as edgeR and DESeq, have been developed to perform the two-sample DE analysis. However, in practice, expression data may come in pairs, e.g., pre-vs. post-treatment on the same individual, and new models incorporating this paired structure are in great demand. In this thesis, we propose a new analysis framework that directly takes into account the paired structure of RNA-seq data and perform the paired DE analysis. Normalization is a crucial pre-processing step for DE analysis. However, none of the currently available normalization methods are designed for paired RNA-seq data. We investigated all existing normalization methods through a series of simulation studies to gain insights about their applicability. Based on these, a customized normalization method (pairedNorm) has been proposed for paired RNA-seq DE analysis. Regarding the statistical test, we adopt the Poisson model for the paired RNA-seq data and propose a conditional likelihood framework, named as pairedBN, for parameter estimation and hypothesis testing. Unlike the other DE tests, the proposed method does not assume distribution of baseline expression level across samples and has no restriction on proportion of DE genes within a sample. The conditional likelihood framework is employed to reduce the nuisance parameters, e.g., the sample specific true expression levels, thus largely improving the computational efficiency. Furthermore, a non-parametric test procedure can serve as an ad-hoc procedure allowing for more flexibility of the data. We conduct an extensive comparison of our method (pairedBN) with two most popular methods, edgeR and DESeq, through simulation studies. The results show the superiority of pairedBN in FDR control while maintaining good sensitivity. We also apply our method to analyze a paired RNA-seq data from TCGA to demonstrate its practical usage."},{"label":"dcterms.available","value":"2018-03-22T22:39:19Z"},{"label":"dcterms.contributor","value":"Wu, Song."},{"label":"dcterms.creator","value":"Xu, Jianjin"},{"label":"dcterms.dateAccepted","value":"2018-03-22T22:39:19Z"},{"label":"dcterms.dateSubmitted","value":"2018-03-22T22:39:19Z"},{"label":"dcterms.description","value":"Department of Applied Mathematics and Statistics."},{"label":"dcterms.extent","value":"105 pg."},{"label":"dcterms.format","value":"Application/PDF"},{"label":"dcterms.identifier","value":"http://hdl.handle.net/11401/78213"},{"label":"dcterms.issued","value":"2017-08-01"},{"label":"dcterms.language","value":"en_US"},{"label":"dcterms.provenance","value":"Made available in DSpace on 2018-03-22T22:39:19Z (GMT). No. of bitstreams: 1\nXu_grad.sunysb_0771E_13396.pdf: 1331944 bytes, checksum: b8cfa8dee63c49db15631775a3eb24f2 (MD5)\n Previous issue date: 2017-08-01"},{"label":"dcterms.subject","value":"Biostatistics"},{"label":"dcterms.title","value":"A Conditional Likelihood Based Model for Differential Expression Analysis for Paired RNA-seq Data"},{"label":"dcterms.type","value":"Dissertation"},{"label":"dc.type","value":"Dissertation"}],"description":"This manifest was generated dynamically","viewingDirection":"left-to-right","sequences":[{"@type":"sc:Sequence","canvases":[{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json","@type":"sc:Canvas","label":"Page 1","height":1650,"width":1275,"images":[{"@type":"oa:Annotation","motivation":"sc:painting","resource":{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/16%2F61%2F01%2F166101674843126134566434384782894777862/full/full/0/default.jpg","@type":"dctypes:Image","format":"image/jpeg","height":1650,"width":1275,"service":{"@context":"http://iiif.io/api/image/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/16%2F61%2F01%2F166101674843126134566434384782894777862","profile":"http://iiif.io/api/image/2/level2.json"}},"on":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json"}]}]}]}