Overview

Authors:

Serghei Mangul ⁰,
Harry Taegyun Yang ¹,
Eleazar Eskin ²,
…
Noah Zaitlen ³

Serghei Mangul
1. Department of Computer Science, Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, USA
View author publications

You can also search for this author in PubMed Google Scholar
Harry Taegyun Yang
1. Department of Computer Science, University of California Los Angeles, Los Angeles, USA
View author publications

You can also search for this author in PubMed Google Scholar
Eleazar Eskin
1. Department of Computer Science, Department of Human Genetics, University of California Los Angeles, Los Angeles, USA
View author publications

You can also search for this author in PubMed Google Scholar
Noah Zaitlen
1. Division of Pulmonary, Critical Care, Sleep and Allergy, Department of Medicine, Cardiovascular Research Institute, University of California, San Francisco, USA
View author publications

You can also search for this author in PubMed Google Scholar

Focuses on revealing the features hidden in unmapped reads, which traditional RNA-sequencing research commonly regards as uninformative
Highlights how a comprehensive search for the origin of unmapped reads can reveal characteristics about the sample’s immune repertoire

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

3053 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 16.99 ~~USD 39.99~~

Discount applied Price excludes VAT (USA)

Softcover Book USD 16.99 ~~USD 54.99~~

Discount applied Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (1 chapter)

Front Matter

Pages i-v

Download chapter PDF
Hidden Treasures in Contemporary RNA Sequencing
- Serghei Mangul, Harry Taegyun Yang, Eleazar Eskin, Noah Zaitlen
Pages 1-93

Keywords

About this book

Advances in RNA-sequencing (RNA-seq) technologies have provided an unprecedented opportunity to explore the gene expression landscape across individuals, tissues, and environments by efficiently profiling the RNA sequences present in the samples. When a reference genome sequence or a transcriptome of the sample is available, mapping-based RNA-seq analysis protocols align the RNA-seq reads to the reference sequences, identify novel transcripts, and quantify the abundance of expressed transcripts.
The reads that fail to map to the human reference, known as unmapped reads, are a large and often overlooked output of standard RNA-seq analyses. Even in carefully executed experiments, the unmapped reads can comprise a considerable fraction of the complete set of reads produced, and can arise due to technical sequencing produced by low-quality and error-prone copies of the nascent RNA sequence being sampled. Reads can also remain unmapped due to unknown transcripts, recombined Band T cell receptor sequences, A-to-G mismatches from A-to-I RNA editing, trans-splicing, gene fusion, circular RNAs, and the presence of non-host RNA sequences (e.g. bacterial, fungal, and viral organisms). Unmapped reads represent a rich resource for the study of B and T cell receptor repertoires and the human microbiome system—without incurring the expense of additional targeted sequencing.
This book introduces and describes the Read Origin Protocol (ROP), a tool that identifies the origin of both mapped and unmapped reads. The protocol first identifies human reads using a standard high-throughput algorithm to map them onto a reference genome and transcriptome. After alignment, reads are grouped into genomic (e.g. CDS, UTRs, introns) and repetitive (e.g. SINEs, LINEs, LTRs) categories. The rest of the ROP protocol characterizes the remaining unmapped reads, which failed to map to the human reference sequences.

Authors and Affiliations

Department of Computer Science, Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, USA

Serghei Mangul
Department of Computer Science, University of California Los Angeles, Los Angeles, USA

Harry Taegyun Yang
Department of Computer Science, Department of Human Genetics, University of California Los Angeles, Los Angeles, USA

Eleazar Eskin
Division of Pulmonary, Critical Care, Sleep and Allergy, Department of Medicine, Cardiovascular Research Institute, University of California, San Francisco, USA

Noah Zaitlen

About the authors

Dr. Serghei Mangul is a Postdoctoral Fellow at the University of California, Los Angeles, where he holds a Collaboratory Fellowship from the Institute for Quantitative and Computational Biosciences.

Bibliographic Information

Book Title: Hidden Treasures in Contemporary RNA Sequencing
Authors: Serghei Mangul, Harry Taegyun Yang, Eleazar Eskin, Noah Zaitlen
Series Title: SpringerBriefs in Computer Science
DOI: https://doi.org/10.1007/978-3-030-13973-5
Publisher: Springer Cham
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: The Author(s), under exclusive license to Springer Nature Switzerland AG 2019
Softcover ISBN: 978-3-030-13972-8Published: 13 March 2019
eBook ISBN: 978-3-030-13973-5Published: 01 March 2019
Series ISSN: 2191-5768
Series E-ISSN: 2191-5776
Edition Number: 1
Number of Pages: V, 93
Number of Illustrations: 1 b/w illustrations, 48 illustrations in colour
Topics: Computational Biology/Bioinformatics, Bioinformatics, Genetics and Genomics

Publish with us

Policies and ethics

Hidden Treasures in Contemporary RNA Sequencing

Overview

Access this book

Other ways to access

Table of contents (1 chapter)

Front Matter