Optimizing RNA-Seq Mapping with STAR

Methods Mol Biol. 2016:1415:245-62. doi: 10.1007/978-1-4939-3572-7_13.

Abstract

Recent advances in high-throughput sequencing technology made it possible to probe the cell transcriptomes by generating hundreds of millions of short reads which represent the fragments of the transcribed RNA molecules. The first and the most crucial task in the RNA-seq data analysis is mapping of the reads to the reference genome. STAR (Spliced Transcripts Alignment to a Reference) is an RNA-seq mapper that performs highly accurate spliced sequence alignment at an ultrafast speed. STAR alignment algorithm can be controlled by many user-defined parameters. Here, we describe the most important STAR options and parameters, as well as best practices for achieving the maximum mapping accuracy and speed.

Keywords: RNA-seq; Reads mapping; STAR; Sequence alignment; Spliced alignment; Transcriptome.

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Gene Expression Profiling / methods*
  • Humans
  • RNA Splicing
  • Sequence Alignment / methods
  • Sequence Analysis, RNA / methods*
  • User-Computer Interface