Accurate assembly of transcripts through phase-preserving graph decomposition

Nat Biotechnol. 2017 Dec;35(12):1167-1169. doi: 10.1038/nbt.4020. Epub 2017 Nov 13.

Abstract

We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.

MeSH terms

  • Algorithms*
  • Humans
  • RNA* / analysis
  • RNA* / genetics
  • RNA* / metabolism
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*
  • Software*

Substances

  • RNA