Analysis and Performance Assessment of the Whole Genome Bisulfite Sequencing Data Workflow: Currently Available Tools and a Practical Guide to Advance DNA Methylation Studies

Small Methods. 2022 Mar;6(3):e2101251. doi: 10.1002/smtd.202101251. Epub 2022 Jan 22.

Abstract

DNA methylation is associated with transcriptional repression, genomic imprinting, stem cell differentiation, embryonic development, and inflammation. Aberrant DNA methylation can indicate disease states, including cancer and neurological disorders. Therefore, the prevalence and location of 5-methylcytosine in the human genome is a topic of interest. Whole-genome bisulfite sequencing (WGBS) is a high-throughput method for analyzing DNA methylation. This technique involves library preparation, alignment, and quality control. Advancements in epigenetic technology have led to an increase in DNA methylation studies. This review compares the detailed experimental methodology of WGBS using accessible and up-to-date analysis tools. Practical codes for WGBS data processing are included as a general guide to assist progress in DNA methylation studies through a comprehensive case study.

Keywords: DNA methylation; alignment algorithm comparison; library preparation methods; methylation data analysis pipeline; whole genome bisulfite sequencing.

Publication types

  • Review

MeSH terms

  • CpG Islands
  • DNA Methylation* / genetics
  • Humans
  • Sulfites*
  • Workflow

Substances

  • Sulfites
  • hydrogen sulfite