Mapping MAVE data for use in human genomics applications

Genome Biol. 2025 Jun 25;26(1):179. doi: 10.1186/s13059-025-03647-x.

Abstract

Background: Experimental data from functional assays have a critical role in interpreting the impact of genetic variants. Assay data must be unambiguously mapped to a reference genome to make it accessible, but it is often reported relative to assay-specific sequences, complicating downstream use and integration of variant data across resources. To make multiplexed assays of variant effect (MAVE) data more broadly available to the research and clinical communities, the Atlas of Variant Effects Alliance mapped MAVE data from the MaveDB community database to human reference sequences, creating an extensive set of machine-readable homology mappings that are incorporated into widely used human genomics applications.

Results: Here, we map approximately 9.0 million individual protein and nucleotide variants in MaveDB to the human genome, describing the examined variants with respect to human reference sequences while preserving the data provenance of the original MAVE sequences. We then disseminate the results to major genomic resources including the Genomics 2 Proteins Portal, UCSC Genome Browser, Ensembl Variant Effect Predictor, and DECIPHER platform. Within these applications, MAVE variants can now be visualized and integrated with other relevant clinical and biological data, making additional knowledge available when performing variant interpretation and conducting other research activities.

Conclusions: Mapping MAVE variants to human reference sequences and sharing the mapped dataset with several key human genomics applications enables a new and diverse set of applications for MAVE data. This study provides increased access to functional data that can assist in clinical variant interpretation pipelines and enable biomedical research and discovery.

Keywords: Deep mutational scanning; Functional assay; Genomic medicine; Genomics; Global Alliance for Genomics and Health; Massively parallel reporter assays; Multiplexed assays of variant effect; Variation representation specification.

MeSH terms

  • Chromosome Mapping* / methods
  • Databases, Genetic
  • Genetic Variation*
  • Genome, Human*
  • Genomics* / methods
  • Humans