aer2vec: Distributed Representations of Adverse Event Reporting System Data as a Means to Identify Drug/Side-Effect Associations

AMIA Annu Symp Proc. 2020 Mar 4:2019:717-726. eCollection 2019.

Abstract

Adverse event report (AER) data are a key source of signal for post marketing drug surveillance. The standard methodology to analyze AER data applies disproportionality metrics, which estimate the strength of drug/side-effect associations from discrete counts of their occurrence at report level. However, in other domains, improvements in predictive modeling accuracy have been obtained through representation learning, where discrete features are replaced by distributed representations learned from unlabeled data. This paper describes aer2vec, a novel representational approach for AER data in which concept embeddings emerge from neural networks trained to predict drug/side-effect co-occurrence. Trained models are evaluated for their utility in identifying drug/side-effect relationships, with improvements over disproportionality metrics in most cases. In addition, we evaluate the utility of an otherwise-untapped resource in the Food and Drug Administration (FDA) AER system - reporter designations of suspected causality - and find that incorporating this information enhances performance of all models evaluated.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adverse Drug Reaction Reporting Systems*
  • Databases, Factual
  • Drug-Related Side Effects and Adverse Reactions / diagnosis*
  • Humans
  • Models, Theoretical*
  • Neural Networks, Computer
  • Product Surveillance, Postmarketing*
  • United States
  • United States Food and Drug Administration