Case-sibling studies that acknowledge unstudied parents and permit the inclusion of unmatched individuals

Int J Epidemiol. 2013 Feb;42(1):298-307. doi: 10.1093/ije/dys212. Epub 2012 Dec 17.

Abstract

Background: Family-based designs enable assessment of genetic associations without bias from population stratification. When parents are not readily available - especially for diseases with onset later in life - the case-sibling design, where each case is matched with one or more unaffected siblings, is useful. Analysis typically accounts for within-family dependencies by using conditional logistic regression (CLR).

Methods: We consider an alternative to CLR that treats each case-sibling set as a nuclear family with both parents missing by design. One can carry out maximum likelihood analysis by using the Expectation-Maximization (EM) algorithm to account for missing parental genotypes. We compare conditional logistic regression and the missing-parents approach under several risk scenarios.

Results: We show that the missing-parents approach improves power when some families have more than one unaffected sibling and that under weak assumptions the approach permits the incorporation of supplemental cases who have no sibling available and supplemental controls whose case sibling is unavailable (e.g., due to disability or death).

Conclusion: The missing-parents approach offers both improved statistical efficiency and asymptotically unbiased estimation for genotype relative risks and genotype-by-exposure interaction parameters.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Chromosome Mapping
  • Gene-Environment Interaction
  • Genetic Predisposition to Disease*
  • Genotype*
  • Humans
  • Logistic Models
  • Models, Genetic
  • Nuclear Family
  • Parents*
  • Research Design
  • Siblings*