Ensuring Model Fairness via Stratified Training: TP53 Mutation Prediction with Estrogen Receptor Stratification in Breast Histopathology

Nikos Tsiknakis; Kang Wang; Dimitrios Salgkamis; Evangelos Tzoras; Georgios C Manikis; Emmanouil Sifakis; Jonas Bergh; Ioannis Zerdes; Kostas Marias; Alexios Matikas; Theodoros Foukakis

doi:10.1109/EMBC53108.2024.10782012

Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul:2024:1-5. doi: 10.1109/EMBC53108.2024.10782012.

PMID: 40039878

Abstract

Interest in AI models that use medical images for decision support has grown sharply in recent years. However, most published studies have not tested model robustness against dataset-related biases and imbalanced variables. For example, TP53 mutations are more prevalent in Estrogen Receptor (ER)-negative breast cancer, whereas most ER-positive tumors are not mutated; nevertheless, published models have been developed on the entirety of the available data without testing for such intrinsic biases, which can lead to overfitting. In this study we show that models trained for TP53 mutation prediction overfit on ER status, and that stratifying training by ER status benefits all subgroups while reducing bias and increasing generalizability and fairness. (Implementation: https://github.com/tsikup/er-stratified-training-tp53-prediction).
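
The confounding described in the abstract can be illustrated with a small sketch. It is not taken from the linked repository, and the abstract does not specify the authors' stratification or evaluation procedure; the cohort below is synthetic, and the function names (auc, auc_by_stratum) are hypothetical. The sketch assumes a "shortcut" model whose score depends only on ER status, and compares the pooled AUC with the AUC computed inside each ER stratum.

```python
from collections import defaultdict


def auc(labels, scores):
    """Mann-Whitney AUC; ties count as 0.5. Returns None if a class is absent."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        return None
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def auc_by_stratum(labels, scores, strata):
    """Compute the AUC separately within each stratum (here: ER status)."""
    groups = defaultdict(lambda: ([], []))
    for y, s, g in zip(labels, scores, strata):
        groups[g][0].append(y)
        groups[g][1].append(s)
    return {g: auc(ys, ss) for g, (ys, ss) in groups.items()}


# Synthetic toy cohort (illustrative numbers only, NOT data from the paper):
# ER-negative: 3 TP53-mutated, 1 wild-type; ER-positive: 1 mutated, 3 wild-type.
er = ["neg"] * 4 + ["pos"] * 4
tp53 = [1, 1, 1, 0, 1, 0, 0, 0]

# Assumed shortcut model: the score is a function of ER status alone.
scores = [0.9 if e == "neg" else 0.1 for e in er]

overall = auc(tp53, scores)
per_er = auc_by_stratum(tp53, scores, er)
print("pooled AUC:", overall)
print("AUC within ER strata:", per_er)
```

In this toy setting the pooled AUC is above chance even though the scores carry no TP53 information within either ER subgroup, which is why reporting performance per ER stratum is a useful check for this kind of bias.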

MeSH terms

Breast Neoplasms* / genetics
Breast Neoplasms* / metabolism
Breast Neoplasms* / pathology
Female
Humans
Mutation*
Receptors, Estrogen* / metabolism
Tumor Suppressor Protein p53* / genetics

Substances

Tumor Suppressor Protein p53
Receptors, Estrogen
TP53 protein, human