Researchers are increasingly using machine learning to study physiological markers of emotion. We evaluated the promises and limitations of this approach via a big team science competition. Twelve teams competed to predict self-reported affective experiences from a multimodal set of peripheral nervous system measures. Models were trained and tested in multiple ways, with data divided by participant, targeted emotion, induction, and time. In 100% of tests, teams outperformed baseline models that made random predictions. In 46% of tests, teams also outperformed baseline models that relied on the simple average of ratings from the training datasets. More notably, the results uncovered a methodological challenge: multiplicative constraints on generalizability. Inferences about the accuracy and theoretical implications of machine learning efforts depended not only on their architecture but also on how they were trained, tested, and evaluated. For example, some teams performed better when tested on observations from the same (vs. different) participants seen during training. Such results could be interpreted as evidence against claims of universality, but that conclusion would be premature because other teams exhibited the opposite pattern. Taken together, the results illustrate how big team science can be leveraged to understand the promises and limitations of machine learning methods in affective science and beyond.
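A minimal sketch of the kind of evaluation contrast described above: an across-participant split (test folds contain only unseen participants) versus a split that mixes observations from the same participants, plus a baseline that predicts the simple average of training-set ratings. The data shapes, feature counts, model choice, and rating scale here are illustrative assumptions, not the competition's actual pipeline.

    # Illustrative sketch only: not the competition's models or data.
    import numpy as np
    from sklearn.model_selection import GroupKFold, KFold, cross_val_score
    from sklearn.ensemble import RandomForestRegressor
    from sklearn.dummy import DummyRegressor

    rng = np.random.default_rng(0)
    n_obs, n_features = 600, 8                        # hypothetical windowed physiology features
    X = rng.normal(size=(n_obs, n_features))
    y = rng.uniform(1, 9, size=n_obs)                 # hypothetical self-reported affect ratings
    participants = rng.integers(0, 30, size=n_obs)    # participant ID for each observation

    model = RandomForestRegressor(n_estimators=200, random_state=0)
    mean_baseline = DummyRegressor(strategy="mean")   # predicts the training-set average rating

    # Across-participant generalization: test folds contain only participants unseen in training.
    across = cross_val_score(model, X, y, groups=participants,
                             cv=GroupKFold(n_splits=5),
                             scoring="neg_root_mean_squared_error")
    # Within-participant evaluation: the same participants can appear in train and test folds.
    within = cross_val_score(model, X, y,
                             cv=KFold(n_splits=5, shuffle=True, random_state=0),
                             scoring="neg_root_mean_squared_error")
    # Mean-rating baseline evaluated under the across-participant scheme.
    baseline = cross_val_score(mean_baseline, X, y, groups=participants,
                               cv=GroupKFold(n_splits=5),
                               scoring="neg_root_mean_squared_error")

    print(f"RMSE across participants:  {-across.mean():.2f}")
    print(f"RMSE within participants:  {-within.mean():.2f}")
    print(f"RMSE mean-rating baseline: {-baseline.mean():.2f}")

Under schemes like these, the same model architecture can look strong or weak depending on which split and which baseline define the test, which is the "multiplicative constraints on generalizability" point made in the abstract.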
Keywords: affective computing; big team science; emotion; generalizability; machine learning; physiology.