Abstract
A bias in health research toward understanding diseases as they present in men can have a grave impact on the health of women. This paper reports on a conceptual review of the literature that used machine learning or natural language processing (NLP) techniques to interrogate big data in order to identify sex-specific health disparities. We searched Ovid MEDLINE, Embase, and PsycINFO in October 2021 using synonyms and indexing terms for (1) “women” or “men” or “sex,” (2) “big data” or “artificial intelligence” or “NLP,” and (3) “disparities” or “differences.” From 902 records, 22 studies met the inclusion criteria and were analyzed. The results demonstrate that inclusion by sex is inconsistent and often unreported, although men are included in the reviewed studies disproportionately less often than women. Even though AI and NLP techniques are widely applied in health research, few studies use them to take advantage of unstructured text to investigate sex-related differences or disparities. Researchers are increasingly aware of sex-based data bias, but progress toward correcting it is slow. We reflect on best practices for using big data analytics to address sex-specific biases in understanding the etiology, diagnosis, and prognosis of diseases.
Publisher
Cold Spring Harbor Laboratory