Analyzing sex imbalance in EGA and dbGaP biological databases: Recommendations for better practices
Precision medicine aims at tailoring treatments to individual patient’s characteristics. In this regard, recognizing the significance of sex and gender becomes indispensable for meeting the distinct healthcare needs of diverse populations. To this end, continuing a trend of improving data quality observed since 2014, the European Genome-phenome Archive (EGA) established a policy in 2018 that mandates data providers to declare the sex of donor samples, aiming to enhance data accuracy and prevent imbalance in sex classification. We analyzed sex classification imbalance in human data from EGA and the U.S. counterpart, the database of Genotypes and Phenotypes (dbGaP). Our findings show a significant decrease in samples classified as unknown in EGA, potentially promoting better sex reporting during data collection. Based on our findings, we raise awareness of sample imbalance problems and provide a list of recommendations for enhancing biomedical research practices.
RUIZ SERRA Victoria;
BUSLÓN Nataly;
PHILIPPE Olivier R.;
SABY Diego;
MORALES María;
PONTES Camila;
MUÑOZ ANDIRKÓ Alejandro;
HOLLIDAY Gemma L.;
JENÉ Aina;
MOLDES Mauricio;
RAMBLA Jordi;
VALENCIA Alfonso;
REMENTERÍA NUÑEZ Maria Jose;
CORTÉS Atia;
CIRILLO Davide;
2024-11-15
CELL PRESS
JRC139060
2589-0042 (online),
https://www.sciencedirect.com/science/article/pii/S258900422402056X,
https://publications.jrc.ec.europa.eu/repository/handle/JRC139060,
10.1016/j.isci.2024.110831 (online),
Additional supporting files
File name | Description | File type | |