Background: Polycystic ovary syndrome (PCOS) is a common endocrine disorder influenced by genetic and environmental factors. Di(2-ethylhexyl) Phthalate (DEHP), a widespread endocrine disruptor, is suspected to increase PCOS risk, yet its molecular mechanisms remain unclear.
Methods: We integrated transcriptomic data from human ovarian granulosa cells across five GEO datasets. Following batch correction, differential expression analysis, and WGCNA, we identified PCOS-related genes. DEHP targets were sourced from ChEMBL, SwissTargetPrediction, CTD, and SEA. The intersection formed the DEHP-PCOS gene set, which underwent functional enrichment and immune infiltration analysis via CIBERSORT. A machine learning framework (including Lasso, Ridge, Elastic Net, SVM, and boosting) was trained on two datasets and validated on three independent sets. SHAP analysis interpreted the optimal model to identify hub genes. Molecular Docking (MD) and 100 ns Molecular Dynamics Simulations assessed DEHP-protein interactions.
Results: We identified 247 DEHP-PCOS genes significantly enriched in pathways related to secretion regulation, extracellular matrix organization, growth factor signaling, and steroid hormone metabolism. The RF model demonstrates superior predictive performance (Area Under the Curve, AUC = 0.793). SHAP analysis prioritized five hub genes: SPNS3, GBP2, MRC1, SLCO2B1, and RAB27A. These hub genes exhibited high individual predictive accuracy (AUCs: 0.823~0.883). Immune infiltration analysis revealed significant alterations in T cells, follicular helper, and Monocytes. SPNS3 expression was positively correlated with the B-cell memory infiltration. MD confirmed stable binding between DEHP and all five hub proteins, with SPNS3 showing the strongest affinity (binding energy: -7.0 kcal/mol). Molecular Dynamics Simulations further affirmed the stability of the DEHP-SPNS3 complex.
Conclusion: Our study defined a novel gene network and established a robust toxicity prediction model, which provides a computational tool for assessing PCOS risk by linking signature gene expression associated with DEHP exposure. We identify five hub genes as robust predictive biomarkers, with SPNS3 emerging as a key mediator, and we link its expression to the immune microenvironment. These findings provide mechanistic insights into how DEHP exposure may be a risk factor for PCOS.
If you have any questions about submitting your review, please email us at [email protected].