Background. Understanding how organisms adapt to environmental stress remains a central challenge in biology, with implications for both ecosystem resilience and biotechnological innovation. Chlamydomonas reinhardtii is a powerful model for exploring transcriptional responses to salinity, where glycerol biosynthesis plays a critical osmoprotective role mediated by glycerol-3-phosphate dehydrogenase (GPD) enzymes. While GPD2 and GPD3 are known salt-responsive genes sharing high sequence identity and similar metabolic functions, the regulatory mechanisms governing their expression remain poorly characterized. In this study, we employed a novel integrative framework that allows the reuse of publicly available time-course RNA-seq data to address previously unexplored regulatory differences between these genes, combining network analysis and machine learning to reveal unique transcriptional regulatory programs.
Methods. Weighted Gene Co-expression Network Analysis (WGCNA) was performed on Chlamydomonas reinhardtii transcriptomes under salt stress and control conditions. Modules containing GPD genes were analyzed for functional enrichment, transcription factor associations, and cis-regulatory elements, while network robustness was assessed using a Random Forest classifier to validate gene-to-module assignment.
Results. GPD2 and GPD3 clustered into distinct co-expression modules with contrasting temporal and functional profiles. GPD2 showed early induction and co-expression with bZIP and GATA factors, whereas GPD3 displayed delayed responses and association with MYB, SBP, and ALFIN-like factors. Distinct promoter motifs supported these regulatory differences, and Random Forest validation (95.4 % accuracy) confirmed the biological coherence of the network.
Conclusions. Our integrative approach demonstrates that, despite their high sequence similarity, GPD2 and GPD3 are embedded in fundamentally distinct regulatory networks during salt stress. To our knowledge, this study represents the first application of supervised machine learning to validate WGCNA modules in microalgae, providing a rigorous framework for assessing co-expression network robustness. This work proposes a methodological strategy that integrates data reuse, network analysis, and machine learning validation to enhance the study of gene regulation in microalgae and other organisms.
If you have any questions about submitting your review, please email us at [email protected].