Construction the first gene co-expression-based interactome in cattle

Innovation Team of Cattle Genetics and Breeding, Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
The School of Public Health, Institute for Chemical Carcinogenesis, Guangzhou Medical University, Guangzhou, China
Department of Animal Science, Washington State University, Pullman, USA
School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Queensland, Australia
DOI
10.7287/peerj.preprints.3178v1
Subject Areas
Agricultural Science, Genetics, Genomics
Keywords
Co-expression, WGCNA, network, systems biology, functional enrichment, cattle
Copyright
© 2017 Chen et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Chen Y, Liu Y, Du M, Zhang W, Gao X, Zhang L, Gao H, Xu L, Li J, Zhao M. 2017. Construction the first gene co-expression-based interactome in cattle. PeerJ Preprints 5:e3178v1

Abstract

Integrating genomic information into cattle breeding is an important approach to exploring the molecular mechanism for complex traits related to diary and meat production. To assist with genomic-based selection, a reference map of interactome is needed to fully understand genotype-phenotype relationships. To this end we constructed a co-expression analysis of 92 tissues and this represents the first systematic exploration of gene-gene relationship in cattle. By using robust WGCNA (Weighted Gene Correlation Network Analysis), we described the gene co-expression network of 13,405 protein-coding genes from the cattle genome. Using the 5,000 genes with majority variations in expression across 92 tissues, we compiled a network with 72,306 co-associations and that provides functional insights into thousands of poorly characterized proteins. Further module identifications found 55 highly organized functional clusters representing diverse cellular activities. To demonstrate the re-use of our interaction for functional genomics analysis, we extracted a sub-network associated with DNA binding genes in cattle. The subnetwork was enriched within regulation of transcription from RNA polymerase II promoter representing central cellular functions. In addition, we identified 28 novel linker genes associated with more than 100 DNA binding genes. Our WGCNA-based co-expression network reconstruction will be a valuable resource for exploring the molecular mechanisms of incompletely characterized proteins and for elucidating larger-scale patterns of functional modulization in the cattle genome.

Author Comment

This is a submission to PeerJ for review.

Supplemental Information

Table S1. The expression profile for the top 5,000 most variant genes across 92 tissue samples

DOI: 10.7287/peerj.preprints.3178v1/supp-1

Table S2. The top five gene modules with most genes in WGCNA analysis

DOI: 10.7287/peerj.preprints.3178v1/supp-2

Table S3. The eigengenes for the gene modules from WGCNA analysis

DOI: 10.7287/peerj.preprints.3178v1/supp-3

Table S4. The number of connections for all the genes in the co-expression network from WGCNA

DOI: 10.7287/peerj.preprints.3178v1/supp-4

Table S5. The gene related to DNA binding in cattle

DOI: 10.7287/peerj.preprints.3178v1/supp-5

Table S6. The gene types for the extracted sub-network related to DNA binding

DOI: 10.7287/peerj.preprints.3178v1/supp-6