Reusing microarray clinical data from a complex disease with bioinformatics tool

Department of Sciences, University of Basilicata, Potenza, Italy
Institute of Food Sciences, Italian National Research Council, Avellino, Italy
Department of Computer Science and Technology, Computer Laboratory, University of Cambridge, Cambridge, United Kingdom
DOI
10.7287/peerj.preprints.27398v1
Subject Areas
Bioinformatics, Computational Biology, Pathology, Statistics, Data Science
Keywords
reusing data, reproducible research, coeliac disease comorbidities, clinical bioinformatics, gene set enrichment analysis, semantic similarity, microarray dataset, complex disease
Copyright
© 2018 Del Prete et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Del Prete E, Facchiano A, Liò P. 2018. Reusing microarray clinical data from a complex disease with bioinformatics tool. PeerJ Preprints 6:e27398v1

Abstract

Clinical bioinformatics, translational bioinformatics and personalised medicine are connected by the common task of analysing and integrating clinical data and results, in order to find important biomarkers related to pathologies and facilitate their prediction, diagnosis and treatment. New technologies provides the possibility to have more and more clinical data available in online databases. This data can be reused for studying complex disease from novel point of views. This work show how it is possible considering online microarray data from coeliac disease and some of its comorbidities, combining both the data and the results. The main goal is the extraction of common evidences among the selected pathologies, from genes to different kinds of functional annotation, showing which biological processes are more involved in these autoimmune disorders and quantifying the similarity between coeliac disease and its comorbidities. The pipeline of the work is developed in R language, and it is semi-automated. Methodologically, the advantage of this work is the possibility of performing the entire analysis starting from a different pathology; clinically, scientists can have the possibility of using data already published to highlight old and new evidences, with the possibility of improve the knowledge on a complex disease according to the availability of new microarray data.

Author Comment

This is an abstract which has been accepted for the BBCC2018 Conference