From raw ion mobility measurements to disease classification: a comparison of analysis processes

Salome Horsch; Dominik Kopczynski; Jörg Ingo Baumbach; Jörg Rahnenführer; Sven Rahmann

doi:10.7287/peerj.preprints.1294v1

Javascript is disabled in your browser. Please enable Javascript to view PeerJ.

NOT PEER-REVIEWED

"PeerJ Preprints" is a venue for early communication or feedback before peer review. Data may be preliminary.

German Conference on Bioinformatics 2015 Collection thumbnail

Highlighted in German Conference on Bioinformatics 2015 Collection

From raw ion mobility measurements to disease classification: a comparison of analysis processes

Salome Horsch ¹, Dominik Kopczynski², Jörg Ingo Baumbach³, Jörg Rahnenführer¹, Sven Rahmann^2,4

1 Department of Statistics, TU Dortmund University, Dortmund, Germany

2 Bioinformatics, Computer Science XI, TU Dortmund University, Dortmund, Germany

3 Faculty of Applied Chemistry, Reutlingen University, Reutlingen, Germany

4 Genome Informatics, Institute of Human Genetics, University Hospital Essen, University of Duisburg-Essen, Essen, Germany

DOI: 10.7287/peerj.preprints.1294v1

Published: 2015-08-11
Accepted: 2015-08-11

Subject Areas: Bioinformatics, Respiratory Medicine, Statistics, Computational Science
Keywords: Ion mobility spectrometry, peak detection, clustering, classification, analysis process

Copyright: © 2015 Horsch et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.

Cite this article: Horsch S, Kopczynski D, Baumbach JI, Rahnenführer J, Rahmann S. 2015. From raw ion mobility measurements to disease classification: a comparison of analysis processes. PeerJ PrePrints 3:e1294v1 https://doi.org/10.7287/peerj.preprints.1294v1

Abstract

Ion mobility spectrometry (IMS) is a technology for the detection of volatile compounds in the air of exhaled breath that is increasingly used in medical applications. One major goal is to classify patients into disease groups, for example diseased versus healthy, from simple breath samples. Raw IMS measurements are data matrices in which peak regions representing the compounds have to be identified and quantified. A typical analysis process consists of pre-processing and peak detection in single experiments, peak clustering to obtain consensus peaks across several experiments, and classification of samples based on the resulting multivariate peak intensities. Recently several automated algorithms for peak detection and peak clustering have been introduced, in order to overcome the current need for human-based analysis that is slow, subjective and sometimes not reproducible. We present an unbiased comparison of a multitude of combinations of peak processing and multivariate classification algorithms on a disease dataset. The specific combination of the algorithms for the different analysis steps determines the classification accuracy, with the encouraging result that certain fully-automated combinations perform even better than current manual approaches.

Author Comment

The two last authors, Jörg Rahnenführer and Sven Rahmann, contributed equally. This work has been presented at the German Conference on Bioinformatics 2015.

Add your feedback

Before adding feedback, consider if it can be asked as a question instead, and if so then use the Question tab. Pointing out typos is fine, but authors are encouraged to accept only substantially helpful feedback.

Some Markdown syntax is allowed: _italic_ **bold** ^superscript^ ~subscript~ %%blockquote%% [link text](link URL)

By posting this you agree to PeerJ's commenting policies

Questions

Ask a question

Learn more about Q&A

Links

Add a link

Content

Alert

Just enter your email

Add your feedback

Top referrals unique visitors

Share this preprint

Metrics

Download article