cual-id: globally unique, correctable, and human-friendly sample identifiers for comparative -omics studies
- Published
- Accepted
- Subject Areas
- Bioinformatics, Biotechnology, Ecology, Genomics, Microbiology
- Keywords
- microbiome, bioinformatics, microbial ecology, genomics, metagenomics
- Copyright
- © 2015 Chase et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
- Cite this article
- 2015. cual-id: globally unique, correctable, and human-friendly sample identifiers for comparative -omics studies. PeerJ PrePrints 3:e1431v1 https://doi.org/10.7287/peerj.preprints.1431v1
Abstract
The number of samples in high-throughput comparative “omics” studies is increasing rapidly due to the declining experimental costs. To keep sample data and metadata manageable, and ensure the integrity of scientific results as the scale of these projects continue to increase, it is essential that we transition to better designed sample identifiers. Ideally, sample identifiers will be: globally unique across projects, project teams and institutions; be short to facilitate manual transcription; be correctable with respect to common types of transcription errors; be opaque, meaning they do not contain information about the samples; and be compatible with existing standards. We present cual-id, a lightweight command line tool that creates, or mints, sample identifiers that meet these criteria without reliance on centralized infrastructure. cual-id allows users to assign Universally Unique Identifiers, or UUIDs, that are globally unique to their samples. UUIDs are too long to be conveniently written on sampling materials such as swabs or microcentrifuge tubes however, so cual-id additionally generates human-friendly 4-12 character identifiers (CualIDs) that map to their UUIDs and are unique within a project. CualIDs are used by humans when they are manually writing or entering identifiers, while the longer UUIDs are used by computers to unambiguously reference a sample. The adoption of identifiers that are globally unique, correctable, and easily hand-written or manually entered into a computer will be a major step forward for sample tracking in comparative -omics studies within and across projects and project teams.
Author Comment
This manuscript has been submitted for peer-review at ASM mSystems.
Supplemental Information
Figure S1: Example of cual-id PDF output
The Code 128 barcodes decode to the identifiers listed under each barcode (free smartphone apps are available for decoding barcodes, and can be tested by scanning the barcodes in this PDF). This PDF is formatted for printing on Electronic Imaging Materials CryoLabel® sticker sheets (#80402).