Quantitative color profiling of digital images with earth mover's distance using the R package colordistance
- Published
- Accepted
- Subject Areas
- Biodiversity, Evolutionary Studies, Statistics
- Keywords
- R packages, color, mimicry, image processing, phylogenetics, camouflage, earth mover's distance, statistics
- Copyright
- © 2018 Weller et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
- Cite this article
- 2018. Quantitative color profiling of digital images with earth mover's distance using the R package colordistance. PeerJ Preprints 6:e26487v2 https://doi.org/10.7287/peerj.preprints.26487v2
Abstract
Biological color may be adaptive or incidental, seasonal or permanent, species- or population-specific, or modified for breeding, defense or camouflage. Although color is a hugely informative aspect of biology, quantitative color comparisons are notoriously difficult. Color comparison is limited by categorization methods, with available tools requiring either subjective classifications, or expensive equipment, software, and expertise. We present an R package for processing images of organisms (or other objects) in order to quantify color profiles, gather color trait data, and compare color palettes on the basis of color similarity and amount. The package treats image pixels as 3D coordinates in a “color space", producing a multidimensional color histogram for each image. Pairwise distances between histograms are computed using earth mover's distance, a technique borrowed from computer vision that compares histograms using transportation costs. Users choose a color space, parameters for generating color histograms, and a pairwise comparison method to produce a color distance matrix for a set of images. The package is intended as a more rigorous alternative to subjective, manual digital image analyses, not as a replacement for more advanced techniques that rely on detailed spectrophotometry methods unavailable to many users. Here, we outline the basic functions colordistance, provide guidelines for the available color spaces and quantification methods, and compare this toolkit with other available methods. The tools presented for quantitative color analysis may be applied to a broad range of questions in biology and other disciplines.
Author Comment
We have made extensive changes to the framing of the paper, clarified sections on the functionality of the colordistance package, and made it clearer where that functionality is novel and where it overlaps with other applications. We have also updated the package itself to include CIE Lab color space, which is a more robust color space when working with color vision. The update also includes new functions, including a function for combining color palettes of images (combineClusters).