Preprints (not yet peer-reviewed)

795 downloads
2,416 views

Over the last three decades data has become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis have struggled to keep up. In April 2014 we launched a program of nine courses,...

["Science and Medical Education","Statistics","Human-Computer Interaction","Computational Science"]
doi:10.7287/peerj.preprints.3195v1
45 downloads
102 views

Single cell studies increasing reveal myriad cellular subtypes beyond those postulated or observed through optical and fluorescence microscopy as well as DNA sequencing studies. While gene sequencing at the single cell level offer a path towards illuminating, in...

["Biochemistry","Biotechnology","Cell Biology","Microbiology","Molecular Biology"]
doi:10.7287/peerj.preprints.3193v1
356 downloads
1,597 views

Computers are a central tool in the research process, enabling complex and large scale data analysis. As computer-based research has increased in complexity, so have the challenges of ensuring that this research is reproducible. To address this challenge, we review...

["Anthropology","Computational Biology","Science and Medical Education","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.3192v1
89 downloads
113 views

Empirical evidence is important to develop effective conservation policies. The documentation and assessment of the status and threats towards a species and its habitat are essential steps toward developing appropriate policies to protect its population and mitigate...

["Biodiversity","Conservation Biology"]
doi:10.7287/peerj.preprints.3191v1
416 downloads
1,660 views

Forecasting is a common data science task that helps organizations with capacity planning, goal setting, and anomaly detection. Despite its importance, there are serious challenges associated with producing reliable and high quality forecasts — especially when...

["Data Science"]
doi:10.7287/peerj.preprints.3190v1
32 downloads
83 views

Biological databases are of great importance for managing biological research data. Building databases has been a code-based process that requires integrative coding skills of different languages. Herein, we present a code-free pipeline that helps biologists to...

["Bioinformatics","Cardiology","Medical Genetics","Data Science"]
doi:10.7287/peerj.preprints.3189v1
260 downloads
1,076 views

R has always provided an application programming interface (API) for extensions. Based on the C language, it uses a number of macros and other low-level constructs to exchange data structures between the R process and any dynamically-loaded component modules authors...

["Data Science"]
doi:10.7287/peerj.preprints.3188v1
51 downloads
158 views

The USA National Center for Biotechnology Information (NCBI) is one of the world’s most important sources of biological information. NCBI databases like PubMed and GenBank contain millions of records describing bibliographic, genetic, genomic, and medical data....

["Bioinformatics","Genetics","Genomics","Molecular Biology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.3179v2
38 downloads
139 views

Background. Carrot is a multi-nutritional food source. It is an important root vegetable, rich in natural bioactive compounds with health-promoting properties, such as antioxidants that have anti-carcinogenic properties. Aim. This review summarises the occurrences...

["Agricultural Science","Biochemistry","Food Science and Technology","Plant Science","Nutrition"]
doi:10.7287/peerj.preprints.3187v1
28 downloads
135 views

The regulatory code that determines whether and how a given genetic variant affects the function of a regulatory element remains poorly understood for most classes of regulatory variation. Indeed the large majority of bioinformatics tools have been developed to...

["Bioinformatics","Computational Biology","Computational Science","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.3185v1
35 downloads
180 views

DNA analysis of predator feces using high-throughput amplicon sequencing (HTS) enhances our understanding of predator-prey interactions. However, conclusions drawn from this technique are constrained by biases that occur in multiple steps of the HTS workflow. To...

["Bioinformatics","Ecology","Entomology","Molecular Biology","Zoology"]
doi:10.7287/peerj.preprints.3184v1
217 downloads
911 views

At Airbnb, R has been amongst the most popular tools for doing data science in many different contexts, including generating product insights, interpreting experiments, and building predictive models. In a recent survey of the Airbnb team, 73% of Data Scientists...

["Data Science","Programming Languages"]
doi:10.7287/peerj.preprints.3182v1
589 downloads
2,427 views

Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this paper offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses....

["Computational Biology","Science and Medical Education","Statistics","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.3183v1
131 downloads
566 views

Modern statistics is fundamentally a computational discipline, but too often this fact is not reflected in our statistics curricula. With the rise of big data and data science it has become increasingly clear that students both want, expect, and need explicit training...

["Computer Education","Data Science","Graphics","Scientific Computing and Simulation","Software Engineering"]
doi:10.7287/peerj.preprints.3181v1
248 downloads
601 views

Studies investigating changes in community composition in response to recent global warming are mostly restricted to one-dimensional (e.g. elevational or latitudinal) gradients, whereas species movements are in reality three dimensional (i.e. elevational, latitudinal...

["Biodiversity","Climate Change Biology","Freshwater Biology"]
doi:10.7287/peerj.preprints.1034v2
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

Submissions today can be approved by Editorial Staff and online in 24 hours.

Establish precedent. Solicit feedback. Publish updates.

Top subject areas - Preprints

Top subject areas - People

View all subject areas