Preprints (not yet peer-reviewed)

17 downloads
88 views

In recent years, the pharmaceutical industry has been confronted with rising R&D costs paired with decreasing productivity. Attrition rates for new molecules are tremendous, with a substantial number of molecules failing in an advanced stage of development. Repositioning...

["Bioinformatics","Drugs and Devices","Computational Science","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.27002v1
34 downloads
201 views

Bacterial small non-coding RNAs (sRNAs) are involved in the control of several cellular processes. Hundreds of putative sRNAs have been identified in many bacterial species through RNA sequencing. The existence of putative sRNAs is usually validated by Northern...

["Bioinformatics","Microbiology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.26974v1
64 downloads
119 views

To accelerate scientific progress on remote tree classification, as well as biodiversity and ecology sampling, the National Institute of Standards and Technology created a community-based competition where scientists were invited to contribute informatics methods for...

["Ecology","Data Mining and Machine Learning","Data Science","Forestry","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26971v1
39 downloads
124 views

Background. Biogeographers assess how species distributions and abundances affect the structure, function, and composition of ecosystems. Yet we face a major challenge: it is difficult to precisely map species across landscapes. Novel Earth observations could obviate...

["Biogeography","Ecology","Data Mining and Machine Learning","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26972v1
237 downloads
494 views

Ecology has reached the point where data science competitions, in which multiple groups solve the same problem using the same data by different methods, will be productive for advancing quantitative methods for tasks such as species identification from remote sensing...

["Ecology","Data Mining and Machine Learning","Data Science","Forestry","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26966v1
641 downloads
894 views

Random forest and similar machine learning techniques are already used to generate spatial predictions, but the spatial location of points (geography) is often ignored in the modeling process. Spatial autocorrelation, especially if still present in the cross-validation...

["Biogeography","Soil Science","Computational Science","Data Mining and Machine Learning","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26693v2
140 downloads
238 views

Bitcoin is a cryptocurrency system that has been rapidly adopted due to its anonymity and decentralized properties. Blockchain is the underpinning technology that maintains the Bitcoin transaction ledger. The blockchain network operates in a state of consensus,...

["Adaptive and Self-Organizing Systems","Autonomous Systems","Cryptography","Data Mining and Machine Learning","Network Science and Online Social Networks"]
doi:10.7287/peerj.preprints.26911v1
112 downloads
328 views

Clustering is a method for finding groups in data, and many related methods have long been studied. Bayesian nonparametrics is a branch of statistics that can handle models with infinitely many parameters. The Chinese restaurant process is used...

["Data Mining and Machine Learning","Data Science","Software Engineering"]
doi:10.7287/peerj.preprints.26533v2
152 downloads
351 views

With unprecedented growth in the biomedical literature, keeping up to date with new developments presents an immense challenge. Publications are often studied in isolation from the established literature, with interpretation being subjective and often introducing...

["Bioinformatics","Algorithms and Analysis of Algorithms","Data Mining and Machine Learning","Software Engineering","Visual Analytics"]
doi:10.7287/peerj.preprints.26869v1
205 downloads
1,010 views

A new layer of complexity, constituted of networks of information token recurrence, has been identified in socio-technical systems such as the Wikipedia online community and the Zooniverse citizen science platform. The identification of this complexity reveals...

["Data Mining and Machine Learning","Data Science","Network Science and Online Social Networks","Social Computing","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2789v2
40 downloads
264 views

We present PromoterPredict, a dynamic multiple regression approach to predict the strength of Escherichia coli promoters binding the σ70 factor of RNA polymerase. σ70 promoters are ubiquitously used in recombinant DNA technology, but characterizing their strength...

["Bioengineering","Bioinformatics","Biotechnology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.26759v2
212 downloads
438 views

High-throughput sequencing of environmental DNA (eDNA) offers a simple and cost-effective solution for marine biodiversity assessments. Yet several analytical challenges remain, including the incorporation of statistical inference in the assignment of taxonomic...

["Biodiversity","Bioinformatics","Computational Biology","Marine Biology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.26812v1
395 downloads
707 views

Potential Natural Vegetation (PNV) is the vegetation cover in equilibrium with climate that would exist at a given location if it were not impacted by human activities. PNV is useful for raising public awareness about land degradation and for estimating land potential. This...

["Biogeography","Computational Biology","Plant Science","Data Mining and Machine Learning","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26811v1
156 downloads
468 views

The advent of next-generation sequencing has resulted in transcriptome-based approaches to investigate functionally significant biological components in a variety of non-model organisms. This has resulted in the area of “venomics”: a rapidly growing field using...

["Bioinformatics","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.26733v1
97 downloads
204 views

There are numerous models for affective state classification and social behavior description. Despite proving their reliability, some of these classifications turn out to be redundant, while others are insufficient for certain practical purposes. In this paper we...

["Psychiatry and Psychology","Science and Medical Education","Human-Computer Interaction","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.26729v1