Preprints (not yet peer-reviewed)

17 downloads
88 views

In recent years, the pharmaceutical industry has been confronted with rising R&D costs paired with decreasing productivity. Attrition rates for new molecules are tremendous, with a substantial number of molecules failing in an advanced stage of development. Repositioning...

["Bioinformatics","Drugs and Devices","Computational Science","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.27002v1
64 downloads
119 views

To accelerate scientific progress on remote tree classification—as well as biodiversity and ecology sampling—The National Institute of Science and Technology created a community-based competition where scientists were invited to contribute informatics methods for...

["Ecology","Data Mining and Machine Learning","Data Science","Forestry","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26971v1
237 downloads
494 views

Ecology has reached the point where data science competitions, in which multiple groups solve the same problem using the same data by different methods, will be productive for advancing quantitative methods for tasks such as species identification from remote sensing...

["Ecology","Data Mining and Machine Learning","Data Science","Forestry","Spatial and Geographic Information Science"]
doi:10.7287/peerj.preprints.26966v1
201 downloads
411 views

We propose a simple neural network model which can learn relation between sentences by passing their representations obtained from Long Short Term Memory(LSTM) through a Relation Network. The Relation Network module tries to extract similarity between multiple...

["Artificial Intelligence","Data Science","Natural Language and Speech"]
doi:10.7287/peerj.preprints.26847v2
139 downloads
405 views

A problem facing healthcare record systems throughout the world is how to share the medical data with more stakeholders for various purposes without sacrificing data privacy and integrity. Blockchain, operating in a state of consensus, is the underpinning technology...

["Computer Networks and Communications","Cryptography","Data Science","Security and Privacy"]
doi:10.7287/peerj.preprints.26942v1
57 downloads
104 views

Flow cytometry (FCM) is a powerful analytical tool that is widely used worldwide, as it allows the depiction of the innate complexity of a vast range of biological systems in few seconds. It is a technique based on the spectroscopic properties of suspended particles...

["Bioinformatics","Computational Biology","Ecology","Ecosystem Science","Data Science"]
doi:10.7287/peerj.preprints.26934v1
141 downloads
471 views

Statistical books are an opportunity for accessing relatively deeper insights into statistics and software even outside the introductory classroom setting. There are however many resources available to the practitioner in addition to the traditional text model....

["Bioinformatics","Computer Education","Data Science","Programming Languages"]
doi:10.7287/peerj.preprints.26924v1
59 downloads
148 views

The leachate generated by the direct disposal of solid waste into the soil of terrace number one of the Romerillos sanitary landfill site contaminates the environment. This is visible from an average distance of up to 20m with the presence of dry vegetation cover...

["Data Science","Environmental Impacts"]
doi:10.7287/peerj.preprints.26907v2
664 downloads
619 views

The rapid increase in volume and complexity of biomedical data requires changes in research, communication, training, and clinical practices. This includes learning how to effectively integrate automated analysis with high-data-density visualizations that clearly...

["Computational Biology","Science and Medical Education","Human-Computer Interaction","Data Science"]
doi:10.7287/peerj.preprints.26896v1
112 downloads
328 views

Clustering is a scientific method which finds the clusters of data and many related methods are traditionally researched for long terms. Bayesian nonparametrics is statistics which can treat models having infinite parameters. Chinese restaurant process is used...

["Data Mining and Machine Learning","Data Science","Software Engineering"]
doi:10.7287/peerj.preprints.26533v2
901 downloads
3,437 views

There is a massive crisis of confidence in statistical inference, which has largely been attributed to overemphasis on and abuse of hypothesis testing. Much of the abuse stems from failure to recognize that statistical tests not only test hypotheses, but countless...

["Science and Medical Education","Science Policy","Statistics","Data Science"]
doi:10.7287/peerj.preprints.26857v1
205 downloads
1,010 views

A new layer of complexity, constituted of networks of information token recurrence, has been identified in socio-technical systems such as the Wikipedia online community and the Zooniverse citizen science platform. The identification of this complexity reveals...

["Data Mining and Machine Learning","Data Science","Network Science and Online Social Networks","Social Computing","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2789v2
238 downloads
510 views

Many data include time or have longitudinal dimensionalilty. When these data include an index of time, i.e. measures at regular or periodically successive intervals, statistics that use time sequencing in some capacity are appropriate. Consequently, investing time...

["Bioinformatics","Computational Biology","Data Science"]
doi:10.7287/peerj.preprints.26842v1
138 downloads
291 views

Identifying and counting individual fish on videos is a crucial task to cost-effectively monitor marine biodiversity, but it remains a difficult and time-consuming task. In this paper, we present a method to assist the automated identification of fish species on...

["Biodiversity","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.26818v1
57 downloads
165 views

To better understand the species latitudinal and depth gradients in the NW Pacific and its adjacent Arctic Ocean, distribution records of all marine species were extracted from the Ocean Biogeographic Information System (OBIS) and Global Biodiversity Information...

["Biodiversity","Biogeography","Marine Biology","Data Science"]
doi:10.7287/peerj.preprints.26756v1
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

Submissions today can be approved by Editorial Staff and online in 24 hours.

Establish precedent. Solicit feedback. Publish updates.

Refine by manuscript type

Top subject areas - Preprints

Top subject areas - People

View all subject areas