Preprints (not yet peer-reviewed)

442 downloads
2,485 views

There is a massive crisis of confidence in statistical inference, which has largely been attributed to overemphasis on and abuse of hypothesis testing. Much of the abuse stems from failure to recognize that statistical tests not only test hypotheses, but countless...

["Science and Medical Education","Science Policy","Statistics","Data Science"]
doi:10.7287/peerj.preprints.26857v1
131 downloads
737 views

A new layer of complexity, constituted of networks of information token recurrence, has been identified in socio-technical systems such as the Wikipedia online community and the Zooniverse citizen science platform. The identification of this complexity reveals...

["Data Mining and Machine Learning","Data Science","Network Science and Online Social Networks","Social Computing","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2789v2
29 downloads
105 views

We propose a simple neural network model that can learn the relation between sentences by modeling the task as an Earth Mover's Distance (EMD) calculation. The underlying hypothesis is that a neural module can learn to approximate the flow optimization in EMD calculation...

["Artificial Intelligence","Data Science","Natural Language and Speech"]
doi:10.7287/peerj.preprints.26847v1
117 downloads
327 views

Many data include time or have longitudinal dimensionality. When these data include an index of time, i.e., measures at regular or periodically successive intervals, statistics that use time sequencing in some capacity are appropriate. Consequently, investing time...

["Bioinformatics","Computational Biology","Data Science"]
doi:10.7287/peerj.preprints.26842v1
89 downloads
160 views

Identifying and counting individual fish on videos is a crucial task to cost-effectively monitor marine biodiversity, but it remains a difficult and time-consuming task. In this paper, we present a method to assist the automated identification of fish species on...

["Biodiversity","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.26818v1
26 downloads
65 views

To better understand the species latitudinal and depth gradients in the NW Pacific and its adjacent Arctic Ocean, distribution records of all marine species were extracted from the Ocean Biogeographic Information System (OBIS) and Global Biodiversity Information...

["Biodiversity","Biogeography","Marine Biology","Data Science"]
doi:10.7287/peerj.preprints.26756v1
39 downloads
76 views

IRT models are often applied when observed items are used to measure a unidimensional latent variable. Originally used in educational research, IRT models are now widely used when the focus is on physical functioning or psychological well-being. Modern applications...

["Data Science","Graphics","Programming Languages"]
doi:10.7287/peerj.preprints.26740v1
1,640 downloads
3,823 views

Computers are a central tool in the research process, enabling complex and large scale data analysis. As computer-based research has increased in complexity, so have the challenges of ensuring that this research is reproducible. To address this challenge, we review...

["Anthropology","Computational Biology","Science and Medical Education","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.3192v2
35 downloads
69 views

Nowadays, a large number of machine learning models can be applied in various areas. However, different research targets are usually sensitive to the type of model. For a specific prediction target, the predictive accuracy of a machine learning model...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.26714v1
170 downloads
381 views

Emotion expression encompasses various types of information, including face and eye movement, voice and body motion. Most studies in automated affective recognition use faces as stimuli; less often they include speech, and even more rarely gestures. Emotions...

["Neuroscience","Human-Computer Interaction","Computational Science","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.26688v1
28 downloads
152 views

Background. The institutional affiliations and associated collaborative networks that scientists foster during their research careers are salient in the production of high quality science. The phenomenon of multiple institutional affiliations and its relationship...

["Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.26654v1
68 downloads
123 views

Many people make their opinions available on the Internet nowadays, and researchers have been proposing methods to automate the task of classifying textual reviews as positive or negative. Usual supervised learning techniques have been adopted to accomplish such...

["Computational Linguistics","Data Mining and Machine Learning","Data Science","Natural Language and Speech","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.26618v1
85 downloads
159 views

R has many capabilities, most of which are unknown to many users and are waiting to be discovered. For this reason we provide more tips on how to write really efficient code without having to program in C++, programming advice, and tips to avoid errors and numerical...

["Bioinformatics","Computational Biology","Computational Linguistics","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.26605v1
82 downloads
161 views

Background. Automatic contradiction detection, or conflicting-statement detection, consists of identifying discrepancy, inconsistency and defiance in text, and has several real-world applications in question answering systems, multi-document summarization,...

["Artificial Intelligence","Computational Linguistics","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.26589v1
67 downloads
273 views

Motivated by a desire to curb "predatory" publishing, we created FlourishOA, a one-stop shop for authors, publishers, funders, librarians, and policy makers to find high-quality, cost-effective Open Access (OA) journals. FlourishOA provides Article Processing Charge...

["Algorithms and Analysis of Algorithms","Data Mining and Machine Learning","Data Science","Databases"]
doi:10.7287/peerj.preprints.26586v1
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

Submissions made today can be approved by Editorial Staff and be online within 24 hours.

Establish precedent. Solicit feedback. Publish updates.
