Preprints (not yet peer-reviewed)

3 downloads
44 views

We present in this article a lightweight ontology named PGxO and a set of rules for its instantiation, which we developed as a frame for reconciling and tracing pharmacogenomics (PGx) knowledge. PGx studies how genomic variations impact variations in drug response...

["Bioinformatics","Artificial Intelligence","Data Mining and Machine Learning","Databases","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.3140v1
84 downloads
250 views

Metabarcoding and metagenomic approaches are becoming routine techniques in biodiversity assessment and ecological studies. The assignment of taxonomic information to sequences is challenging, as many reference libraries are lacking information on certain taxonomic...

["Biodiversity","Bioinformatics","Ecology","Molecular Biology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.3133v1
56 downloads
65 views

Minor syntax errors are made by novice and experienced programmers alike; however, novice programmers lack the years of intuition that help them resolve these tiny errors. Standard LR parsers typically resolve syntax errors and their precise location poorly. We...

["Data Mining and Machine Learning","Software Engineering"]
doi:10.7287/peerj.preprints.3123v1
34 downloads
202 views

Acoustic classification of frogs has received increasing attention for its promising application in ecological studies. Various studies have been proposed for classifying frog species, but most recordings are assumed to have only a single species. In this study,...

["Artificial Intelligence","Data Mining and Machine Learning","Multimedia"]
doi:10.7287/peerj.preprints.3007v1
473 downloads
1,416 views

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the GPU and shows high performance with a variety of datasets and settings, including...

["Artificial Intelligence","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.2911v1
65 downloads
700 views

Despite recent algorithmic improvements, learning the optimal structure of a Bayesian network from data is typically infeasible past a few dozen variables. Fortunately, domain knowledge can frequently be exploited to achieve dramatic computational savings, and...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science","Distributed and Parallel Computing"]
doi:10.7287/peerj.preprints.2872v1
136 downloads
125 views

This study investigates the effects of using a large data set on supervised machine learning classifiers in the domain of Intrusion Detection Systems (IDS). To investigate this effect, 12 machine learning algorithms have been applied. These algorithms are: (1) AdaBoost,...

["Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.2838v1
10 downloads
69 views

This paper presents the latest developments on the VIALACTEA Science Gateway in the context of the FP7 VIALACTEA project. This science gateway operates as a central workbench for the VIALACTEA community in order to allow astronomers to process the new-generation...

["Data Mining and Machine Learning","Distributed and Parallel Computing","Scientific Computing and Simulation"]
doi:10.7287/peerj.preprints.2818v2
265 downloads
462 views

Software energy consumption is a performance-related non-functional requirement that complicates building software on mobile devices today. Energy-hogging applications are a liability to both the end-user and software developer. Measuring software energy consumption...

["Data Mining and Machine Learning","Mobile and Ubiquitous Computing","Software Engineering"]
doi:10.7287/peerj.preprints.2419v3
62 downloads
662 views

The nonparametric minimum hypergeometric (mHG) test is a popular alternative to Kolmogorov-Smirnov (KS)-type tests for determining gene set enrichment. However, these approaches have not been compared to each other in a quantitative manner. Here, I first perform...

["Computational Biology","Algorithms and Analysis of Algorithms","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.1962v3
107 downloads
126 views

Recognition of human emotions from the imaging templates is useful in a wide variety of human-computer interaction and intelligent systems applications. However, the automatic recognition of facial expressions using image template matching techniques suffers from...

["Human-Computer Interaction","Algorithms and Analysis of Algorithms","Artificial Intelligence","Computer Vision","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.2794v1
35 downloads
350 views

Motivated by the increasing number of voices asking for careful consideration of what context-rich data analysis methods can tell us about the activities of human collectives, we contribute an argumentation that employs a dialectic of literature on the philosophy...

["Data Mining and Machine Learning","Data Science","Network Science and Online Social Networks","Social Computing","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2789v1
178 downloads
389 views

Background. The availability of large databases containing high-resolution three-dimensional (3D) models of proteins in conjunction with functional annotation allows the exploitation of advanced supervised machine learning techniques for automatic protein function...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.2778v1
41 downloads
66 views

Feature selection in machine learning is of great interest since it is considered to yield more efficient predictive models in several engineering domains. It is of special importance in the pulp and paper transformation industry as the knowledge of this...

["Artificial Intelligence","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.2749v1
43 downloads
165 views

Flight simulators are systems composed of numerous off-the-shelf components that allow pilots and maintenance crew to prepare for common and emergency flight procedures for a given aircraft model. A simulator must follow severe safety specifications to guarantee...

["Data Mining and Machine Learning","Scientific Computing and Simulation","Software Engineering"]
doi:10.7287/peerj.preprints.2670v1