Peer-reviewed Articles - Biology

104 downloads
724 views

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the graphics processing unit (GPU) and shows high performance with a variety of...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.127
134 downloads
593 views

Background The availability of large databases containing high resolution three-dimensional (3D) models of proteins in conjunction with functional annotation allows the exploitation of advanced supervised machine learning techniques for automatic protein function...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning"]
doi:10.7717/peerj-cs.124
66 downloads
622 views

Despite recent algorithmic improvements, learning the optimal structure of a Bayesian network from data is typically infeasible past a few dozen variables. Fortunately, domain knowledge can frequently be exploited to achieve dramatic computational savings, and...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science","Distributed and Parallel Computing"]
doi:10.7717/peerj-cs.122
80 downloads
743 views

Motivation Scientists increasingly rely on intelligent information systems to help them in their daily tasks, in particular for managing research objects, like publications or datasets. The relatively young research field of Semantic Publishing has been addressing...

["Human-Computer Interaction","Artificial Intelligence","Data Mining and Machine Learning","Digital Libraries"]
doi:10.7717/peerj-cs.121
128 downloads
818 views

We developed a web-based cloud-hosted system that allow users to archive, listen, visualize, and annotate recordings. The system also provides tools to convert these annotations into datasets that can be used to train a computer to detect the presence or absence...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning"]
doi:10.7717/peerj-cs.113
162 downloads
799 views

Music transcription involves the transformation of an audio recording to common music notation, colloquially referred to as sheet music. Manually transcribing audio recordings is a difficult and time-consuming process, even for experienced musicians. In response,...

["Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.109
248 downloads
1,043 views

Online Social Networks (OSNs) have been widely adopted as a means of news dissemination, event reporting, opinion expression and discussion. As a result, news and events are being constantly reported and discussed online through OSNs such as Twitter. However, the...

["Data Mining and Machine Learning","Network Science and Online Social Networks","Social Computing"]
doi:10.7717/peerj-cs.107
1 citation
151 downloads
909 views

Synthesizing human movement is useful for most applications where the use of avatars is required. These movements should be as realistic as possible and thus must take into account anthropometric characteristics (weight, height, etc.), gender, and the performance...

["Data Mining and Machine Learning","Graphics","Scientific Computing and Simulation"]
doi:10.7717/peerj-cs.102
8 citations
3,177 downloads
26,030 views

Recent advances in Natural Language Processing and Machine Learning provide us with the tools to build predictive models that can be used to unveil patterns driving judicial decisions. This can be useful, for both lawyers and judges, as an assisting tool to rapidly...

["Artificial Intelligence","Computational Linguistics","Data Mining and Machine Learning","Data Science","Natural Language and Speech"]
doi:10.7717/peerj-cs.93
3 citations
294 downloads
1,352 views

Ascribing function to sequence in the absence of biological data is an ongoing challenge in bioinformatics. Differentiating the toxins of venomous animals from homologues having other physiological functions is particularly problematic as there are no universally...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning"]
doi:10.7717/peerj-cs.90
212 downloads
1,184 views

We describe a method for assessing data set complexity based on the estimation of the underlining probability distribution and Hellinger distance. In contrast to some popular complexity measures, it is not focused on the shape of a decision boundary in a classification...

["Algorithms and Analysis of Algorithms","Artificial Intelligence","Data Mining and Machine Learning"]
doi:10.7717/peerj-cs.76
4 citations
267 downloads
2,432 views

A successful software project is the result of a complex process involving, above all, people. Developers are the key factors for the success of a software development process, not merely as executors of tasks, but as protagonists and core of the whole development...

["Data Mining and Machine Learning","Data Science","Software Engineering"]
doi:10.7717/peerj-cs.73
217 downloads
888 views

It is a well-known fact that some criminals follow perpetual methods of operations known as modi operandi. Modus operandi is a commonly used term to describe the habits in committing crimes. These modi operandi are used in relating criminals to crimes for which...

["Algorithms and Analysis of Algorithms","Artificial Intelligence","Data Mining and Machine Learning"]
doi:10.7717/peerj-cs.65
2 citations
382 downloads
1,378 views

We consider the problem of detecting and quantifying the periodic component of a function given noise-corrupted observations of a limited number of input/output tuples. Our approach is based on Gaussian process regression, which provides a flexible non-parametric...

["Data Mining and Machine Learning","Optimization Theory and Computation"]
doi:10.7717/peerj-cs.50
14 citations
2,756 downloads
16,863 views

Probabilistic programming allows for automatic Bayesian inference on user-defined probabilistic models. Recent advances in Markov chain Monte Carlo (MCMC) sampling allow inference on increasingly complex models. This class of MCMC, known as Hamiltonian Monte Carlo,...

["Data Mining and Machine Learning","Data Science","Scientific Computing and Simulation"]
doi:10.7717/peerj-cs.55

Top subject areas - Articles

Top subject areas - People

View all subject areas