Peer-reviewed Articles - Biology

32 downloads
161 views

We describe the Coefficient-Flow algorithm for calculating the bounding chain of an $(n-1)$-boundary on an $n$-manifold-like simplicial complex $S$. We prove its correctness and show that it has a computational time complexity of O(|S(n−1)|) (where S(n−1) is the...

["Algorithms and Analysis of Algorithms","Data Science","Scientific Computing and Simulation"]
doi:10.7717/peerj-cs.153
58 downloads
479 views

Over the last decades, clinical decision support systems have been gaining importance. They help clinicians to make effective use of the overload of available information to obtain correct diagnoses and appropriate treatments. However, their power often comes at...

["Data Mining and Machine Learning","Data Science","Optimization Theory and Computation"]
doi:10.7717/peerj-cs.150
92 downloads
652 views

Most of Python and R scientific packages incorporate compiled scientific libraries to speed up the code and reuse legacy libraries. While several semi-automatic solutions exist to wrap these compiled libraries, the process of wrapping a large library is cumbersome...

["Data Science","Scientific Computing and Simulation","Programming Languages","Software Engineering"]
doi:10.7717/peerj-cs.149
2 citations
359 downloads
3,127 views

This article describes the motivation, design, and progress of the Journal of Open Source Software (JOSS). JOSS is a free and open-access journal that publishes articles describing research software. It has the dual goals of improving the quality of the software...

["Data Science","Digital Libraries","Scientific Computing and Simulation","Software Engineering"]
doi:10.7717/peerj-cs.147
175 downloads
892 views

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs...

["Data Science","Visual Analytics"]
doi:10.7717/peerj-cs.145
1 citation
378 downloads
2,656 views

We describe best practices for providing convenient, high-speed, secure access to large data via research data portals. We capture these best practices in a new design pattern, the Modern Research Data Portal, that disaggregates the traditional monolithic web-based...

["Computer Networks and Communications","Data Science","Distributed and Parallel Computing","Security and Privacy","World Wide Web and Web Science"]
doi:10.7717/peerj-cs.144
3 citations
343 downloads
3,573 views

Computer science offers a large set of tools for prototyping, writing, running, testing, validating, sharing and reproducing results; however, computational science lags behind. In the best case, authors may provide their source code as a compressed archive and...

["Data Science","Digital Libraries","Scientific Computing and Simulation","Social Computing"]
doi:10.7717/peerj-cs.142
105 downloads
1,132 views

Background Software maintenance is an important activity in the development process where maintenance team members leave and new members join over time. The identification of files which are changed together frequently has been proposed several times. Yet, existing...

["Data Science","Software Engineering"]
doi:10.7717/peerj-cs.135
2 citations
353 downloads
1,786 views

Gathering up-to-date information on food prices is critical in developing regions, as it allows policymakers and development practitioners to rely on accurate data on food security. This study explores the feasibility of utilizing social media as a new data source...

["Data Science","Network Science and Online Social Networks","Social Computing"]
doi:10.7717/peerj-cs.126
4 citations
914 downloads
5,064 views

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the graphics processing unit (GPU) and shows high performance with a variety of...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.127
239 downloads
1,283 views

Despite recent algorithmic improvements, learning the optimal structure of a Bayesian network from data is typically infeasible past a few dozen variables. Fortunately, domain knowledge can frequently be exploited to achieve dramatic computational savings, and...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science","Distributed and Parallel Computing"]
doi:10.7717/peerj-cs.122
1 citation
414 downloads
2,665 views

The ability to promptly recognise new research trends is strategic for many stakeholders, including universities, institutional funding bodies, academic publishers and companies. While the literature describes several approaches which aim to identify the emergence...

["Artificial Intelligence","Data Science","Digital Libraries"]
doi:10.7717/peerj-cs.119
219 downloads
1,041 views

Shotgun metagenomics of microbial communities reveal information about strains of relevance for applications in medicine, biotechnology and ecology. Recovering their genomes is a crucial but very challenging step due to the complexity of the underlying biological...

["Bioinformatics","Computational Biology","Data Science"]
doi:10.7717/peerj-cs.117
2 citations
531 downloads
2,859 views

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare,...

["Bioinformatics","Data Science","Databases","Emerging Technologies","World Wide Web and Web Science"]
doi:10.7717/peerj-cs.110
392 downloads
1,431 views

Music transcription involves the transformation of an audio recording to common music notation, colloquially referred to as sheet music. Manually transcribing audio recordings is a difficult and time-consuming process, even for experienced musicians. In response,...

["Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.109

Refine by manuscript type

Top subject areas - Articles

Top subject areas - People

View all subject areas