Peer-reviewed Articles - Computer sci

95 downloads
748 views

The increased interest in analyzing and explaining gender inequalities in tech, media, and academia highlights the need for accurate inference methods to predict a person’s gender from their name. Several such services exist that provide access to large databases...

["Data Mining and Machine Learning","Data Science","Databases","Digital Libraries"]
doi:10.7717/peerj-cs.156
83 downloads
497 views

We describe the Coefficient-Flow algorithm for calculating the bounding chain of an $(n-1)$-boundary on an $n$-manifold-like simplicial complex $S$. We prove its correctness and show that it has a computational time complexity of O(|S(n−1)|) (where S(n−1) is the...

["Algorithms and Analysis of Algorithms","Data Science","Scientific Computing and Simulation"]
doi:10.7717/peerj-cs.153
86 downloads
725 views

Over the last decades, clinical decision support systems have been gaining importance. They help clinicians to make effective use of the overload of available information to obtain correct diagnoses and appropriate treatments. However, their power often comes at...

["Data Mining and Machine Learning","Data Science","Optimization Theory and Computation"]
doi:10.7717/peerj-cs.150
162 downloads
1,126 views

Most of Python and R scientific packages incorporate compiled scientific libraries to speed up the code and reuse legacy libraries. While several semi-automatic solutions exist to wrap these compiled libraries, the process of wrapping a large library is cumbersome...

["Data Science","Scientific Computing and Simulation","Programming Languages","Software Engineering"]
doi:10.7717/peerj-cs.149
2 citations
437 downloads
3,782 views

This article describes the motivation, design, and progress of the Journal of Open Source Software (JOSS). JOSS is a free and open-access journal that publishes articles describing research software. It has the dual goals of improving the quality of the software...

["Data Science","Digital Libraries","Scientific Computing and Simulation","Software Engineering"]
doi:10.7717/peerj-cs.147
278 downloads
1,144 views

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs...

["Data Science","Visual Analytics"]
doi:10.7717/peerj-cs.145
1 citation
450 downloads
3,136 views

We describe best practices for providing convenient, high-speed, secure access to large data via research data portals. We capture these best practices in a new design pattern, the Modern Research Data Portal, that disaggregates the traditional monolithic web-based...

["Computer Networks and Communications","Data Science","Distributed and Parallel Computing","Security and Privacy","World Wide Web and Web Science"]
doi:10.7717/peerj-cs.144
4 citations
389 downloads
3,931 views

Computer science offers a large set of tools for prototyping, writing, running, testing, validating, sharing and reproducing results; however, computational science lags behind. In the best case, authors may provide their source code as a compressed archive and...

["Data Science","Digital Libraries","Scientific Computing and Simulation","Social Computing"]
doi:10.7717/peerj-cs.142
119 downloads
1,342 views

Background Software maintenance is an important activity in the development process where maintenance team members leave and new members join over time. The identification of files which are changed together frequently has been proposed several times. Yet, existing...

["Data Science","Software Engineering"]
doi:10.7717/peerj-cs.135
2 citations
399 downloads
2,042 views

Gathering up-to-date information on food prices is critical in developing regions, as it allows policymakers and development practitioners to rely on accurate data on food security. This study explores the feasibility of utilizing social media as a new data source...

["Data Science","Network Science and Online Social Networks","Social Computing"]
doi:10.7717/peerj-cs.126
4 citations
1,055 downloads
5,839 views

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the graphics processing unit (GPU) and shows high performance with a variety of...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.127
278 downloads
1,534 views

Despite recent algorithmic improvements, learning the optimal structure of a Bayesian network from data is typically infeasible past a few dozen variables. Fortunately, domain knowledge can frequently be exploited to achieve dramatic computational savings, and...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science","Distributed and Parallel Computing"]
doi:10.7717/peerj-cs.122
1 citation
450 downloads
2,955 views

The ability to promptly recognise new research trends is strategic for many stakeholders, including universities, institutional funding bodies, academic publishers and companies. While the literature describes several approaches which aim to identify the emergence...

["Artificial Intelligence","Data Science","Digital Libraries"]
doi:10.7717/peerj-cs.119
254 downloads
1,174 views

Shotgun metagenomics of microbial communities reveal information about strains of relevance for applications in medicine, biotechnology and ecology. Recovering their genomes is a crucial but very challenging step due to the complexity of the underlying biological...

["Bioinformatics","Computational Biology","Data Science"]
doi:10.7717/peerj-cs.117
3 citations
581 downloads
3,278 views

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare,...

["Bioinformatics","Data Science","Databases","Emerging Technologies","World Wide Web and Web Science"]
doi:10.7717/peerj-cs.110

Refine by manuscript type

Top subject areas - Articles

Top subject areas - People

View all subject areas