23 downloads
182 views

The ability to promptly recognise new research trends is strategic for many stakeholders, including universities, institutional funding bodies, academic publishers and companies. While the literature describes several approaches which aim to identify the emergence...

["Artificial Intelligence","Data Science","Digital Libraries"]
doi:10.7717/peerj-cs.119
17 downloads
135 views

Scaling up the analysis of sensitive or confidential documents frequently stumbles on the limited number of individuals with the necessary clearance to access the documents. The availability of cryptographic protocols compatible with text processing methods can...

["Cryptography","Data Science","Natural Language and Speech"]
doi:10.7287/peerj.preprints.2994v1
83 downloads
324 views

Sharing and reusing data in research is a welcome and encouraged practice since it maximises the scientific outcomes given limited financial, material and human resources. Interdisciplinary research is considered to benefit from this practice, uniting researchers...

["Bioinformatics","Computational Biology","Data Science","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.2248v4
18 downloads
66 views

Background. There is huge amount of full-text biomedical literatures available in public repositories like PubMed Central (PMC). However, a substantial number of the papers are in Portable Document Format (PDF) and do not provide plain text format ready for text...

["Bioinformatics","Data Science","Databases","Digital Libraries","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2993v1
52 downloads
231 views

Shotgun metagenomics of microbial communities reveal information about strains of relevance for applications in medicine, biotechnology and ecology. Recovering their genomes is a crucial but very challenging step due to the complexity of the underlying biological...

["Bioinformatics","Computational Biology","Data Science"]
doi:10.7717/peerj-cs.117
111 downloads
805 views

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare,...

["Bioinformatics","Data Science","Databases","Emerging Technologies","World Wide Web and Web Science"]
doi:10.7717/peerj-cs.110
111 downloads
472 views

Docker allows packaging an application with its dependencies into a standardized, self-contained unit (a so-called container), which can be used for software development and to run the application on any system. Dockerfiles are declarative definitions of an environment...

["Data Science","Software Engineering"]
doi:10.7287/peerj.preprints.2905v1
83 downloads
629 views

Music transcription involves the transformation of an audio recording to common music notation, colloquially referred to as sheet music. Manually transcribing audio recordings is a difficult and time-consuming process, even for experienced musicians. In response,...

["Data Mining and Machine Learning","Data Science"]
doi:10.7717/peerj-cs.109
180 downloads
609 views

A detailed review of a recent data science book by Hadley Wickham and Garrett Grolemund is developed herein. Technical book reviews should provide a guide to the readers, a sense of the appropriate audience, the specifics of the software/language, and identify...

["Computational Biology","Data Science"]
doi:10.7287/peerj.preprints.2873v1
54 downloads
651 views

Despite recent algorithmic improvements, learning the optimal structure of a Bayesian network from data is typically infeasible past a few dozen variables. Fortunately, domain knowledge can frequently be exploited to achieve dramatic computational savings, and...

["Artificial Intelligence","Data Mining and Machine Learning","Data Science","Distributed and Parallel Computing"]
doi:10.7287/peerj.preprints.2872v1
23 downloads
48 views

Over the past 18 months, we have been working on a dashboard concept that enables researchers a means of interacting with existing research. This work was motivated by the National Data Service (NDS), which is an emerging vision of how scientists and researchers...

["Human-Computer Interaction","Computer Architecture","Data Science","World Wide Web and Web Science","Software Engineering"]
doi:10.7287/peerj.preprints.2845v1
724 downloads
720 views

ATLAS (Automatic Tool for Local Assembly Structures) is a comprehensive multi-omics data analysis pipeline that is massively parallel and scalable. ATLAS contains a modular analysis pipeline for assembly, annotation, quantification and genome binning of metagenomics...

["Bioinformatics","Computational Biology","Data Science","Scientific Computing and Simulation","Software Engineering"]
doi:10.7287/peerj.preprints.2843v1
8 downloads
93 views

Modern biomedical research aims at drawing biological conclusions from large, highly complex biological datasets. Nowadays, it is common practice to make extensive use of high-throughput technologies that produce big amounts of heterogeneous data. In addition to...

["Bioinformatics","Computational Biology","Data Science","Databases","Distributed and Parallel Computing"]
doi:10.7287/peerj.preprints.2839v1
94 downloads
98 views

This study investigates the effects of using a large data set on supervised machine learning classifiers in the domain of Intrusion Detection Systems (IDS). To investigate this effect 12 machine learning algorithms have been applied. These algorithms are: (1) Adaboost,...

["Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.2838v1
13 downloads
71 views

Climate and biodiversity systems are closely interlaced across a wide range of scales. To better understand the mutual interaction between climate change and biodiversity there is a strong need for multidisciplinary skills, tools and a large variety of heterogeneous,...

["Data Science","Scientific Computing and Simulation","Software Engineering"]
doi:10.7287/peerj.preprints.2834v1

Top subject areas - Articles & Preprints

Top subject areas - People

View all subject areas