Preprints (not yet peer-reviewed)

Scaling up the analysis of sensitive or confidential documents frequently stumbles on the limited number of individuals with the necessary clearance to access the documents. The availability of cryptographic protocols compatible with text processing methods can...

["Cryptography","Data Science","Natural Language and Speech"]
doi:10.7287/peerj.preprints.2994v1
75 downloads
273 views

Sharing and reusing data in research is a welcome and encouraged practice since it maximises the scientific outcomes given limited financial, material and human resources. Interdisciplinary research is considered to benefit from this practice, uniting researchers...

["Bioinformatics","Computational Biology","Data Science","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.2248v4

Background. There is a huge amount of full-text biomedical literature available in public repositories like PubMed Central (PMC). However, a substantial number of the papers are in Portable Document Format (PDF) and do not provide plain text format ready for text...

["Bioinformatics","Data Science","Databases","Digital Libraries","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2993v1
3 views

Distributed open source software development has largely turned to GitHub, a pull-based software development collaboration platform. Recent studies have deployed data science techniques on the large datasets available about millions of projects on GitHub. Some...

["Human-Computer Interaction","Social Computing","Programming Languages","Software Engineering"]
doi:10.7287/peerj.preprints.2989v1
148 downloads
264 views

Landscape genetics combines population genetics, landscape ecology, and spatial analysis to identify landscape and genetic factors that influence genetic and genomic variation. Progress in the field depends on a strong conceptual foundation and the means of identifying...

["Computational Biology","Scientific Computing and Simulation","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.2225v4
74 downloads
174 views

Digital repositories have a direct impact and influence on the research community and society, but measuring their value using formal metrics remains challenging. It is challenging to define a single perfect metric that covers all quality aspects. Here,...

["Bioinformatics","Computational Biology","Digital Libraries","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2688v2
17 downloads
91 views

Transferring a huge amount of data between different network locations over the network links depends on the heterogeneous wireless network. Such a network consists of several networks with different access technologies. Traditionally, a mobile device may be moved...

["Computer Networks and Communications","Mobile and Ubiquitous Computing","Network Science and Online Social Networks"]
doi:10.7287/peerj.preprints.2970v1
89 downloads
367 views

Although most Spatial Data Infrastructures offer service-based visualization of geospatial data, requirements are often at a very basic level, leading to poor-quality maps. This is a general observation for any geospatial architecture as soon as open standards...

["Spatial and Geographic Information Systems","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.2415v2
19 downloads
102 views

GitHub is a key online collaborative software development environment. In this paper we describe a new category of GitHub project: Curation projects collect, evaluate, and preserve resources for software developers. We investigated 1) what motivates software developers...

["Human-Computer Interaction","Social Computing","Software Engineering"]
doi:10.7287/peerj.preprints.2952v1
137 downloads
565 views

Whole genome alignments and comparative analysis are key methods in the quest of unraveling the dynamics of genome evolution. Interactive visualization and exploration of the generated alignments, annotations, and phylogenetic data are important steps in the interpretation...

["Bioinformatics","Computational Biology"]
doi:10.7287/peerj.preprints.2348v2
12 downloads
57 views

Three approaches to implementing genetic programming on GPU hardware are compilation, interpretation and direct generation of machine code. The compiled approach is known to have a prohibitive overhead compared to the other two. This paper investigates methods to accelerate...

["Artificial Intelligence","Distributed and Parallel Computing"]
doi:10.7287/peerj.preprints.2936v1
61 downloads
235 views

An immune system inspired Artificial Immune System (AIS) algorithm is presented, and is used for the purposes of automated program verification. Relevant immunological concepts are discussed and the field of AIS is briefly reviewed. It is proposed to use this AIS...

["Computational Biology","Adaptive and Self-Organizing Systems","Theory and Formal Methods"]
doi:10.7287/peerj.preprints.2690v2
163 downloads
965 views

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the GPU and shows high performance with a variety of datasets and settings, including...

["Artificial Intelligence","Data Mining and Machine Learning"]
doi:10.7287/peerj.preprints.2911v1
201 downloads
1,026 views

Large-scale molecular evolutionary analyses of protein coding sequences require a number of inter-related preparatory steps, from finding gene families to generating alignments and phylogenetic trees and assessing selective pressure variation. Each phase of these...

["Bioinformatics","Computational Biology"]
doi:10.7287/peerj.preprints.1895v2
90 downloads
417 views

Docker allows packaging an application with its dependencies into a standardized, self-contained unit (a so-called container), which can be used for software development and to run the application on any system. Dockerfiles are declarative definitions of an environment...

["Data Science","Software Engineering"]
doi:10.7287/peerj.preprints.2905v1
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

