Preprints (not yet peer-reviewed)

20 downloads
32 views

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs...

["Data Science","Visual Analytics"]
doi:10.7287/peerj.preprints.3448v1
327 downloads
847 views

The sharing and re-use of data has become a cornerstone of modern science. Multiple platforms now allow quick and easy data sharing. So far, however, data publishing models have not accommodated on-going scientific improvements in data: for many problems, datasets...

["Computational Biology","Ecology","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.3401v1
48 downloads
324 views

In translational medicine, the technology of RNA sequencing (RNA-seq) continues to prove powerful, and transforming the RNA-seq data into biological insights has become increasingly imperative. We present the Transcriptomics profiler for Easy Discovery (TED) toolkit,...

["Bioinformatics","Genomics","Translational Medicine","Computational Science","Data Science"]
doi:10.7287/peerj.preprints.3385v1
72 downloads
570 views

There are few truly bad ideas in authentic science. We need to embrace science as a process- driven human endeavour to better understand the world around us. Products are important, but through better transparency, we can leverage ideas, good and bad, ours and...

["Ecology","Human-Computer Interaction","Data Science"]
doi:10.7287/peerj.preprints.3282v2
37 downloads
141 views

Ecological niche modeling (ENM) is increasingly being used in studying the relationship between species distributions and environmental conditions. The development of ENM software/algorithms is heading toward open-source programming, for the advantage of efficiency...

["Biogeography","Bioinformatics","Ecology","Zoology","Data Science"]
doi:10.7287/peerj.preprints.3346v1
42 downloads
92 views

The conditional mutual information \(I(X;Y|Z)\) measures the average information that X and Y contain about each other given Z. This is an important primitive in many learning problems including conditional independence testing, graphical model inference, causal...

["Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.3345v1
192 downloads
556 views

Computational models in biology encode molecular and cell biological processes. These models often can be represented as biochemical reaction networks. Studying such networks, one is mostly interested in systems that share similar reactions and mechanisms. Typical...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning","Data Science"]
doi:10.7287/peerj.preprints.1479v3
28 downloads
156 views

It has been estimated that up to 80% of all data stored in health care databases may have spatial components. To fully exploit such components, there is a need of improving existing tools or developing novel spatio-temporal functionalities. Geographic information...

["Data Science","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.3335v1
57 downloads
163 views

Nowadays, a huge amount of biomedical data of different biological entities is provided by many online databases and services, each with its own data model, user interface and query language. However, typical bioinformatics scenarios require the use of more than...

["Bioinformatics","Data Science"]
doi:10.7287/peerj.preprints.3309v1
33 downloads
75 views

We present GenotypeAnalytics (GA), a RESTFul service that makes it possible to mine association rules from Single Nucleotide Polymorphism (SNP) datasets using standard web browsers. GA can speed up and simplify the analysis of this massive amount of data, highlighting...

["Bioinformatics","Computational Biology","Data Mining and Machine Learning","Data Science","World Wide Web and Web Science"]
doi:10.7287/peerj.preprints.3299v1
27 downloads
102 views

REDCap (Research Electronic Data Capture) is one of the most popular web-based applications to support data capture for research studies and registries. i2b2 (Informatics for Integrating Biology and the Bedside) is a widely adopted data warehouse to re-use clinical...

["Data Science","Databases"]
doi:10.7287/peerj.preprints.3294v1
4,890 downloads
8,771 views

Forecasting is a common data science task that helps organizations with capacity planning, goal setting, and anomaly detection. Despite its importance, there are serious challenges associated with producing reliable and high quality forecasts — especially when...

["Data Science"]
doi:10.7287/peerj.preprints.3190v2
105 downloads
348 views

Background. Artificial enrichment of lakes has posed serious management problems for water supply. In results many European lakes had already undergone significant eutrophication. It seems that a good tool to determine the influence of catchment use on the trophic...

["Data Science","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.2203v2
90 downloads
430 views

Face to the urban resiliency two major environmental threats are widely recognized: the increasing summer air temperatures and the soil consumption that affects a large number of city in Italy. The work have the goal to present preliminary the actual Heat Summer...

["Data Science","Spatial and Geographic Information Systems"]
doi:10.7287/peerj.preprints.2234v2
135 downloads
561 views

Severe weather impact identification and monitoring through social media data is a good challenge for data science. In last years we assisted to an increase of natural disasters, also due to climate change. Many works showed that during such events people tend...

["Data Science","Emerging Technologies","Natural Language and Speech","Network Science and Online Social Networks"]
doi:10.7287/peerj.preprints.2241v2
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

Submissions today can be approved by Editorial Staff and online in 24 hours.

Establish precedent. Solicit feedback. Publish updates.

Top subject areas - Preprints

Top subject areas - People

View all subject areas