Achieving human and machine accessibility of cited data in scholarly publications
- Published
- Accepted
- Subject Areas
- Science Policy, Human-Computer Interaction, Computational Science
- Keywords
- data citation, machine accessibility, data citation, data archiving
- Copyright
- © 2015 Starr et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
- Cite this article
- 2015. Achieving human and machine accessibility of cited data in scholarly publications. PeerJ PrePrints 3:e697v3 https://doi.org/10.7287/peerj.preprints.697v3
Abstract
Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of research data. However, many common approaches for primary data publication in use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor do they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies recommending uniform direct citation of data archived in persistent repositories. Data are to be considered as first-class scholarly objects, and treated similarly in many ways to cited and archived scientific and scholarly literature. Here we briefly review the most current and widely agreed set of principle-based recommendations for scholarly data citation, the Joint Declaration of Data Citation Principles (JDDCP). We then present a framework for operationalizing the JDDCP; and a set of initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data. The main target audience for the common implementation guidelines in this article consists of publishers, scholarly organizations, and persistent data repositories, including technical staff members in these organizations. But ordinary researchers can also benefit from these recommendations. The guidance provided here is intended to help achieve widespread, uniform human and machine accessibility of deposited data, in support of significantly improved verification, validation, reproducibility and re-use of scholarly/scientific data.
Author Comment
This version is a major rewrite of version 2. It provides significantly more background on the need for data citation in scholarly publications and the development of the Joint Declaration of Data Citation Principles (JDDCP). It presents further material in the text on each of the exemplar JDDCP-conformant identifier schemes, in addition to presenting them in tables. It also discusses in more detail the common metadata models for landing pages, serving landing pages, and access to datasets.