Formatting Open Science: agile creation of multiple document types by writing academic manuscripts in pandoc markdown

Pandoc Development Team, Lübeck, Germany
Biotechnology and Biochemistry, CINVESTAV Unidad Irapuato, Irapuato, Mexico
DOI
10.7287/peerj.preprints.2648v1
Subject Areas
Human-Computer Interaction, Computer Education, Computer Networks and Communications, Digital Libraries, World Wide Web and Web Science
Keywords
open science, markdown, latex, publishing, typesetting, document formats
Copyright
© 2016 Krewinkel et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Krewinkel A, Winkler R. 2016. Formatting Open Science: agile creation of multiple document types by writing academic manuscripts in pandoc markdown. PeerJ Preprints 4:e2648v1

Abstract

The timely publication of scientific results is essential for dynamic advances in science. The ubiquitous availability of computers which are connected to a global network made the rapid and low-cost distribution of information through electronic channels possible. New concepts, such as Open Access publishing and preprint servers are currently changing the traditional print media business towards a community-driven peer production. However, the cost of scientific literature generation, which is either charged to readers, authors or sponsors, is still high. The main active participants in the authoring and evaluation of scientific manuscripts are volunteers, and the cost for online publishing infrastructure is close to negligible. A major time and cost factor though is the formatting of manuscripts in the production stage. In this article we demonstrate the feasibility to write scientific manuscripts in plain markdown (MD) text files, which can be easily converted into common publication formats, such as PDF, HTML or EPUB, using pandoc. The simple syntax of markdown assures the long-term readability of raw files and the development of software and workflows. We show the implementation of typical elements of scientific manuscripts -- formulas, tables, code blocks and citations -- and present tools for editing, collaborative writing and version control. We give an example on how to prepare a manuscript with distinct output formats, a DOCX file for submission to a journal and a LATEX/PDF version for deposition as a PeerJ preprint. Reducing the work spent on manuscript formatting translates directly to time and cost savings for writers, publishers, readers and sponsors. Therefore, the adoption of the MD format contributes to the agile production of open science literature.

Author Comment

This is a submission to PeerJ Computer Science for review.