Visitors   Views   Downloads
NOT PEER-REVIEWED
"PeerJ Preprints" is a venue for early communication or feedback before peer review. Data may be preliminary.

Supplemental Information

Figure 1a: Histograms of commit message cross-entropies, initial model

Note the tall bins which contain a large number of auto-generated commit messages that were not foreseen when training this model.

DOI: 10.7287/peerj.preprints.1771v1/supp-1

Figure 1b: Histograms of commit message cross-entropies, refined model

We recalculated the histogram, removing auto-generated commits, as well as many non-English commit messages.

DOI: 10.7287/peerj.preprints.1771v1/supp-2

Figure 2: Empirical cumulative distribution functions of commits

Empirical cumulative distribution functions of number of passed (in green), failed (in purple), and errored (in orange) commits as cross-entropy (“unusualness”) increases. Note that failed, initially grows slower than passed and errored; by 10 bits, however, failed is indistinguishable from passed and errored.

DOI: 10.7287/peerj.preprints.1771v1/supp-3

Additional Information

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Eddie A Santos conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, performed the computation work.

Abram Hindle contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Data Deposition

The following information was supplied regarding data availability:

Repository for data utilities: https://github.com/eddieantonio/judging-commits

The SQLite3 database full of all data:

https://drive.google.com/open?id=0ByMXxDHxG3WSbzEtc1BoTk1NcTA

Funding

The authors received no funding for this work.


Add your feedback

Before adding feedback, consider if it can be asked as a question instead, and if so then use the Question tab. Pointing out typos is fine, but authors are encouraged to accept only substantially helpful feedback.

Some Markdown syntax is allowed: _italic_ **bold** ^superscript^ ~subscript~ %%blockquote%% [link text](link URL)
 
By posting this you agree to PeerJ's commenting policies