Judging a commit by its cover; or can a commit message predict build failure?
- Subject Areas
- Data Mining and Machine Learning, Natural Language and Speech, Software Engineering
- commit messages, build status, travis-ci, github, language model, mining software repositories
- © 2016 Santos et al.
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
- Cite this article
- 2016. Judging a commit by its cover; or can a commit message predict build failure? PeerJ PrePrints 4:e1771v1 https://doi.org/10.7287/peerj.preprints.1771v1
Developers summarize their changes to code in commit messages. An "unusual" message, however, may cast doubt on the quality of the code contained in the commit. We trained \(n\)-gram language models on over 120 000 commits from open source projects and used cross-entropy as an indicator of commit message "unusualness". Build statuses collected from Travis-CI were used as a proxy for code quality. We then compared the distributions of failed and successful commits with regard to the "unusualness" of their commit messages. Our analysis yielded significant results when correlating cross-entropy with build status.
This is our work on the MSR 2016 challenge using Boa. The preprint has a little more content than the version submitted to MSR; we found the four-page limit constraining, and may want to extend this work into a full paper.
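The core idea of scoring a commit message by its cross-entropy under a language model can be sketched as follows. This is a minimal illustration, assuming a simple bigram model with add-one smoothing and whitespace tokenization; the paper's actual models and smoothing may differ.

```python
import math
from collections import Counter

def train_bigram(messages):
    # Count unigrams and bigrams over tokenized messages,
    # padding each message with start/end markers.
    unigrams, bigrams = Counter(), Counter()
    for msg in messages:
        tokens = ["<s>"] + msg.lower().split() + ["</s>"]
        unigrams.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    vocab = len(unigrams) + 1  # +1 to reserve mass for unseen tokens
    return unigrams, bigrams, vocab

def cross_entropy(msg, model):
    # Average negative log2 probability per token, in bits:
    # higher cross-entropy = more "unusual" relative to the corpus.
    unigrams, bigrams, vocab = model
    tokens = ["<s>"] + msg.lower().split() + ["</s>"]
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        # Add-one (Laplace) smoothing gives unseen bigrams nonzero mass.
        p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab)
        total -= math.log2(p)
    return total / (len(tokens) - 1)

# Toy corpus of commit messages (hypothetical, for illustration only).
model = train_bigram(["fix typo in readme",
                      "fix broken build",
                      "update readme"])
# A message resembling the corpus scores lower than a gibberish one.
assert cross_entropy("fix readme", model) < cross_entropy("zxqv frobnicate", model)
```

In this framing, a commit message that the model finds surprising (many low-probability transitions) accumulates a high per-token bit cost, which is the "unusualness" score compared against build outcomes.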
Figure 1a: Histograms of commit message cross-entropies, initial model
Note the tall bins, which contain a large number of auto-generated commit messages that were not foreseen when training this model.
Figure 1b: Histograms of commit message cross-entropies, refined model
We recalculated the histogram, removing auto-generated commits, as well as many non-English commit messages.
Figure 2: Empirical cumulative distribution functions of commits
Empirical cumulative distribution functions of the number of passed (in green), failed (in purple), and errored (in orange) commits as cross-entropy ("unusualness") increases. Note that failed initially grows more slowly than passed and errored; by 10 bits, however, failed is indistinguishable from passed and errored.