Isolated guitar transcription using a deep belief network

View article
PeerJ Computer Science
Timbre refers to several attributes of an audio signal that allows humans to attribute a sound to its source and to differentiate between a trumpet and a piano, for instance. Timbre is often referred to as the “colour” of a sound.

Main article text

 

Introduction

Deep Belief Networks

Unsupervised pretraining

Supervised fine-tuning

Isolated Instrument Transcription

Audio signal preprocessing

Note pitch estimation

Note tracking

Frame-level smoothing

Onset quantization

Music notation arrangement

Note pitch estimation metrics

Polyphonic transcription metrics

  • The pitch name and octave number of the note event estimate and ground-truth note event must be equivalent.

  • The note event estimate’s onset time is within ±250 ms of the ground-truth note event’s onset time.

  • Only one ground-truth note event can be associated with each note event estimate.

Experimental method and evaluation

Ground-truth dataset

Algorithm parameter selection

Frame-level pitch estimation evaluation

Note event evaluation

Multiple guitar model evaluation

Number of network hidden layers

Discussion

Limitations of synthesis

Conclusion

Future work

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Gregory Burlet conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, performed the computation work, reviewed drafts of the paper, algorithm design.

Abram Hindle conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, performed the computation work, reviewed drafts of the paper, algorithm design.

Data Availability

The following information was supplied regarding data availability:

Deep learning guitar transcriptions:

https://archive.org/details/DeepLearningIsolatedGuitarTranscriptions.

Funding

This research was funded by an Alberta Innovates Technology Futures Graduate Student Scholarship and an Alberta Innovation and Advanced Education Graduate Student Scholarship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

6 Citations 4,122 Views 999 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more