cwitkowitz / guitar-transcription-continuous

Code for the paper "FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription".
MIT License
23 stars 2 forks source link

Continuous-Valued Guitar Transcription (FretNet)

Code for the paper "FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription". This repository contains scripts which do the following (and more):

The repository utilizes amt-tools, a more general music transcription repository, and guitar-transcription-with-inhibition, a repository built for previous research on reformulating and applying inhibition to the tablature output layer of TabCNN.

Installation

Clone the repository, install the requirements, then install the package:

git clone https://github.com/cwitkowitz/guitar-transcription-continuous
pip install -r guitar-transcription-continuous/requirements.txt
pip install -e guitar-transcription-continuous/

Usage

TODO

TODO

Six-Fold Cross-Validation on GuitarSet

The scripts experiment.py and evaluation.py under six_fold_cv_scripts are available as a more complete example of how to train and evaluate the proposed model under the six-fold cross-validation schema using amt-tools.

Generated Files

Execution of six_fold_cv_scripts/experiment.py will generate the following under <root_dir> (defined at the top of the script):

Additionally, ground-truth and features will be saved under the path specified by gset_cache, unless save_data=False.

1An additional folder (n += 1) containing similar files is created for each execution with the same experiment name <EX_NAME>.

Analysis

During training, losses and various validation metrics can be analyzed in real-time by running:

tensorboard --logdir=<root_dir>/models --port=<port>

Here we assume the current working directory contains <root_dir>, and <port> is an integer corresponding to an available port (port = 6006 if unspecified).

After running the above command, navigate to [http://localhost:&lt;port&gt;]() with an internet browser to view any reported training or validation observations within the tensorboard interface.

Cite

ICASSP 2023 Paper (Link)
@article{cwitkowitz2023fretnet,
  title     = {FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription},
  author    = {Cwitkowitz, Frank and Hirvonen, Toni and Klapuri, Anssi},
  year      = 2023,
  booktitle = {Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)}
}