MarkGotham / When-in-Rome

meta-corpus of and code library for the functional harmonic analysis of music
58 stars 11 forks source link

TAVERN Cadenzas - use long fake long measures in rntxt analyses #40

Open jonnybluesman opened 2 years ago

jonnybluesman commented 2 years ago

Hello, I am currently parsing the analysis files in the corpus via the music21 converter. By doing this, I found a couple of annotations with potentially inconsistent timings (see below).

Hope this helps, and thanks a lot for this fantastic corpus!

MarkGotham commented 2 years ago

Hi @jonnybluesman! Thanks for reporting these.

Let us know if you spot anything else.

MarkGotham commented 2 years ago

@napulen can you update us on your solutions to Mozart variation cadenzas?

napulen commented 2 years ago

I believe my general solution to cadenzas was to either:

I did this arbitrarily for what best served the purpose of training the machine learning model, so I wouldn't impose the specific decisions here.

MarkGotham commented 1 year ago

Hey @napulen,

We now have a consistent system for keeping the "original conversion" (if you'll permit the oxymoronic turn of phrase ...) alongside an alteration. See for example changes to the Beethoven piano sonata + BPS original.

Let's frame this that way, so ...

In all cases analysis.txt is the recommended version.

Thanks!

(p.s. Stay tuned for flexible run-time alignment of score and analysis)

napulen commented 1 year ago

I'm unable to provide a pull request at this time, but all matters TAVERN for AugmentedNet are clearly laid out in this file, which is the authority for everything that the neural network is using: https://github.com/napulen/AugmentedNet/blob/main/AugmentedNet/data/tavern.py

Note that sometimes, the entry will point to the "original conversion", whereas sometimes it will point to a so-called "correction", as in this case:

    "tavern-mozart-k353-a": (
        "rawdata/corrections/WiR/Corpus/Variations_and_Grounds/Mozart,_Wolfgang_Amadeus/_/K353/analysis_A.txt",
        "rawdata/corrections/Tavern/Mozart/K353.mxl",
    ),

Those corrections are often pairs of modified RomanText and MusicXML files. The tuples in that python module are to be trusted and always point to a score-annotation pair that have been manually verified to deliver a minimum amount of errors/misalignment.

napulen commented 1 year ago

All data, including the "corrected" (modified is a better term) pairs are publicly available in the repository.