anthonio9 / penn

Pitch Estimating Neural Networks (PENN)
MIT License
0 stars 0 forks source link

Thesis Story #10

Open anthonio9 opened 8 months ago

anthonio9 commented 8 months ago

Thesis story wrap-up:

anthonio9 commented 8 months ago

FretNet has multiple evaluator classes like PitchListEvaluator or MultipitchEvaluator. Each class has an unpack() method and evaluate method. The idea now is to derive from PitchListEvaluator class, use the same unpack() function, but a completely different evaluate().

anthonio9 commented 7 months ago

Why does the reference differ so much from estimated in time stamps, but only in the pitch_list?

Image

anthonio9 commented 7 months ago

As for the previous question, it seems that only every 4th timestamp is present in the predicted set. This means that FretNet is only made to handle larger buffers and latency is larger then what was designed in PPN.

anthonio9 commented 7 months ago

Thesis story wrap-up:

* [ ]  should contain a comparison to PENN when with polyphonic pitch audio to show that it does not work at all

How should this be presented?

Plot of an example track with ground truth and predicted pitch, both over a spectrogram - This should be pretty good. In addition to that a metric for FRMSE and FRPA would be a great addition.

anthonio9 commented 7 months ago

Thesis Layout

Thesis should be 30-40-50 pages.

Introduction:

Section 3: Proposed method

Section 4: experiments

Results 5: results and analysis

anthonio9 commented 6 months ago

For the next meeting: the table of contents + anything extra is nice.

anthonio9 commented 6 months ago

Main results table: String Agnostic RMSE, String Agnostic RPA Non-String Agnosic RMSE, Non-String Agnostic RPA,

if possible, copy String-Agnostic Note from FretNet

anthonio9 commented 5 months ago
anthonio9 commented 4 months ago