localaization / pentagrom

We want to open source a machine learning model using a [(7 rows x constant) x 3 columns] + key signature matrix.
GNU General Public License v3.0
0 stars 0 forks source link

[5] MACHINE LEARNING - As a developer I want to optimize the preprocessing steps so that the modeling is easier for neural sequence model #10

Closed localaization closed 1 year ago

localaization commented 1 year ago

Description: We are evaluating the A. Event-based Representation for Music of the document MuseMorphose. We need to check with Jaime this method.

Documentation: From MuseMorphose and example midi file.zip

Image

From This Time with Feeling: Learning Expressive Musical Performance The piano is used in this example

Image

From SEQUENCE-TO-SEQUENCE PIANO TRANSCRIPTION WITH TRANSFORMERS

pentagrom-presentation-014

From MT3: MULTI-TASK MULTITRACK MUSIC TRANSCRIPTION

pentagrom-presentation-013

From Twitter Image

Piano mapping

Does not univocally map any written note. Image

Pentagrom mapping

It maps univocally any written note. Image

Solution

  1. Read A. Event-based Representation for Music chapter of the MuseMorphose doc.
  2. Take a look at the documentation (in the section above).
  3. Identify and check if the variables and tokens are suitable for the pentagrom structure.
  4. Map the identified elements which are suitable to the actual pentagrom elements.

Q&A:

  1. Check this paper and see if a 7x3 matrix instead of 8x1 make sense and solve the questions: [Piano Genie] (https://arxiv.org/pdf/1810.05246.pdf)

Definition of Done: To have an agreement whether or not section "A. Event-based Representation for Music" from MuseMorphose is a good approximation.

localaization commented 1 year ago

From This Time with Feeling: Learning Expressive Musical Performance

Image

TuWebO commented 1 year ago

There is a lot of information that we could potentially get from: https://magenta.tensorflow.org/

I didn't know about it until now. I will take note about some interesting links: https://magenta.tensorflow.org/ddsp-vst-blog https://openreview.net/forum?id=B1x1ma4tDr&noteId=lcOwh022Vta https://github.com/magenta/mt3 https://archives.ismir.net/ismir2021/paper/000030.pdf https://openreview.net/pdf?id=iMSjopcOn0p

TuWebO commented 1 year ago

I've updated the documentation with the latest articles I've seen. I will meet Jaime tomorrow and leave this task unassigned for now.

localaization commented 1 year ago

Jaime and myself have been talking about this issue. We have red the A. Event-based Representation for Music from the document MuseMorphose, also another papers and pianogenie project. We think the pentagrom system could have one main advantage:

Image

Image

Image

localaization commented 1 year ago

@isamu-isozaki this is the task, if you think it is solved feel free to mark it as done, otherwise assign it to me and back to in progress.

isamu-isozaki commented 1 year ago

@localaization yup nice work! I think it's done. I think for machine learning, we just go through your format -> remi -> model which should do the job. Possibly midi in between