chochinlu / suno-song-generator


Text extraction pipeline to cater for all songs. #11

Open cmm25 opened 1 month ago

cmm25 commented 1 month ago

Summary: Currently, the project relies on YouTube’s captioning system for lyrics extraction. However, only a limited number of YouTube videos have captions enabled, restricting the number of songs from which we can extract lyrics. This issue proposes the development of a text extraction pipeline that can work independently of YouTube captions, enabling us to extract lyrics from a broader range of songs.

Problem:

Proposed Solution:

  1. Build a text extraction pipeline that:

    • Works with audio files directly, rather than relying on captions.
    • Uses techniques such as automatic speech recognition (ASR) to convert song audio into text (lyrics).
    • Optionally incorporates NLP post-processing to clean and format the extracted lyrics.
  2. Consider alternative APIs or services that might offer more comprehensive lyric extraction.
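A minimal sketch of how such a pipeline could be wired together, assuming captions are tried first and ASR is the fallback. The names `extract_lyrics`, `clean_lyrics`, `fetch_captions`, and `transcribe` are hypothetical and not part of this project; the ASR backend is passed in as a plain callable so any speech-to-text library or API could be plugged in. The cleanup rules (dropping `[Music]`-style tags and `♪` markers) are likewise only an assumption about what caption or ASR output tends to contain.

```python
import re
from typing import Callable, Optional

# Hypothetical backend signatures (assumptions, not this project's API):
#   CaptionFetcher: source -> caption text, or None when no captions exist
#   Transcriber:    source -> raw text produced by some ASR engine
CaptionFetcher = Callable[[str], Optional[str]]
Transcriber = Callable[[str], str]

_BRACKET_TAG = re.compile(r"\[[^\]]*\]")  # tags such as [Music] or [Applause]
_WHITESPACE = re.compile(r"\s+")


def clean_lyrics(raw: str) -> str:
    """Light post-processing: drop bracketed tags and note symbols,
    collapse runs of whitespace, and skip empty lines."""
    cleaned = []
    for line in raw.splitlines():
        line = _BRACKET_TAG.sub("", line)
        line = line.replace("♪", "")
        line = _WHITESPACE.sub(" ", line).strip()
        if line:
            cleaned.append(line)
    return "\n".join(cleaned)


def extract_lyrics(
    source: str,
    fetch_captions: CaptionFetcher,
    transcribe: Transcriber,
) -> str:
    """Use captions when they exist; otherwise fall back to ASR."""
    captions = fetch_captions(source)
    raw = captions if captions else transcribe(source)
    return clean_lyrics(raw)


if __name__ == "__main__":
    # Stub backends for illustration only; a real setup would call a
    # caption API and an ASR engine here.
    no_captions = lambda _source: None
    fake_asr = lambda _source: "[Music]\n  la   la la  \n\n♪ sing along ♪\n"
    print(extract_lyrics("song.mp3", no_captions, fake_asr))
```

Keeping the backends injectable means the caption path and the ASR path can be swapped or tested separately, and the existing manual-correction step can still run on whatever text comes out.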

Challenges:

Next Steps:

Milestone:

Labels:


chochinlu commented 1 month ago

Sure. Just do it! 👍

Producing correctly translated text can be a challenge. The project's current approach is simply to provide a text box where the user can correct the translation manually. 😆