Mubashirshariq / music_transcription

music_transcription
GNU General Public License v3.0
2 stars 1 forks source link

Musical Transcription and assistance via Generative AI

This repository contains a web application for music transcription. The frontend is built with React, and the backend is powered by FastAPI. The application allows users to record audio or upload audio files, and then it transcribes the audio into MIDI files.

Features

Tech Stack

Setup Instructions

Prerequisites

Frontend Setup

  1. Clone the repository:

    git clone https://github.com/yourusername/music_transcription.git
    cd music_transcription
  2. Navigate to the frontend directory:

    cd frontend
  3. Install the dependencies:

    npm install
  4. Start the frontend development server:

    npm run dev

Backend Setup

Now create one more terminal to run the backend

  1. Navigate to the backend directory:

    cd music_transcription
    cd backend
  2. Create a virtual environment:

    python -m venv myenv
  3. Activate the virtual environment:

    • On Windows:
      myenv\Scripts\activate
    • On macOS and Linux:
      source myenv/bin/activate
  4. Install the dependencies:

    pip install -r requirements.txt
  5. Run the FastAPI server:

    uvicorn main:app --reload

Usage

Once both the frontend and backend servers are running, you can access the application by navigating to http://localhost:5173 in your web browser.

Example Usage

  1. Record audio or upload an audio file using the web interface.
  2. The backend will process the audio and transcribe it to a MIDI file.
  3. The midi file is saved in MIDI folder inside the backend folder as of now.

License

This project is licensed under the GNU Public License. See the LICENSE file for details.

Acknowledgments