us / Hark

Hark is a real-time speech-to-text tool that seamlessly integrates with GPT-4, offering multi-language support, meeting summaries, and a user-friendly interface for efficient audio-to-text conversions. 🚀
Other
10 stars 0 forks source link

Hark: The Ultimate Real-Time Speech-to-Text-to-LLM* 🚀

forthebadge

screenshot of app

chatgpt cheat

Hark is your new favorite gadget for turning live audio into text, all while mingling with OpenAI’s GPT-4 for some extra brainpower! Whether you're capturing epic meetings or casual chats, Hark’s got you covered with its slick features and nerdy charm.

🌟 Key Features

🚀 Getting Started

Gear up and get ready to roll! Make sure you have Node.js and Yarn installed on your machine.

Installation

  1. Clone the repo:

    git clone https://github.com/us/hark.git
    cd hark
  2. Install dependencies:

    npm install yarn
    yarn

🎧 Usage Guide

Audio Input Setup

macOS (OS X)

To capture system audio on macOS, grab BlackHole—a nifty virtual audio driver.

  1. Install BlackHole: Download and install BlackHole.

  2. Create a Multi-Output Device:

    • Open Audio MIDI Setup.
    • Hit the + button and choose "Create Multi-Output Device."
    • Add your speakers and BlackHole to this device.
    • Set it as your system audio output.
  3. Set BlackHole as Input:

    • In Hark, select BlackHole from the audio input device dropdown.

Windows

To achieve a similar setup on Windows, use Voicemeeter.

  1. Install Voicemeeter: Download and install Voicemeeter.

  2. Configure Voicemeeter:

    • Open Voicemeeter.
    • Set Hardware Input 1 as your default microphone and send it only to B.
    • Also, send the virtual input to both A and B (with A for hearing through your default speakers and B for virtual output).
    • Set Hardware Out A1 as your default output, typically your system speakers.
    • Double-check the Windows sound settings in the system tray to ensure Voicemeeter hasn’t changed your default speaker output. (Keep your sound output as your default device, not voicemeeter!)
  3. Configure Audio in Google Meet and Hark:

    • In Google Meet, set the input as your default mic and output as Voicemeeter Input.
    • In Hark, choose B1 as the input device.

Run the Application

Fire up your local server with:

yarn dev

Then check out your app at http://localhost:3000.

🔧 Configuration

đź“ś How It Works

  1. Select Audio Device: Choose BlackHole to capture system sound.
  2. Start Recording: Hit "Start Recording" to capture and transcribe audio in real-time.
  3. Language Selection: Pick your preferred language from the dropdown.
  4. Ask GPT-4: Use "Answer the Latest Question" to get smart responses from GPT-4.
  5. Summarize Meeting: Click "What’s This Meeting About?" for a quick summary of your discussion.
  6. Stop Recording: End the session with "Stop Recording."
  7. Clear Results: Hit "Clear" to reset and prep for the next session.

Gollum Image

đź”® Future Features

🔍 Final Checklist Before Using Hark

Before you dive into using Hark, make sure you've completed these steps for a seamless experience:

  1. Audio Routing: Ensure that audio routing is correctly set up with BlackHole (or a similar virtual audio driver). BlackHole captures system audio, allowing Hark to process sound from other applications.
  2. Input Device Configuration: Verify that BlackHole is selected as the input device within Hark. This ensures the app captures all system sounds accurately.
  3. API Key Setup: Enter your OpenAI API key in app.js to enable GPT-4 interactions.
  4. Model Selection: Choose the appropriate GPT-4 model for your needs.
  5. Application Testing: Start listening with Hark, and test by asking questions to ensure everything works as expected.

By following these steps, you ensure that Hark is fully functional and ready to provide a smooth, real-time transcription and interaction experience.

🤝 Contributing

Got ideas or want to help out? We’re all ears! Submit a pull request or open an issue to join the fun.

How this will help you:


“Sharing knowledge is the most fundamental act of friendship. Because it is a way you can give something without losing something.”

— Richard Stallman

đź“ś License

This project is licensed under GLWTPL (GOOD LUCK WITH THAT PUBLIC LICENSE)

⚠️ Disclaimer

It is built for educational purposes only. If you choose to use it otherwise, the developers will not be held responsible. Please, do not use it with evil intent.

📬 Contact

Questions, suggestions, or just want to chat? Hit us up at rahmetsaritekin@gmail.com.