ognistik / km-ai-memos

Dictate with Just Press Record and transcribe with Whisper AI using Keyboard Maestro
3 stars 0 forks source link
ai creativity dictation memos note-taking productivity transcription

AI MEMOS

A Just Press Record & Whisper AI Automation

~ Video Overview & Setup ~

WHAT IS IT

AI Memos is a powerful automation that allows you to capture ideas, take notes, and create content effortlessly using voice dictation. It combines the ease of recording with Just Press Record, the accuracy of OpenAI's Whisper for transcription, and the processing power of LLM models for text refinement. With AI Memos, you can dictate on any Apple device and have your thoughts automatically transcribed, processed, and added to your preferred note-taking app. This tool is perfect for anyone who wants to streamline their note-taking process, capture fleeting ideas, or create content on-the-go without the hassle of manual transcription or editing.

REQUIREMENTS

  1. OpenAI API token (SIGN UP HERE | GET IT HERE)
  2. Keyboard Maestro with the latest AI Memos macro
  3. Latest AI Memos Shortcut
  4. Just Press Record

SETUP

Shortcut Setup

  1. Insert your OpenAI API token in the Shortcut setup.
  2. If you use Bear, you can test the Shortcut immediately by dictating a note WITH THE SHORTCUT.
  3. If you don't use Bear, modify the Shortcut to use your preferred note-taking app that integrates with Shortcuts.
  4. Run the Shortcut once with a short transcription to ensure it's working correctly.

Optional Shortcut Configuration

Keyboard Maestro Setup

  1. This should work out of the gate, but ensure the correct location of the Just Press Record folder on iCloud for automatic triggering.
  2. The macro can be triggered manually if preferred. In this case, the macro will process oldest all the audio files starting from the oldest.
  3. You must run the macro manually once for the creation of the necessary files and directory structure in the Shortcuts/AI Memos folder.
  4. Set up any text replacements in the replacements.json file in the Shortcuts/AI Memos Folder. Replacements are case-dependent.
  5. Go ahead and record with Just Press Record!

ADDITIONAL NOTES


Using AI Memos

In actual use, AI Memos is as simple as recording with Just Press Record in any Apple device. Then, just wait a bit and as soon as your system detects the audio file, it will transcribe it. It also works fine for multiple audio files, even if they are all synced at once. If your file was processed but you don't see it as a note, make sure you are not in List Mode.

List Mode

Voice Commands

You may start your audio recording with the following commands:

All voice commands—except for "Help me" or "Listen"—can be dictated both alone or with additional content. For example, you can say "Start list" followed by your text, and everything after "Start list" will be added to the list. Similarly, you can say "Process list," and any text you dictate after that will be processed as a separate note, not as part of the list. There are more options for ongoing or paused lists in the Shortcut itself (tap it on your phone to check them out), but these only work if your files have synced properly to your device.

Troubleshooting

  1. Ensure that the macro is being triggered automatically. If not, it means that the folder trigger is not pointing to the correct location.
  2. Ensure that the yellow-colored actions are pointing both to the JPR folder, and to the AI Memos folder in the Shortcuts folder. Note that the paths do not start with ~, this is set at the scripts level.
  3. The blue/purple action block is the waiting time value which is related to your Internet connection. Keyboard Maestro will only be triggered when a file or folder is detected at the base level, but Just Press Record places files in subfolders. So, Keyboard Maestro detects the folder creation from JPR and waits then X amount of seconds for a file to appear.
    • Default: 30 seconds
    • Faster/Stable internet: Can be reduced (in the video sample above I had it at 5 seconds)
    • Slower internet: May need to be increased

Closing

One of my main uses for this automation is capturing highlights from physical books. You can read more about how I use it with Readwise on my Substack. But I also use it all the time when I'm walking, driving, or going about my day and need a quick way to capture thoughts. I hope many of you find this useful.

If you have any issues, ideas, or want to contribute, please open an issue on GitHub. I'm always happy to hear your feedback. I'm not making any money from this tool, so if you find it helpful, I'd be incredibly grateful if you'd buy me a coffee!

Buy Me A Coffee