OpenPecha / stt-split-audio

MIT License
0 stars 0 forks source link

STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #32

Open jim-gyas opened 1 week ago

jim-gyas commented 1 week ago

Description:

Develop a process that improves the quality of inference transcriptions for audio files using Claude AI by aligning them with a verified transferred text. The transferred text is known for its spelling accuracy, while the inference transcription captures the audio's context more effectively. The goal is to create a corrected transcription that combines the strengths of both. The process will involve generating inference transcriptions, feeding both inference transcriptions and transferred text into Claude AI, and using its output to replace the transferred text in the original dataset.

Resources:

1) Inference Transcription and Transferred Text : news-catalog

Completion Criteria:

Implementation:

Image

Subtasks:

jim-gyas commented 6 days ago

API account has insufficient credit to access the Claude API.

Image

jim-gyas commented 5 days ago

@kaldan007 and @gangagyatso4364, could you please review my card?

gangagyatso4364 commented 5 days ago

add the step to make cer comparison between all three kinds of transcript generated at last.

gangagyatso4364 commented 5 days ago

looks good go ahead.