pucardotorg / dristi_experiments

For Pucar Solutions Team
1 stars 4 forks source link

Transcription POC - RFC #7

Open ChakshuGautam opened 2 weeks ago

ChakshuGautam commented 2 weeks ago

TODO

Technology

Specifications

  1. File Name - <sessionID>.<length>.<original/modified>.<wav/txt>
  2. Session - sessionID, startTime, endTime
ChakshuGautam commented 2 weeks ago

https://github.com/suyashgautam to guide Sarvesh.

xorsuyash commented 2 weeks ago

cc @ChakshuGautam cc @GautamR-Samagra

Format to hit autotune api for force aligning audios in specific length

format of of transcription.txt must be like this Screenshot from 2024-06-14 13-38-07 audio name and space separated transcript

After creating workflow you we can force align using

             curl -X POST   -H "Content-Type: application/json"   -d '{"dataset":"xorsuyash/asr_datasetp2","workflow_id":"b23fe059-e941-4045-ad6c-bf9330e88455","save_path":"SamagraDataGov/asr_dataset_test_p9","transcript_available":"true","time_duration":5.0}'   https://autotune.dev.bhasai.samagra.io/v1/workflow/force-align

Here