I am interested in training my own Bangla language model, but I'm not sure where to start. I have my own Bangla dataset with audio and text, and I would like to use it to train a model that can transcribe Bangla speech to text offline. I am looking for guidance on how to preprocess my data, train a model, and evaluate its performance.
Can someone please provide detailed instructions or point me to a tutorial or guide that can help me with this process? Here are some specific questions I have:
What are the best practices for preprocessing Bangla audio and text data?
How do I create a Bangla language model and generate the necessary files for training a model?
What are the recommended training parameters and settings for training a Bangla language model ?
How do I evaluate the performance of my trained model, and what metrics should I use?
Installation guidelines from scratch.
I would appreciate any help or advice that can be provided. Thank you in advance!
I am interested in training my own Bangla language model, but I'm not sure where to start. I have my own Bangla dataset with audio and text, and I would like to use it to train a model that can transcribe Bangla speech to text offline. I am looking for guidance on how to preprocess my data, train a model, and evaluate its performance.
Can someone please provide detailed instructions or point me to a tutorial or guide that can help me with this process? Here are some specific questions I have:
I would appreciate any help or advice that can be provided. Thank you in advance!