SuryaKrishna02 / maya-dataset-creation

The Repository contains the code for dataset creation for the Training the Maya: Multilingual Aya Model
MIT License
1 stars 2 forks source link

Translation Script Efficiency and Modularization #1

Open SuryaKrishna02 opened 2 months ago

SuryaKrishna02 commented 2 months ago
  1. Analyze the code to find out the computation bottlenecks and make them more efficient.
    • Separating out the Preprocessing and post processing from API Calling.
  2. Make the code flexible to run for different languages with minimal changes and ideally with a config file.
  3. Modularize the script for reusability.