sul-dlss / dlme-airflow

This is a new repository to capture the work related to the DLME ETL Pipeline and establish airflow
Apache License 2.0
1 stars 0 forks source link

Test Persian auto transliterations #563

Open jacobthill opened 2 weeks ago

jacobthill commented 2 weeks ago

https://pypi.org/project/PersianG2p/

If the above library works well, we can automatically generate Persian transliterations which could benefit the Golistan project and DLME generally. This would involve adding a new task to our airflow workflow. The task would not apply to all DAGs, only those with Persian script metadata. Since we do the language tagging in traject, we would need to implement this after the transform task.