Arabic words (non-ascii characters) training data

Currently the training data looks like this

nawar-spec-00001.npy|nawar-mel-00001.npy|1223| وَرَجَّحَ التَّقْرِيرُ الَّذِي أَعَدَّهُ مَعْهَدُ أَبْحَاثِ هَضَبَةِ التِّبِتِ فِي الْأَكَادِيمِيَّةِ الصِّينِيَّةِ لِلْعُلُومِ - أَنْ تَسْتَمِرَّ دَرَجَاتُ الْحَرَارَةِ وَمُسْتَوَيَاتُ الرُّطُوبَةِ فِي الْإِرْتِفَاعِ طَوَالَ هَذَا الْقَرْنْ

Do you think that using phonetised words instead of arabic words would make training easier?

An example of training data of phonetised words is shown below which is similar to cumdict phonetised words

nawar-spec-00001.npy|nawar-mel-00001.npy|1223| W R A0 J A E A L T AE Q R E2 ..........

Thank you

keithito / tacotron

Arabic words (non-ascii characters) training data #195