Tried out various input sentences. In pretty much every sentence, often in verbs, the last letter remains without diacritization.
But more importantly, every so often, a word gets disambiguated completely different to what the Camlira Website would do, and the weightings are also different.
Example:
Input: وهي مدرسة
Output: وَهِيَ مَدْرَسَةٌ
Camelira Website Output: وَهِيَ مُدَرِّسَةٌ
For some words, not only are the weightings or the chosing between two 1.0 results different, but the analysis is completely different.
Example:
Input: مهمة
Output: مَهَمَّةً
Camelia Website Output: 30 versions of مُهِمَّةٌ; mahammah is not once included.
Thanks in advance for your help. I'm a CS Student and have been interested in linguistics and Arabic for a few years now; I'm a big fan of your work. This would really help me.
I installed the project. Did all things.
Used the example code from https://camel-tools.readthedocs.io/en/stable/api/disambig/bert.html#examples.
Tried out various input sentences. In pretty much every sentence, often in verbs, the last letter remains without diacritization.
But more importantly, every so often, a word gets disambiguated completely different to what the Camlira Website would do, and the weightings are also different.
Example: Input: وهي مدرسة Output: وَهِيَ مَدْرَسَةٌ Camelira Website Output: وَهِيَ مُدَرِّسَةٌ
For some words, not only are the weightings or the chosing between two 1.0 results different, but the analysis is completely different.
Example: Input: مهمة Output: مَهَمَّةً Camelia Website Output: 30 versions of مُهِمَّةٌ; mahammah is not once included.
Thanks in advance for your help. I'm a CS Student and have been interested in linguistics and Arabic for a few years now; I'm a big fan of your work. This would really help me.
Windows 10, Python 3.9