AI4Bharat / IndicTrans2

Translation models for 22 scheduled languages of India
https://ai4bharat.iitm.ac.in/indic-trans2
MIT License
217 stars 59 forks source link

Bug during Translation #23

Closed saifulhaq8 closed 11 months ago

saifulhaq8 commented 11 months ago

IndicTrans2 hangs up while translating the following line:

"About $5 in the drugstore & the product will last for months.It doesn't hurt/burn or any of that jazz.--------------------------------------- Â\xad------."

I diagnosed the reason to be "--------------------------------------- Â\xad------."

Can we do something about this?

Adding more details:

I did some experiments to find out the cause (note the full-stop):

  1. "About $5 in the drugstore & the product will last for months.It doesn't hurt/burn or any of that jazz." gets successfully translated.
  2. "--------------------------------------- Â\xad------." gets successfully translated.
  3. ".--------------------------------------- Â\xad------" gets successfully translated.
  4. ".--------------------------------------- Â\xad------." gets successfully translated.
  5. "About $5 in the drugstore & the product will last for months.It doesn't hurt/burn or any of that jazz--------------------------------------- Â\xad------." gets successfully translated.
  6. "About $5 in the drugstore & the product will last for months.It doesn't hurt/burn or any of that jazz.--------------------------------------- Â\xad------" hangs up.
  7. "About $5 in the drugstore & the product will last for months.It doesn't hurt/burn or any of that jazz.--------------------------------------- Â\xad------." hangs up.

in the 5th experiment, i removed the full stop between "jazz" and "-". in the 6th experiment, i removed the full-stop at the end of the line.

Hope this helps in resolving the bug.

PranjalChitale commented 11 months ago

Hi @saifulhaq8 this issue should be resolved now. Thank you.

saifulhaq8 commented 11 months ago

Thank you, the bug got resolved!