benob / recasepunc

Model for recasing and repunctuating ASR transcripts
BSD 3-Clause "New" or "Revised" License
129 stars 20 forks source link

Low puncatuation accuracy for French #3

Open BMouhcine opened 2 years ago

BMouhcine commented 2 years ago

Hello, Thanks for the work done here. I tried to punctuate a text written in French, but the output result wasn't too accurate. How can I improve the results? Thanks.

benob commented 2 years ago

You need to be more specific about your issue. What kind of dataset did you process? What accuracy are you expecting on that genre? What is the accuracy of typical baselines on that data (such as hidden-event n-gram language model)? Are there some errors in the logs that presume of a bug?