-
I'm not sure at the moment where the parse failure is happening, but other validators confirm the file (from Unicode) is valid XML.
(rename file from `ee.txt` to `ee.xml` because GH doesn't like …
-
https://github.com/tesseract-ocr/tesseract/issues/648#issuecomment-271987456
>Indic may be troubled by the length of the compressed codes used.
@theraysmith Can you explain a little more about t…
-
This error seems to be happening easily with certain scripts. In the example below, it's reproduced after adding Devanagari.
## Steps to Reproduce
1. Execute `flutter run` on the code sample on …
-
This is a spinoff of comments in https://github.com/scylladb/scylla/issues/5273 and https://github.com/scylladb/scylla/pull/4528:
As noted in https://github.com/scylladb/scylla/issues/5273, using `…
-
As per a conversation in [the antlr-discussions Google group](https://groups.google.com/g/antlr-discussion/c/-g5CF0MuUBk), there is confusion as to use of the grammars in this Github repo that are mis…
-
Hi,
I plan to use phoneme level embedding instead of char level embedding to train a model on my English custom dataset.
As far as I've observed from the codes, there is no available phoneme level …
-
### Summary
I have a large text file generated by a software which does not have line breaks. Being a console text editor I expected it to outperform all other in term of speed. However it proved to …
-
Grab what you want, mark it and make a PR. These exercises need updates:
- [ ] grade school (@Elahi-cs )
- [ ] matching-brackets (@sz245 )
- [ ] protein-translation (@vaeng) #912
- [ ] high-sco…
vaeng updated
3 weeks ago
-
Hi. I was able to train an italian model almost perfectly with the exception of few words that are intrinsecally ambiguous without context. Since your model is similar to the bert transformer what do …
-
As detailed in https://github.com/latex3/latex2e/issues/987#issuecomment-1570309342 and alluded to in https://unicode-org.atlassian.net/browse/ICU-12845#icft=ICU-12845, breathing marks need special tr…