-
Sometimes it is desirable to be able to say that a token is in a language different from the main language of the file, and to specify the foreign language. Some corpora have occasional code switching…
-
I'm using the package to parse Tigrinya. I tried `l3.anal("ti", word)` but encountered this error. `l3.anal("am", word)` works.
-
I saw the great idea for combined models here:
https://stanfordnlp.github.io/stanza/combined_models.html
Is there a process to request more of these? Specifically I was thinking of Hebrew right …
-
I'm currently storing raw sentences at index time like this: `raw = " ".join([j.raw for j in tokens])`, which has been fine up to now, but we now want to pull tokens from sentences. This method does n…
-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-5.14.0-427.22.1.el9_4.x86_64-x86_64-with-glibc2.34
- Python version: 3.11.10
- Huggingface_hub version: 0.26.1
- Safetensors ver…
-
I am trying to run coreference resolution with
`python heb_pipe.py -c example_in.txt`
but I get this:
```
# text = עיפרון עיפרון הוא כלי כתיבה ידני לשם כתיבה וציור, לרוב על דפי נייר. העיפרון מורכב ממוט …
-
Post questions here for one or both of this week's orienting readings:
Evans, James and Pedro Aceves. 2016. [“Machine Translation: Mining Text for Social Theory”](https://www.annualreviews.org/doi/abs/…
-
### Your question
Hey,
First of all, great work; this library is exactly what I was looking for.
One thing that would be awesome is the ability to use embedding models; we all know that open source …
-
This issue is to discuss the current shortcomings regarding Arabic script and how, or whether, they can be resolved given our current architecture.
[Emad Mohamed](http://www.cs.qatar.cmu.edu/users/emohamed) …
-
Post questions here for one or both of this week's orienting readings:
Michel, Jean-Baptiste et al. 2010. “[Quantitative Analysis of Culture Using Millions of Digitized Books](http://www.sciencemag.org…