-
I see the following condition:
https://github.com/huggingface/transformers/blob/16f0b7d72c6d4e122957392c342b074aa2c5c519/src/transformers/models/marian/convert_marian_to_pytorch.py#L462
While trai…
-
## 🚀 Feature Request
Provide a simple inference pipeline for the `wav2vec 2.0` model.
### Motivation
Current inference script `examples/speech_recognition/infer.py` handles a lot of cases, result…
-
**Name of the Spark NLP feature whose docs need improvement:**
Linear Chain CRF
**What you think the docs should say:**
Hi, I want to thank you for this great NLP project first.
I am new to N…
-
I think the current implementation returns keyphrases that are potential subsets of each other, that this is due to the use of `noun_chunks` and `ents`, and that this is not the desired output. Specif…
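One way to post-filter such output, as a minimal sketch rather than the library's own method: drop any keyphrase whose tokens appear as a contiguous run inside a longer returned keyphrase. The function name `drop_nested_phrases` and the whitespace tokenization are assumptions made for illustration; they are not part of the project discussed in this issue.

```python
def drop_nested_phrases(phrases):
    """Return phrases that are not contained, token-wise, in a longer phrase.

    Hypothetical helper: comparison is case-insensitive and uses plain
    whitespace tokenization, which is an assumption for this sketch.
    """
    # Deduplicate case-insensitively while keeping first-seen order.
    seen = {}
    for phrase in phrases:
        seen.setdefault(phrase.lower().strip(), phrase.strip())

    tokenized = {key: key.split() for key in seen}

    def contains(longer, shorter):
        # True if `shorter` occurs as a contiguous token run inside `longer`.
        m = len(shorter)
        return any(longer[i:i + m] == shorter
                   for i in range(len(longer) - m + 1))

    kept = []
    for key, original in seen.items():
        tokens = tokenized[key]
        nested = any(
            other != key
            and len(tokenized[other]) > len(tokens)
            and contains(tokenized[other], tokens)
            for other in tokenized
        )
        if not nested:
            kept.append(original)
    return kept


if __name__ == "__main__":
    candidates = ["machine learning", "learning",
                  "deep machine learning model", "graph"]
    print(drop_nested_phrases(candidates))
```

Matching on whole tokens rather than raw substrings avoids discarding a phrase such as "art" just because "smart grid" happens to contain those letters.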
-
Awesome library! From prior posts it seems like the error `__init__() got an unexpected keyword argument 'balanced_tree'` is related to SciPy. In the meantime, before SciPy is updated, is there any wo…
-
I am using the embeddings from the example at https://nlp.johnsnowlabs.com/2020/09/23/labse.html, and the output vectors, although close, are not equal to the original vectors (https://tfhub.dev/google/LaBSE/1). Why?
How o…
-
BPO | [31484](https://bugs.python.org/issue31484)
--- | :---
Nosy | @malemburg, @terryjreedy, @pitrou, @vstinner, @benjaminp, @ezio-melotti, @methane, @serhiy-storchaka, @zhangyangyu
PRs | python/cpyt…
-
I am comparing the performance of the most popular lemmatization tools. I have found benchmark results for [Stanza](https://stanfordnlp.github.io/stanza/v100performance.html), [Trankit](https://tranki…
-
release: october 2022
Wanted:
- Infrastructure:
- [x] cpython-3.10.6
- [x] cpython-3.11.0 (10% to 25% speed-up vs 3.10 on interactive Python applications)
- [x] github as a full second…
-
This issue tracks progress toward making hundreds, if not thousands, of Bible translations with machine alignments generally available. This has a few audiences:
* multi-corpora offline analysis
* Se…