-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Traceback (most recent call last):
File "···/chinese_sentiment-master/data/hotel_comment/raw_data/fix_coupus.py", line 30, in
fix_corpus(POS, FIX_POS)
File "···/chinese_sentiment-master/da…
-
I was reading about [lz4](https://code.google.com/p/lz4/) and noticed they use a [specific dataset](http://sun.aei.polsl.pl/~sdeor/index.php?page=silesia) for the published benchmarks.
It looks like …
-
This issue isn't a problem to be fixed, rather it is a record to keep track of undergraduate student's work on TREC 2024. In a series of comments, contributors can write about their work (at a high le…
-
En SPARQL-endpoint hos Riksdagens data är önskvärt. SPARQL integrerar enkelt i Wikipedia med att listor etc, kan skapas som uppdateras se äldre [test med Nobelprize.org](https://www.wikidata.org/wiki/…
-
**Is your feature request related to a problem? Please describe.**
When I'm working with a corpus that is a mixture of documents in American English and British English spelling, the two versions of …
-
Right now, when using the data loader:
```python
corpus, queries, qrels = GenericDataLoader(data_dir).load(split=split)
```
Tqdm will always show up. there should be a way to disable it, e.g.:
…
-
When getting vault counts for a large number of users with large mailboxes using gam 6.80.11, the errors come back as a separate json rows, and the rows with the email shows a count of 0 (which is not…
-
`KneserNeyInterpolated.generate()` takes too long to run.
Consider the following example:
```python
from nltk.corpus import brown
from nltk.lm.preprocessing import padded_everygram_pipeline
f…
-