-
Tag: 2.1.3
https://github.com/sebastianbergmann/comparator/pull/30/files
Missing changes, failing on PHPUnit
-
I am not sure but it seems that there are conflicts when it gets:
https://raw.githubusercontent.com/metanorma/metanorma-build-scripts/master/gemfile-to-bundle-add.sh
```
Running with gitlab-r…
-
### System Info
Transformer Version: 4.20.1
Python : 3.8
ubuntu : 18.04
### Who can help?
@patil-suraj
### Information
- [ ] The official example scripts
- [X] My own modified scripts
##…
-
ThaiWordFilter is an offender in TestRandomChains because it creates positions and updates offsets.
---
Migrated from [LUCENE-4984](https://issues.apache.org/jira/browse/LUCENE-4984) by Adrien Gran…
-
### System Info
- `transformers` version: 4.22.0.dev0
- Platform: Linux-5.10.133+-x86_64-with-debian-bullseye-sid
- Python version: 3.7.12
- Huggingface_hub version: 0.8.1
- PyTorch version (GPU?…
-
## 🚀 Feature Request
Fast forwarding our on-the-fly tokenizer can be very slow when our data shards are very large, taking over an hour in some cases.
One easy solution is to just chop the data in…
-
ICUNormalizer2CharFilter is fast most of the times but we've had some report in Elasticsearch that some unrealistic data can slow down the process very significantly. For instance an input that consis…
-
I've been taking a peak at The Pile + C4 which are huge beefy English based datasets.
I also noted that The Pile has streaming support using HF datasets, and if that works that might be a game chan…
-
I've a dataset of about 1.1mil records that I'm trying to apply the sparknlp pipeline on for topic modelling. In each document there's on average 1-5 short sentences since I've pre-cleaned t…
-
# 🐞 bug report
### Affected Rule
The issue is caused by the rule: `container_push`
### Is this a regression?
no
### Description
In Cloud Build, we run the `container_push` r…