-
### Describe the bug
Image upgrade to 0.12.1, running Qwen1.5-14B-Chat-GPTQ-Int4 is much slower compared to 0.11.0.
### To Reproduce
docker image has been upgraded to 0.12.1, which is much slower…
-
datax 注意力大部分是reader/writer, datax的设计核心在于星型结构,即,reader/writer可以随意撤换,中间exchanger作适配,Transfomer应该是最重要的角色,但datax本身Transfomer比较简单,本组使用xql/规则实现Transfomer构建,适配reader/writer,目前以图库为writer,但可作为探索,实现低代码datax
…
-
As part of our public API, we support the ability to filter out parts of the estimator chain. The APIs that support this are not documented, nor do we provide any context in our Cookbooks. As we are d…
-
http://rythmengine.org/doc/user_defined_transfomer.md
-
transfomers版本太旧
-
Remove pinned transfomer version from generate_conda_files
https://github.com/interpretml/interpret-text/blob/97416a0a9cc3e60bcb1221f878577762c64df02e/tools/generate_conda_files.py#L65
-
### System Info
`node:alpine-18 Docker Container`
### Environment/Platform
- [ ] Website/web-app
- [ ] Browser extension
- [X] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Desktop app (e.g…
-
The Transfomer code in MLPerf is a bit outdated, and has a BLEU score lower than the official TensorFlow version:
https://github.com/tensorflow/models/tree/master/official/transformer
-
-
In transfomer_dataset.py line 250, if after "B-" label is "B-", then will merge two entities of different types into one. Is there a data processing error here?