-
Hi, guys. Thank you for your research. It is extremely interesting and valuable for the community, and I mean it! I am curious why didn’t you use lemmatization or stemming of the words prior to analys…
-
In create_hf_model, what's the purpose of resizing the model embedding?
model.config.end_token_id = tokenizer.eos_token_id
--
44 | model.config.pad_token_id = model.config.eos_token_id
…
-
### Elasticsearch Version
7.17.15
### Installed Plugins
smartcn, icu, kuromoji, nori
### Java Version
17.0.8 2023-07-18 LTS
### OS Version
Mac OS Sonoma 14.5
### Problem Description
I was r…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N…
-
Hi,I also have a problem with training TWOSOME in Tomato Salad environment `sh scripts/tomato_salad_ppo_llm.sh` and encountered the following error:
pygame 2.4.0 (SDL 2.26.4, Python 3.9.…
-
## Background
- here in mindspore, there is text part in dataset for doing data_augmentation in some nlp task
- in this process, we need some tokenizer to deal with origin text data, firstly tokeniz…
-
### System Info
Why is the performance worse than release 0.12.0 when I run the benchmark of release 0.13.0
(fp16,fp8,int8) llama2 70b
### Who can help?
_No response_
### Information
- [x] The of…
-
Repro:
```dart
import 'package:html/parser.dart' as html;
void main(List args) {
var dom = html.parse('', generateSpans: true);
print(dom.querySelector('div').attributeSpans);
}
```
…
-
### System Info
transformers.js 2.17.2
### Environment/Platform
- [X] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Desktop app (e.g., Electron)
- [ ] O…
-
It may be a problem with Plotly, and I should file an issue there.
However, I am observing a crash in the HTML sanitizer. It says `this.buffer.charCodeAt` is not a function.
The cause is relatively …