-
Hi!
Currently the tokenizer uses this:
```javascript
lunr.tokenizer.separator = /[\s\-]+/
```
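For reference, here is a rough sketch of what that separator does to a string containing hyphens and an apostrophe. It uses plain `String.prototype.split` rather than lunr itself, and `splitTokens` is a hypothetical helper for illustration, so it only approximates the splitting step and not lunr's full pipeline.

```javascript
// Plain-JS sketch; does not load lunr. The regex is the separator quoted above.
const currentSeparator = /[\s\-]+/;

// Hypothetical helper: split on the separator and drop empty strings.
function splitTokens(text, separator) {
  return text.split(separator).filter((token) => token.length > 0);
}

const tokens = splitTokens("state-of-the-art don't stop", currentSeparator);
console.log(tokens);
```

With this separator the hyphenated words are split apart, while `don't` stays a single token because the apostrophe is not in the character class.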
It works okay, but I think it would be a good idea to add apostrophes too:
```javascript
l…
```
-
We switched production from Sphinx 3.1 to the Manticore 3.1.2 47b6bc2c@190822 release and ran into a very strange bug.
Query: `SELECT * FROM `bulletins` WHERE ((price
-
The variable `token_list` is null after the tokenizer is done.
I can't parse the tokens if I only get NULL.
-
PEP 508 says that a `version_many` looks like `version_one (wsp* ',' version_one)*` and the documentation comment in `_parse_version_many` says that a `version_many` looks like `(SPECIFIER (WS? COMMA …
-
Hello,
First of all, thanks for developing this tool and making it available! I'm trying to use CrisperWhisper to annotate a naturalistic language production experiment in German. The files are 1mn…
-
### What happened + What you expected to happen
Testing hyperparameter search with Hugging Face and wandb. I copied the code from [this](https://huggingface.co/blog/ray-tune) guide and added a wandb …
-
### System Info
GPU 2* A30, TRT-LLM branch main, commit id: 66ef1df492f7bc9c8eeb01d7e14db01838e3f0bd
### Who can help?
_No response_
### Information
- [x] The official example scripts
- [ ] …
-
The tokenizer module seems to be incorrectly modifying AWS S3 presigned URLs. The request made in the browser (via Network dev console) does not match what was input into the field. Same presigned UR…
-
Platform: MTL iGPU, 64G DDR5, ubuntu 22.04
[test_glm-4v-9b.zip](https://github.com/user-attachments/files/17161208/test_glm-4v-9b.zip)
In the attachment, convert_ipex_model.py is for converting the …