-
它有两种接口`/v1/completions`和`/v1/chat/completions`,前者自定义渠道也不能用,不知道该如何添加渠道?
```
curl --request POST \
--url https://api.fireworks.ai/inference/v1/completions \
-H 'Accept: application/json' \
-H…
-
Hi friends,
I was trying to test the finetune/finetune.py script. It seems that state.best_model_checkpoint always return None leading to a failure at the end of the program. Is it that the program…
-
how to change this code to make it work on 2 11G 2080ti?:
```# make sure you have accelerate and bitsandbytes installed
from transformers import AutoModelForCausalLM, AutoTokenizer
tokenizer = Au…
-
In the published paper on FunSearch, there is a mention of using pre-trained large language models (LLMs) like Codey (based on the PaLM2 model family) and a reference to StarCoder, an open-source LLM,…
-
Hi all, this issue will track the feature requests you've made to TensorRT-LLM & provide a place to see what TRT-LLM is currently working on.
Last update: `Jan 14th, 2024`
🚀 = in development
#…
-
Cursor has a feature were you add a file named _.cursorrules_ with specifics to musts and don'ts for your specific codebase. It is applied as further context and allows for more concise and precise an…
-
**Describe the bug**
Getting segmentation fault on mac M1
```
~ tabby serve --device metal --model TabbyML/StarCoder-7B
2024-04-06T07:43:21.506323Z INFO tabby::serve: crates/tabby/src/serve.r…
-
please tell me hot wo generate the following files:
# WEIGHTS_TRAIN=/fsx/loubna/code/bigcode-data-mix/data/train_data_paths.txt.tmp
# WEIGHTS_VALID=/fsx/loubna/code/bigcode-data-mix/data/valid_data_…
-
Currently on 3 chat templates is present: https://github.com/TanvirOnGH/vscode-ollama-modelfile/blob/dev/snippets/modelfile.json#L37-L104.
## TODO Templates
- [x] ChatML (ccd461ac30c116110a7adda50…
-
Just to ensure that FIM is on the radar at jupyter-ai I leave this comment here.
FIM ([Fill-in-the-Middle](https://medium.com/@SymeCloud/what-is-fim-and-why-does-it-matter-in-llm-based-ai-53f333855…