-
https://lmsys.org/blog/2023-06-29-longchat/
https://arxiv.org/abs/2305.07185
https://www.reddit.com/r/LocalLLaMA/comments/14fgjqj/a_simple_way_to_extending_context_to_8k/
https://github.com/epfml…
-
Hello so i was fine-tuning a llama-2 model with unsloth using a tokenizer of my own, it has an extended vocabulary of around 48000 tokens in total, the tokenizer is compatible and checks have been mad…
-
This is a ticket to track a wishlist of items you wish LiteLLM had.
# **COMMENT BELOW 👇**
### With your request 🔥 - if we have any questions, we'll follow up in comments / via DMs
Respond …
-
I'm getting the following error when running `./train_gpt2cu` after building using `make train_gpt2cu USE_CUDNN=1`
```num_parameters: 124475904 ==> bytes: 248951808
allocated 237 MiB for model par…
wfoy updated
3 months ago
-
### Ticket Contents
Belongg is developing BelonggAI, a tool that will help development practitioners, researchers, funders, etc analyze their proposals, program documents, policy documents, etc to …
-
Google Colab (short for "Colaboratory") is a free cloud-based platform provided by Google that allows users to write and execute Python code in a Jupyter notebook environment. It is particularly popul…
-
Hi, I am trying to finetune LLaVA-NeXT with my custom dataset, using "finetune_clip.sh" shell file.
I gave some edits to the shell for my convenience and to satisfy my task so far, like this:
```
…
-
# Question Details
Hello, I encountered an error while using cmake. My system is Windows 10 with Python 3.11 and NVIDIA 3060. Below is the content of the error report.
And I have correctly install…
-
Thesis defense target: 21 June 2024. Survey target: end of July 2023.
Would like to have a fresh master thesis topic, not incremental improvement of other thesis work.
Starting roughly Q1 2023 or su…
-
Hi there. I'll preface this by saying I'm likely failing to understand something about how these logprobs calculations work.
So -- why is acc_norm consistently higher than acc? They're just ways of…