-
Hi there!
I looked through the corpuses and found that sometimes they are not 100% downloaded. Not sure, if the issue is with the downloading scripts. Below are some examples grepped from bookcorp…
-
## 🐛 Bug
Initializing a chat with gemma-2b, the process crashes and displays a stacktrace in the chat.
## To Reproduce
Steps to reproduce the behavior:
1. Unless previously done, download …
-
## 🐛 Bug
When applying ChatModule -> [Bug] Failed to set the allowed dynamic shared memory size to 73728
## To Reproduce
Steps to reproduce the behavior:
```
conda activate your-environm…
-
### System Info
System: Linux
Kernel: Linux version 5.15.0-1026-aws (buildd@lcy02-amd64-088) ( gcc (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0 )
Python version: 3.8.16
LangChain version: 0.0.157
##…
-
Domain: Scientific Articles
Not all articles posted on ArXiv are permissively licensed. Need to only do the CC ones.
Need to parse the LaTeX into plain text.
@craffel has already parsed some …
-
Hi, I tried to run this project on my PC. It was very slow, because it used CPU insted of GPU.
There was info:
`You are using config.init_device='cpu', but you can also use config.init_device="meta…
-
Hi,
Thanks for the great repo. I wanted to know whether the data/model used for training the 154M parameter model for calculating the perplexity is available?
-
## 🐛 Bug
I am trying to optimise the `Qwen/Qwen1.5-4B-Chat` model. As I have only 8GB RAM on my MAC M1, I use 3bit quantisation and a really small prefill chunk size = 2048. I get the following err…
-
I have been trying to train Llama2 with RedPajama through Data Mixture for good randomization. Out of box it doesn't work: the cluster keeps waiting for some chunks, while a few millions docs have bee…
-
### Bug Description
When running this code:
```python
# Initialize the ApiClient globally
configuration = pos_client.Configuration(host=f"http://{IP}:{PORT}")
api_client = pos_clien…
hra42 updated
6 months ago