-
### Describe the bug
Traceback (most recent call last):
File “D:\NEW_OOBA\oobabooga\oobabooga_windows\text-generation-webui\[server.py](http://server.py/)”, line 102, in load_model_wrapper
shared.m…
-
Hi,
Loved this paper and implementation. I implemented this for Phi2 with transformers==4.36.2 without caching. The outputs with in context size are even better at following instruction than actua…
-
Self extend is now supported for main: Link: https://github.com/ggerganov/llama.cpp/pull/4815
Link paper: https://arxiv.org/pdf/2401.01325.pdf
It would be great if it was also supported for the ser…
-
在使用源码中提供的test.source 以及下载的longlm_base模型 执行生成任务时会出现 RuntimeError: probability tensor contains either `inf`, `nan` or element < 0, 请问怎么解决呢?
-
您好,paper中提到WDC-Dialogue数据分别来源于社交平台的转发、网站论坛的评论转发、问答交流,请问能再分别详细说明下分别在哪些网站中通过什么方式采集的吗?
比如zhihu平台是什么入口,或者什么关键词搜索相关数据?
对这部分工作比较感兴趣,请帮忙说明下,谢谢~