llama3 Search Results - Githubissues

1000+ results for llama3

Best match


phidatahq/phidata #1347

agent-api fails with gemini models

A sample agent with the following configuration fails when called through the /run endpoint at localhost:8000/docs, with the following error. ``` return Agent( name="Gemini Agent", agent_id="Ge…

anandanand84 updated 2 weeks ago
3
MeetKai/functionary #258

Finetuning

Hi! I'm a beginner to all of this. Can someone direct me how to finetune the v3 model? I saw #99 on how to structure the dataset https://github.com/MeetKai/functionary/blob/main/tests/test_case_v2.jso…

sjay8 updated 2 months ago
1
ollama/ollama #7364

Data persistence

I love that I can load extensive public domain resources directly from the internet into the sessions and add hundreds of thousands of data points. I can then run knowledge graph optimizations, as wel…

multiplicity-16 updated 3 weeks ago
1
Chainlit/chainlit #1499

Chat history is not getting retrieved on the chat resume

Hi I'm saving the chat history to the postgresdb through data layer but when I'm doing the chat resume the history is not getting loaded but the chat title is coming up in the side bar and chat is a…

ashish6ai updated 1 week ago
1
LlamaFamily/Llama-Chinese #330

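The llama3 8B Chinese fine-tuned model Llama3-Chinese-8B-Instruct keeps repeating its generated answers

Whatever question is asked, the model keeps repeating its own answer until it reaches the model's max_token.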

Ryan-0805 updated 3 months ago
9
vllm-project/vllm #4646

[Bug]: Setting best_of and n in SamplingParameters makes the…

### Your current environment ```text Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A …

Some-random updated 3 weeks ago
1
noamgat/lm-format-enforcer #145

RAM UTILISATION IS INCREASING RAPIDLY

To force the model to respond in JSON format, I am using the ExLlamaV2TokenEnforcerFilter and ExLlamaV2PrefixFilter classes, appending them to the filters list and passing it as filters for generating ou…

UTSAV-44 updated 1 month ago
2
fe1ixxu/ALMA #42

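OOM issue; the GPU is an A00 40G

With llama factory, SFT can fine-tune the llama3-8B model using deepspeed zero2, but this framework reports OOM with deepspeed zero2 even when the batch is set to 1. Training with zero3 becomes very slow, and this problem appears: 2 pytorch allocator cache flushes since last step. this happens when there is hi…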

gongye19 updated 1 week ago
5
CrazyBoyM/llama3-Chinese-chat #35

After deploying shareAI's V2 version as a web app, why is it gpt3.5?

shareAI series: base pre-training + direct Chinese SFT version: V2 version modelscope: https://modelscope.cn/models/baicai003/Llama3-Chinese_v2/summary

zhentouzhanshi updated 6 months ago
1
swarmauri/swarmauri-sdk #293

[Feature Research]: MiniCPM-v2.5

### Feature Name MiniCPM-v2.5 ### Feature Description Research about MiniCPM-v2.5 ### Research Findings MiniCPM-v2.5 is a Chinese language model developed by the Beijing Academy of Artificial Int…

abdulsamodazeez updated 1 month ago
1

