-
Is there any way to use Ollama to host LLM models?
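For context, Ollama does serve pulled models over a local REST API. A minimal sketch, assuming a default install (server on port 11434) and a pulled `llama2` model:

```python
import requests

# Query a locally running Ollama server (started with `ollama serve`
# after `ollama pull llama2`). Model name and prompt are placeholders.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama2", "prompt": "Why is the sky blue?", "stream": False},
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])
```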
-
Hi.
I have been doing some benchmarks on an NVIDIA V100 32GB GPU.
First, I benchmarked Llama2-7B-chat using Hugging Face Transformers and CTranslate2. I saw reduced latency when using CT2 (12 secon…
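For reference, a minimal sketch of how such a CTranslate2 run is typically set up (not necessarily the exact benchmark code; model paths, sampling settings, and the prompt are placeholders):

```python
import ctranslate2
import transformers

# Assumes a prior conversion step, e.g.:
#   ct2-transformers-converter --model meta-llama/Llama-2-7b-chat-hf \
#       --output_dir llama2-7b-chat-ct2 --quantization float16
generator = ctranslate2.Generator("llama2-7b-chat-ct2", device="cuda")
tokenizer = transformers.AutoTokenizer.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf"
)

prompt = "What is the capital of France?"
tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
results = generator.generate_batch([tokens], max_length=256, sampling_topk=1)
print(tokenizer.decode(results[0].sequences_ids[0]))
```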
-
Under Windows:
java -jar chat-console.jar --model llama2 --system
The result is:
Error: chat.octet.exceptions.ServerException: Can not read model configuration file, please make sure it is …
-
### Describe the feature
How should I construct a dataset to train on large amounts of long text with colossal-llama2-7B?
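The exact schema expected by the Colossal-LLaMA-2 data pipeline is defined in its repo; as a hedged illustration, one common way to prepare large amounts of long text is to split each document into context-sized chunks and write them as JSONL. The field name (`text`), chunk size, and file name below are assumptions, not the repo's confirmed format:

```python
import json

MAX_CHARS = 8000  # placeholder chunk size; tune to the model's context window

def chunk_document(doc: str, max_chars: int = MAX_CHARS):
    """Split a long document into consecutive fixed-size chunks."""
    return [doc[i : i + max_chars] for i in range(0, len(doc), max_chars)]

documents = ["...long document 1...", "...long document 2..."]  # placeholders

# Hypothetical JSONL layout with a single "text" field per sample;
# verify the actual schema against Colossal-LLaMA-2's documentation.
with open("long_text_dataset.jsonl", "w", encoding="utf-8") as out:
    for doc in documents:
        for chunk in chunk_document(doc):
            out.write(json.dumps({"text": chunk}, ensure_ascii=False) + "\n")
```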
-
![image](https://github.com/ljy0ustc/LLaRA/assets/47343901/4bd566e6-5dc8-4f80-8b19-c60e26b6c414)
Could you help me solve this problem? 👆
-
Reproduction steps:
1. Clone the vllm repo and switch to [tag v0.3.1](https://github.com/vllm-project/vllm/tree/v0.3.1)
2. Build the Dockerfile.rocm dockerfile with instructions from [Option 3: Bui…
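Once the image builds, a quick way to confirm the container works is to hit vLLM's OpenAI-compatible server. A minimal sketch, assuming the server was started inside the container (e.g. via `python -m vllm.entrypoints.openai.api_server --model <model>`) with port 8000 published; the model name is a placeholder:

```python
import requests

# Smoke test against a running vLLM container; assumes the
# OpenAI-compatible server is listening on localhost:8000.
resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "meta-llama/Llama-2-7b-hf",  # placeholder model name
        "prompt": "Hello",
        "max_tokens": 16,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["text"])
```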
-
### System Info
- `transformers` version: 4.41.0.dev0
- Platform: Linux-5.15.0-92-generic-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.21.4
- Safetensors version…
-
NeuronXXXModel classes (e.g. NeuronDecoderModel in optimum/neuron/modeling_decoder.py) invoke transformers-neuronx to compile the target model; however, these classes don't pass all the supported input …
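For context, a sketch of how compile-time inputs are passed through the documented export path today (argument names and values are illustrative and should be checked against the optimum-neuron docs):

```python
from optimum.neuron import NeuronModelForCausalLM

# Export + compile via transformers-neuronx; only a subset of the
# inputs supported by transformers-neuronx is exposed as kwargs here,
# which is the gap this issue describes. Values are placeholders.
model = NeuronModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    export=True,
    batch_size=1,
    sequence_length=2048,
    num_cores=2,
    auto_cast_type="fp16",
)
```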
-
When using an LLM for an NER task, there is a warning saying "This is a friendly reminder - the current text generation call will exceed the model's predefined maximum length (4096). Depending on the mode…
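One common mitigation, sketched below, is to budget tokens so that the prompt plus the requested generation stays under the 4096-token window. The model name and budget values are placeholders:

```python
from transformers import AutoTokenizer

MODEL_MAX_LEN = 4096   # the model's predefined maximum length
MAX_NEW_TOKENS = 512   # placeholder generation budget

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
prompt = "..."  # NER instruction plus the document to tag
# Truncate the prompt so prompt tokens + new tokens fit in the window.
input_ids = tokenizer(
    prompt,
    truncation=True,
    max_length=MODEL_MAX_LEN - MAX_NEW_TOKENS,
).input_ids
```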
-
### What happened?
When trying to use Ollama with LiteLLM with `stream=True`, an exception is thrown.
litellm version: 1.15.0
1. Serve ollama locally on port 11434 (or replace the port in the…
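A minimal repro sketch for the streaming path, assuming an Ollama server on the default port 11434 and a pulled `llama2` model:

```python
from litellm import completion

# Streaming call through LiteLLM's Ollama provider; the reported
# exception occurs on this path.
response = completion(
    model="ollama/llama2",
    messages=[{"role": "user", "content": "Hello"}],
    api_base="http://localhost:11434",
    stream=True,
)
for chunk in response:
    print(chunk)
```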