-
### System Info / 系統信息
Model: glm4-9B-chat
Configuration file:
```yaml
data_config:
  train_file: train.jsonl
  val_file: dev.jsonl
  test_file: dev.jsonl
  num_proc: 1
max_input_length: 3500
max_output_length: 250…
```
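As a hedged sketch of how `max_input_length`-style limits in a fine-tuning data config are typically enforced: token sequences longer than the configured limit are truncated before being packed into a training example. The helper name below is illustrative and not from the GLM4 codebase.

```python
# Illustrative only: mirrors the max_input_length value from the config
# above; the truncation helper is a hypothetical stand-in, not the
# actual GLM4 fine-tuning code.

MAX_INPUT_LENGTH = 3500

def truncate_ids(token_ids, max_length):
    """Keep at most max_length token ids, dropping the overflow tail."""
    return token_ids[:max_length]

# A 4000-token prompt gets clipped to the configured input limit.
prompt_ids = list(range(4000))
clipped = truncate_ids(prompt_ids, MAX_INPUT_LENGTH)
print(len(clipped))  # 3500
```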
-
### Description:
Some messages within the conversation trigger the following error:
```
Diagnostic error: Cannot update a record with pending changes
```
### Environment Information…
-
Hi, we encountered a problem when using TensorRT-LLM to run inference with the baichuan2-7b-chat model. Can someone please help check what's wrong? Thanks, all.
1. Problem: use transformers + hf model and Tensorrt…
-
Running the default example doesn't work:
```text
Namespace(verbose=True, batch_size_for_cuda_graph=1, chat_template='', model='.\\example-models\\phi2-int4-directml')
Loading model...
Model loa…
-
When converting and building llama-7b-hf with int8 kv cache and weights, there may be a memory leak at runtime: each time a batch is submitted, roughly 2–3 GB of memory growth can be observed via nvid…
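One way to make the reported growth concrete is to record the GPU memory reading after each batch and look at the per-batch deltas; a leak shows up as a steadily positive delta. A minimal, library-free sketch (the readings themselves would come from `nvidia-smi` or a similar tool; the values below are hypothetical):

```python
class MemoryGrowthTracker:
    """Collect memory readings taken after each batch and report deltas."""

    def __init__(self):
        self.readings = []  # memory used after each batch (GiB here)

    def record(self, used):
        self.readings.append(used)

    def deltas(self):
        """Per-batch growth; a leak shows as consistently positive values."""
        return [b - a for a, b in zip(self.readings, self.readings[1:])]

# Hypothetical readings (GiB) after three batches, mirroring the report:
tracker = MemoryGrowthTracker()
for used_gib in (10.0, 12.5, 15.0):
    tracker.record(used_gib)
print(tracker.deltas())  # [2.5, 2.5] -- ~2.5 GiB growth per batch
```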
-
File "/usr/local/lib/python3.10/dist-packages/ragas/langchain/evalchain.py", line 166, in evaluate
dataset_with_scores = self.metric.score(dataset, callbacks=callbacks)
File "/usr/local/lib/…
-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-5.15.0-120-generic-x86_64-with-glibc2.35
- Python version: 3.10.15
- Huggingface_hub version: 0.26.2
- Safetensors version: 0.…
-
## 🐛 Bug
Hello team,
Thanks for creating such an amazing engine. I ran Llama-3-8B-Instruct-q4f16_1-MLC in server mode with different batch sizes (2-128) but I still see my requests are being run …
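A quick way to check whether a server actually handles concurrent requests as a batch is to fire them in parallel and compare wall-clock time against sequential latency. This sketch uses a stubbed request function, since the real endpoint and client are not shown in the truncated report:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def fake_request(prompt):
    """Hypothetical stand-in for a real call to the serving endpoint."""
    time.sleep(0.1)  # pretend each request takes ~100 ms of server time
    return f"response to {prompt!r}"

prompts = [f"prompt {i}" for i in range(8)]

start = time.monotonic()
with ThreadPoolExecutor(max_workers=8) as pool:
    results = list(pool.map(fake_request, prompts))
elapsed = time.monotonic() - start

# If the server batches, total time stays near one request's latency;
# if requests run one by one, it approaches 8x that latency.
print(f"{len(results)} responses in {elapsed:.2f}s")
```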
-
### Using a supported version?
- [X] I have searched open and closed issues for duplicates.
- [X] I am using Signal-Desktop as provided by the Signal team, not a 3rd-party package.
###…
-
### Checklist
- [X] I've searched for similar issues and couldn't find anything matching
- [X] I've discussed this feature request in the [OpenIMSDK Slack](https://join.slack.com/t/openimsdk/shared_i…