-
qwen# python convert_checkpoint.py --model_dir /code/tensorrt-llm/Qwen1.5-32B-Chat/ --output_dir ./trt_ckpt/qwen1.5-32b/fp16 --dtype float16 --tp_size 4
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.de…
-
I tried to evaluate the model on PDBBind dataset, but encounter the KeyError: 'l-rmsd', see below for detail messages. Could you help to look at what could be the problem? I also tried --debug paramet…
-
Hi, How to run yolact_edge in jetson AGX Xavier? Thanks!
-
NOTE: 11th gen Core works fine
We only see this issue with the 'ssdlite_mobilenet_v2' model, but not with caffenet
![image](https://user-images.githubusercontent.com/7730267/186473430-8c165a61-97a…
-
I'm currently testing Llama2 70B on DGX-A100 and DGX-H100. I'm running the gptManagerBenchmark as described [here](https://github.com/NVIDIA/TensorRT-LLM/tree/release/0.5.0/benchmarks/cpp) and compari…
jfolz updated
4 months ago
-
## Overview
We need to add support for using [llama.cpp](https://github.com/ggerganov/llama.cpp) as an inference server in our project. llama.cpp is known for its speed, cross-platform compatibility,…
-
Hello,
I am trying to use the evaluation code for prediction on COCO 2017 dataset for replicating/measuring zero-shot performance on coco2017/
Steps I followed:
1. created the DATASET folder…
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
source
### TensorFlow version
tf 2.14.0
### Custom code
Yes
### OS platform and distribution
Ubunt…
-
**Describe the bug**
Circular import error with PyTorch nightly. If I uninstall deepspeed it works fine.
```
Traceback (most recent call last):
File "/test/oss.py", line 322, in
mp.spawn…
-
### 问题确认 Search before asking
- [X] 我已经查询[历史issue](https://github.com/PaddlePaddle/PaddleDetection/issues),没有报过同样bug。I have searched the [issues](https://github.com/PaddlePaddle/PaddleDetection/issue…