-
### 🚀 The feature, motivation and pitch
The request is to extend the [tokenizer](https://github.com/pytorch/torchchat/tree/main/tokenizer) module in `torchchat` to support tokenizers that use the Hug…
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is.
Real time construction site monitoring using Web based Interface and Yolo b…
-
In the ever-evolving world of AI, even language models deserve a chance to find love (or at least, stimulating conversation). Let's build a decentralized dating app exclusively for AI language models!…
-
# 背景
我们发现绝大部分LLM推理引擎在报告推理性能的时候,都是关掉sampling功能的。但是在实际应用中,sampling几乎是必选项。为了给出尽可能贴近实际应用的benchmark,我们开了这个issue,报告 LMDeploy **在采样开启时**候的性能。
# 测试模型
1. llama2-7b
2. llama2-13b
3. internlm-20b
4. llama2…
-
Hello, I would like to know which decoder you used for the report generation task. I am working on report generation using a multimodal large language model approach. Does your method serve as an enco…
-
### Check for existing issues
- [X] Completed
### Describe the bug / provide steps to reproduce it
In version v0.156.0 (Preview) the setting `language_models.ollama.low_speed_timeout_in_seconds` is…
-
### 🚀 The feature, motivation and pitch
`MllamaForConditionalGeneration` models (such as, `meta-llama/Llama-3.2-90B-Vision-Instruct`, `meta-llama/Llama-3.2-11B-Vision`, etc.) are composed of `MllamaV…
-
Hi TensorRT-LLM team, Your work is incredible.
By following the READme file for [multi-modeling](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/multimodal/README.md), we were sucess to run…
-
## installable
- [ ] https://github.com/salesforce/LAVIS
- https://github.com/salesforce/BLIP
- https://github.com/salesforce/ALBEF
- [ ] https://github.com/facebookresearch/multimodal
- …
-
### Feature request
Recently, we have added the ability to load `gguf` files within [transformers](https://huggingface.co/docs/hub/en/gguf).
The goal was to offer the possibility to users …