-
**Why**
To streamline user interactions with the large language model (LLM) in the chat application, users will be able to quickly select from a variety of predefined prompt templates. This featur…
-
Hello team,
We typically use `gather_all_token_logits` to collect the logit tensors for post-processing. Especially for large vocabulary sizes (128,000), this can require a lot of GPU memory. For ex…
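As a rough illustration of why this blows up, the full logits tensor scales as batch × sequence length × vocabulary size × bytes per element. A minimal sketch (the helper name and the example shapes are assumptions for illustration, not tied to any particular build):

```python
def logits_memory_gib(batch_size: int, seq_len: int, vocab_size: int,
                      bytes_per_elem: int = 2) -> float:
    """Approximate size in GiB of a full logits tensor of shape
    [batch_size, seq_len, vocab_size], e.g. fp16 => 2 bytes per element."""
    return batch_size * seq_len * vocab_size * bytes_per_elem / (1024 ** 3)

# Hypothetical example: batch 8, 4096 tokens, 128k vocab, fp16.
print(round(logits_memory_gib(8, 4096, 128_000), 1))  # ~7.8 GiB for logits alone
```

Even modest batch sizes quickly reach multiple GiB just for the gathered logits, on top of weights and KV cache.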
-
Efficient Streaming Language Models with Attention Sinks [paper](https://arxiv.org/abs/2309.17453)
This repo has already implemented it:
[attention_sinks](https://github.com/tomaarsen/attention_si…
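The core cache policy from the paper is simple: always retain the first few "sink" tokens plus a sliding window of the most recent tokens, and evict everything in between. A minimal sketch of which KV-cache positions survive (function name and defaults are illustrative, not the linked repo's API):

```python
def sink_window_indices(cache_len: int, num_sink: int = 4,
                        window: int = 1020) -> list[int]:
    """Positions kept under the attention-sink policy: the first `num_sink`
    tokens are always retained, plus the most recent `window` tokens."""
    if cache_len <= num_sink + window:
        # Cache still fits; nothing is evicted.
        return list(range(cache_len))
    return list(range(num_sink)) + list(range(cache_len - window, cache_len))
```

The cache size is thus bounded at `num_sink + window` entries regardless of how long the stream runs, which is what makes the streaming setting tractable.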
-
### Feature Area
vault data
### Painpoint
I really like this project, but I have trouble using it with my vault. Do you have any tips or tricks for creating notes that can be used by this, …
-
### Your current environment
```text
The output of `python collect_env.py`
```
### 🐛 Describe the bug
Even though I have updated the package to the latest version, the function call is still fa…
-
### Problem
The CLI needs a new RPC method that allows for code changes to be applied to a specific file. This method should take in a file path and new code content, and then use the language model …
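A minimal sketch of what such a handler could look like, assuming the LLM step is elided and the request/response shapes are hypothetical (the function name and return fields are not an existing API):

```python
from pathlib import Path

def apply_code_change(file_path: str, new_content: str) -> dict:
    """Hypothetical RPC handler: write LLM-proposed content to a file
    and return a simple result payload. The step that asks the language
    model to produce `new_content` is assumed to happen upstream."""
    path = Path(file_path)
    if not path.exists():
        return {"ok": False, "error": f"no such file: {file_path}"}
    previous = path.read_text()
    path.write_text(new_content)
    return {"ok": True, "bytes_written": len(new_content),
            "previous_length": len(previous)}
```

Returning the previous length (or the full previous content) leaves room for an undo or diff step later without changing the method's signature.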
-
This was my first time working with LLMs as a Machine Learning Engineer. So, I've learned a few things:
- Prompt engineering is crucial for the performance and accuracy of the application, and ev…
-
We are building a voice-interactive chatbot that leverages cutting-edge technologies such as Speech-to-Text (STT), Text-to-Speech (TTS), and local Large Language Models (LLMs), with a focus on Ollama'…
-
We're having issues running inference efficiently at scale. By default we process the audio segments one by one, but is there any support for batch inference to speed th…
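One common way to batch variable-length audio is to pad every clip's features to the longest length in the batch and keep the true lengths around for masking. A generic sketch, independent of any specific inference library (names are illustrative):

```python
def pad_batch(clips: list[list[float]], pad_value: float = 0.0):
    """Pad variable-length feature sequences to a uniform length so they
    can be stacked and run through the model in one forward pass.
    Returns the padded batch plus the original lengths for masking."""
    max_len = max(len(clip) for clip in clips)
    padded = [clip + [pad_value] * (max_len - len(clip)) for clip in clips]
    lengths = [len(clip) for clip in clips]
    return padded, lengths
```

Whether batching actually helps then depends on the backend: the model has to accept a leading batch dimension and respect the length mask so padding doesn't pollute the output.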
-
References:
- [ReLeLa](https://relela.com/)
- [BETO: Spanish BERT](https://github.com/dccuchile/beto)
- The models by [Jorge Ortiz Fuentes](https://huggingface.co/jorgeortizfuentes), such as [Tulio…