-
### Is There an Existing Issue for This?
- [X] I have searched the existing issues
### Where do you intend to apply this feature?
Instill Core, Instill Cloud
### Is your Proposal Related to a Prob…
-
Feel free to reuse as much of these tutorials as possible, but this is also a good opportunity to review and rewrite them.
Some things to keep in mind:
- Start by identifying a real-world problem and/or datas…
-
### Is your feature request related to a problem? Please describe
Currently, our system assigns each model to a unique GPU device. While this approach ensures protection against out-of-memory (OOM) e…
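The one-model-per-GPU policy described above can be sketched as a simple assignment map. This is an illustrative sketch only; the function and model names are hypothetical and not taken from the actual system:

```python
# Hypothetical sketch of the current policy: each model gets a dedicated
# GPU index, which avoids cross-model OOM but caps the number of models
# at the number of devices.
def assign_models_to_gpus(model_names, num_gpus):
    """Map each model name to a unique GPU index; fail if GPUs run out."""
    if len(model_names) > num_gpus:
        raise RuntimeError(
            f"{len(model_names)} models but only {num_gpus} GPUs available"
        )
    return {name: gpu for gpu, name in enumerate(model_names)}

mapping = assign_models_to_gpus(["llama3-8b", "mistral-7b"], num_gpus=4)
# mapping == {"llama3-8b": 0, "mistral-7b": 1}
```

The sketch makes the trade-off concrete: the hard one-to-one mapping is what a shared-GPU scheme would have to relax.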
-
### System Info
- 1x H100
- Llama3 8B Instruct
- TensorRT-LLM v0.10.0
- tensorrtllm_backend v0.10.0
- tritonserver 24.06
### Who can help?
@kaiyux
### Information
- [X] The officia…
-
Consider the potential impact of adding a further layer to the retrieval method by integrating a vector store enriched with metadata about the data.
This enhancement could pro…
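A metadata-enriched retrieval layer of the kind proposed above might look like the following. This is a minimal sketch under assumed data shapes (a list of dicts with `vec`, `text`, and `meta` keys); none of these names come from the project itself:

```python
import math

def cosine(a, b):
    # Plain cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(store, query_vec, metadata_filter, top_k=2):
    """Filter the store by metadata first, then rank survivors by similarity."""
    candidates = [
        d for d in store
        if all(d["meta"].get(k) == v for k, v in metadata_filter.items())
    ]
    candidates.sort(key=lambda d: cosine(d["vec"], query_vec), reverse=True)
    return [d["text"] for d in candidates[:top_k]]

store = [
    {"vec": [1.0, 0.0], "text": "a", "meta": {"source": "docs"}},
    {"vec": [0.0, 1.0], "text": "b", "meta": {"source": "web"}},
    {"vec": [0.9, 0.1], "text": "c", "meta": {"source": "docs"}},
]
retrieve(store, [1.0, 0.0], {"source": "docs"})
# → ["a", "c"]
```

Filtering before ranking is the key design choice: metadata narrows the candidate set cheaply, so the similarity search only runs over relevant documents.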
-
When the code file is long, it makes changes to the file and, in places, inserts comments like:
// Rest of the code remains the same
This essentially renders the code file useless.
-
This issue proposes integrating Google's Gemini large language model (LLM) into the Webwright project to provide advanced coding assistance capabilities.
Gemini is a state-of-the-art LLM trained on a…
-
I have identified some opportunities to improve the session management and overall functionality in the llm_tracker.py, client.py, and session.py files. These changes aim to enhance the robustness and…
-
Hello!
Thanks for sharing the details of your implementation. I'm wondering which LLaMA-Factory template you used for your fine-tuning: `alpaca`, `deepseek`, or maybe a custom one?
Also did you …
-
If a GPU is available on the user's machine, using it instead of the CPU to process GIF files would be a much more efficient and effective solution in terms of processing time.
…
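A common way to implement the GPU-with-CPU-fallback behaviour suggested above is to probe for an accelerator at startup and select the device once. This sketch assumes PyTorch as the backend, which may not be what the project uses:

```python
def pick_device():
    """Return "cuda" when a CUDA-capable GPU is usable, else "cpu".

    Wrapping the probe in try/except keeps the feature optional: machines
    without torch (or without a GPU) silently fall back to CPU processing.
    """
    try:
        import torch  # assumed backend; swap for the project's actual one
        if torch.cuda.is_available():
            return "cuda"
    except ImportError:
        pass
    return "cpu"

device = pick_device()
# Downstream GIF-frame processing would then move its tensors to `device`.
```

Probing once and threading the result through the pipeline avoids repeated availability checks and keeps the CPU path identical to the current behaviour.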