-
### What feature would you like to be added?
A tutorial similar to the one in v0.2, starting with the most basic example.
### Why is this needed?
For the audience that are new to agents and LLM appli…
-
Hi all,
I have tested the ggml-model-tl1.gguf (Llama3-8B-1.58-100B-tokens) model on iOS (iPhone 15 Pro) using the llamaSwiftUI sample project. However, I encountered an EXC_BAD_ACCESS error at ggml_v…
-
When the Anthropic API key hits its per-minute or daily rate limit, the web app just keeps spinning on "Claude is working".
Per-minute rate error:
```
2024-11-06T01:47:56.235Z [ERROR] Error calling An…
-
### Summary
Hi @DhanshreeA,
Below I am writing my thoughts around the LLM processing of publications and metadata. Feel free to take it from here, including breaking this into smaller tasks.
…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.5.1+cpu
Is debug build: False
CUDA used to build PyTorch…
-
**What problem or use case are you trying to solve?**
Not Diamond intelligently identifies which LLM is best-suited to respond to any given query. We want to implement a mechanism in OpenHands to s…
-
# URL
- https://arxiv.org/abs/2409.14924
# Affiliations
- Siyun Zhao, N/A
- Yuqing Yang, N/A
- Zilong Wang, N/A
- Zhiyuan He, N/A
- Luna K. Qiu, N/A
- Lili Qiu, N/A
# Abstract
- Large la…
-
To improve the ESQL task, we could try to retrieve some context from the customer cluster.
We could for example fetch the list of all indices / aliases / datastreams the current user has access to, a…
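
As a rough illustration of the idea above, here is a minimal sketch of how such names could be turned into a context block for the ESQL task prompt. It assumes the indices, aliases, and datastreams have already been fetched from the cluster; the fetching itself is not shown. `build_cluster_context` and `max_items` are hypothetical names chosen for this example, not part of any existing API.

```python
from typing import Iterable


def build_cluster_context(
    indices: Iterable[str],
    aliases: Iterable[str],
    data_streams: Iterable[str],
    max_items: int = 20,  # hypothetical cap to keep the prompt small
) -> str:
    """Format the names the current user can access as a plain-text context block."""

    def section(title: str, names: Iterable[str]) -> str:
        unique = sorted(set(names))  # de-duplicate and order deterministically
        lines = [f"{title} ({len(unique)}):"]
        lines += [f"- {name}" for name in unique[:max_items]]
        if len(unique) > max_items:
            # Summarize what was cut instead of silently dropping it.
            lines.append(f"- ... and {len(unique) - max_items} more")
        return "\n".join(lines)

    return "\n\n".join(
        [
            section("Indices", indices),
            section("Aliases", aliases),
            section("Data streams", data_streams),
        ]
    )
```

Capping each section is one possible design choice: clusters can hold thousands of indices, and an unbounded list could crowd out the rest of the prompt.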
-
### System Info
cpu intel 14700k
gpu rtx 4090
tensorrt_llm 0.13
docker tritonserver:24.09-trtllm-python-py3
### Who can help?
@Tracin
### Information
- [X] The official example scri…
-
/kind feature
**Describe the solution you'd like**
This proposal outlines the need for an API to standardize the discovery, installation, and aggregation of LLM functions, agents or tools in Kub…