-
**Issue description:** When using a screen reader and submitting a query or message, there is no indication that the chat thread has been updated.
**WCAG Criteria:** [SC 1.3.1 Info and Relationships](ht…
-
### What happened?
I am currently using the LiteLLM proxy via the API only.
This is the container image version `ghcr.io/berriai/litellm:main-v1.44.2`
TL;DR: the team budget seems to work, individual me…
-
I have a function in a plug that calls `fetch` like this, where `body` is a string (a JSON-encoded object):
```javascript
const response = await fetch(
aiSettings.openAIBaseUrl + "/chat/comp…
-
An expert in TPU compiler development can potentially introduce sampling techniques into programs for specific purposes. Here's a breakdown of the concept:
**Sampling for TPU Programs:**
* **Expert-…
-
### What happened?
Here is an example usage:
```python
chunks = []
try:
    async for chunk in stream_resp:
        text = chunk.choices[0].delta.content or ""
        yield text
        chunks.ap…
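# (Snippet truncated above.) A self-contained sketch of the same
# yield-and-collect pattern. fake_stream() and its SimpleNamespace chunks
# are hypothetical stand-ins for the OpenAI-style stream objects in the
# snippet, not real library APIs:
import asyncio
from types import SimpleNamespace

async def fake_stream():
    # Mimics chunk.choices[0].delta.content, including a final None delta.
    for piece in ["Hel", "lo", None]:
        yield SimpleNamespace(
            choices=[SimpleNamespace(delta=SimpleNamespace(content=piece))]
        )

async def stream_text(stream_resp):
    chunks = []
    try:
        async for chunk in stream_resp:
            text = chunk.choices[0].delta.content or ""
            yield text
            chunks.append(text)
    finally:
        # The full response is available once the stream is exhausted.
        full_response = "".join(chunks)

async def main():
    collected = []
    async for text in stream_text(fake_stream()):
        collected.append(text)
    return "".join(collected)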
-
### Bug Description
I'm doing RAG using llama-index. The model is Phi3-mini-4k. I have experimented with all the models that support the sub-query engine. When comparing those models, I got pretty good results…
-
I'd like to run `topic_model.topics_over_time()` but only on a specific subset of documents and topics. Sometimes, when working with a large corpus with lots of topics, running it on all documents and to…
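One workaround, pending a built-in option, is to filter the documents and timestamps down to the topics of interest before calling `topics_over_time()`. The helper below is a minimal, hypothetical sketch (not a BERTopic API); it assumes you still have the per-document topic assignments returned by `fit_transform()`:

```python
# Hypothetical helper: keep only the docs/timestamps whose assigned topic
# is in keep_topics, so topics_over_time() runs on that subset alone.
def subset_for_topics(docs, timestamps, topic_assignments, keep_topics):
    keep = set(keep_topics)
    kept = [
        (doc, ts)
        for doc, ts, topic in zip(docs, timestamps, topic_assignments)
        if topic in keep
    ]
    sub_docs = [doc for doc, _ in kept]
    sub_timestamps = [ts for _, ts in kept]
    return sub_docs, sub_timestamps
```

Assumed usage: after `topics, probs = topic_model.fit_transform(docs)`, call `subset_for_topics(docs, timestamps, topics, keep_topics={0, 3, 7})` and pass the filtered lists to `topic_model.topics_over_time(...)`.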
-
Using the instructions here: https://github.com/ray-project/ray-llm#how-do-i-deploy-multiple-models-at-once I'm trying to host two models on a single A100 80G.
Two bundles are generated for the pla…
-
![error](https://github.com/user-attachments/assets/c6a351db-0074-4db7-bc68-9b6eb9f3081f)
After running the app.py file and putting the model in the web_app_storage/models folder, I get this er…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…