llm-framework Search Results

1000+ results
for llm-framework

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

flexflow/FlexFlow #1454

Questions about the measurement of the latency

Hello, FlexFlow team! Thank you for your outstanding work! I am attempting to reproduce the experimental results from the paper "SpecInfer: Accelerating Generative Large Language Model Serving with…

QAZWSX0827 updated 1 month ago
2
biocypher/biocypher #374

Graph nodes Semantic search

### Issue description Somewhat a generalization of https://github.com/biocypher/biochatter/issues/204 Either a permutation of user query or a semantic search approach is necessary to avoid false ne…

winternewt updated 4 days ago
2
langgenius/dify #6589

Optimization Suggestions for Database Connection Release Iss…

### Self Checks - [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones. - [X] I confirm that I am using English to su…

secbr updated 3 weeks ago
6
explodinggradients/ragas #927

Get Started section for "Generate a Synthetic Test Set" is b…

[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. The [get started section on synthetic data generation](https://docs.ragas.io/en/sta…

ckrapu-nv updated 3 months ago
3
vllm-project/vllm #4678

[Usage]: Out of Memory w/ multiple models

### Your current environment ```text torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 224.00 MiB. GPU ``` ### How would you like to use vllm I'm running a eval framework …

yudataguy updated 4 months ago
1
microsoft/RAG_Hack #91

Project: AI-Powered Research Paper Assistant

### Project Name Paper Mentor AI ### Description **Paper Mentor AI** is an intelligent research assistant designed to help students, researchers, and professionals streamline their research p…

iAMSagar44 updated 1 week ago
2
vllm-project/llm-compressor #106

[Bug]: Index Error tuple out of range

**Describe the bug** I'm trying to apply "W4A16" quantisation to the qwen2-7B model. In particular "cognitivecomputations/dolphin-2.9.2-qwen2-7b" though I've tried with other qwen2 models and had the…

SeanIsYoung updated 1 month ago
2
weaviate-tutorials/Hurricane #1

where is gpt4_compiled_hurricane.json come from?

in backend.py there is gpt4_compiled_hurricane.json,can u tell about it? when compile hurricane,there any parameters such as metric?

skyroot updated 1 month ago
1
Paitesanshi/LLM-Agent-Survey #26

One reference on LLM Agents playing Trust Games

Congratulations on your recent solid survey paper and impressive paper list! We have a related paper on LLM Agents playing Trust Games. Can Large Language Model Agents Simulate Human Trust Beha…

canyuchen updated 4 months ago
1
nestauk/dap_taltech #1

Create tutorial material for LLMs

1. [x] go over @ampudia19 's material 2. [x] slides framework 3. [x] notebook framework 4. [x] first draft LLM slides 5. [ ] tutorial for LLMs

india-kerle updated 1 year ago
1

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for llm-framework

1000+ results
for llm-framework