-
### Your current environment
```text
vllm=0.5.4
```
```python
llm = LLM(
    model=MODEL_NAME,
    trust_remote_code=True,
    gpu_memory_utilization=0.5,
    max_model_len=2048,
    tensor_paralle…
```
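For reference, here is a minimal sketch of what a complete vLLM constructor call along these lines typically looks like; the model id, the `tensor_parallel_size` value, and the generation call are placeholders I am assuming, not the reporter's actual settings.

```python
from vllm import LLM, SamplingParams

# Hypothetical, self-contained version of the setup above; values are placeholders.
MODEL_NAME = "facebook/opt-125m"  # assumed model id, not the reporter's

llm = LLM(
    model=MODEL_NAME,
    trust_remote_code=True,
    gpu_memory_utilization=0.5,   # fraction of GPU memory vLLM is allowed to use
    max_model_len=2048,           # maximum context length
    tensor_parallel_size=1,       # assumed; number of GPUs to shard the model across
)

outputs = llm.generate(["Hello, my name is"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```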
-
```text
python run_awq.py --model_name Qwen/Qwen1.5-7B-Chat --task quantize
Namespace(model_name='Qwen/Qwen1.5-7B-Chat', target='aie', profile_layer=False, task='quantize', precision='w4abf16', flash_attenti…
```
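For illustration only, a rough argparse sketch that would produce a Namespace like the one printed above; the flag names are taken from that output, while the defaults and any omitted flags are my assumptions rather than the script's actual definition.

```python
import argparse

# Hypothetical reconstruction of run_awq.py's argument parser based on the
# Namespace printed above; defaults are assumed, remaining flags are omitted.
parser = argparse.ArgumentParser(description="AWQ quantization runner (sketch)")
parser.add_argument("--model_name", default="Qwen/Qwen1.5-7B-Chat")
parser.add_argument("--target", default="aie")
parser.add_argument("--profile_layer", action="store_true")
parser.add_argument("--task", default="quantize")
parser.add_argument("--precision", default="w4abf16")

args = parser.parse_args()
print(args)  # e.g. Namespace(model_name='Qwen/Qwen1.5-7B-Chat', target='aie', ...)
```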
-
### Checklist
- [X] I have searched the existing issues
### Is your feature request related to a problem? Please describe it
Please add a web search feature to it. I think the DuckDuckGo API will be best for …
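For context, here is a minimal sketch of what such a web-search call could look like with the community `duckduckgo_search` package; the package choice, the query, and the result handling are assumptions on my part, not part of the original request.

```python
from duckduckgo_search import DDGS

# Hypothetical web-search helper built on the duckduckgo_search package;
# the query string and the fields used below are placeholders.
def web_search(query: str, max_results: int = 5):
    with DDGS() as ddgs:
        # each result is a dict with 'title', 'href', and 'body' keys
        return list(ddgs.text(query, max_results=max_results))

for result in web_search("retrieval augmented generation"):
    print(result["title"], result["href"])
```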
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I created two RetrieverTools for retrieving and answering specific questions, but for ot…
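For context, a rough sketch of how two RetrieverTools are typically constructed in LlamaIndex; the data paths, index setup, and tool descriptions below are placeholders I am assuming, not the asker's actual code.

```python
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
from llama_index.core.tools import RetrieverTool

# Hypothetical setup of two retriever tools over separate document sets;
# directory names and descriptions are placeholders.
product_docs = SimpleDirectoryReader("data/product_docs").load_data()
policy_docs = SimpleDirectoryReader("data/hr_policies").load_data()

product_tool = RetrieverTool.from_defaults(
    retriever=VectorStoreIndex.from_documents(product_docs).as_retriever(),
    description="Retrieves passages from the product documentation.",
)
policy_tool = RetrieverTool.from_defaults(
    retriever=VectorStoreIndex.from_documents(policy_docs).as_retriever(),
    description="Retrieves passages about internal HR policies.",
)
```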
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I hav…
-
Hi, I've just noticed that by setting use_prefix_cache=True/False, the results can change quite substantially.
Take, for example, this code here:
```python
llm = AutoModelForCausalLM.from_pretra…
-
### Your current environment
vllm version: 0.6.3.post1
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I see on the official Gemma model page, https://huggingface.co/google/gemma-2b, cont…
-
### What you would like to be added?
Inspired by this research paper [Vidur: A Large-Scale Simulation Framework For LLM Inference](https://proceedings.mlsys.org/paper_files/paper/2024/file/b74a8de47d…
-
### Self Checks
- [X] I have searched for [existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
### Organization Name
SWIRL Corporation
### Main office location
235 Bear Hill Rd, Suite 201, Waltham MA 02451
### What regions of the world do you serve?
Global, North America
### Business desc…