-
This is a feature request to deploy Small Language Models (SLMs), e.g. 3B or 1B. SLMs are improving quickly and are becoming a good choice for narrow-scope use cases.
Examples include TinyLlama, Minichat…
-
The idea is perhaps future-looking, but I'd like to bring it up for discussion.
## Motivations
* Reduce the GPU/NPU memory required for completing a use case (e.g. text2image).
* Reduce the mem…
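For a sense of what deploying such a model looks like, here is a minimal sketch using Hugging Face transformers; the model ID and prompt are illustrative examples, not part of this request:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Sketch: a ~1.1B-parameter SLM runs in a few GB of memory.
model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # example model, not prescriptive
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Write a short prompt for a text2image model:", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```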
-
Hello,
Could we please have 13B and 7B models with the updated architecture that includes grouped query attention? A lot of people are running these models on machines with low memory, and this woul…
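For context on why grouped query attention helps on low-memory machines, here is a back-of-the-envelope sketch of KV-cache size; the layer/head/dimension numbers are illustrative for a 7B-class model, not actual configs:
```python
# Back-of-the-envelope: KV-cache size with and without grouped query attention.
# All numbers below are illustrative for a 7B-class model, not official specs.
n_layers, head_dim, seq_len, bytes_fp16 = 32, 128, 4096, 2

def kv_cache_gib(n_kv_heads: int) -> float:
    # Factor of 2 accounts for storing both keys and values.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_fp16 / 2**30

print(kv_cache_gib(32))  # full multi-head attention (32 KV heads): 2.0 GiB
print(kv_cache_gib(8))   # grouped query attention (8 KV heads):    0.5 GiB
```
Sharing 8 KV heads across the query heads cuts the cache 4x at the same context length, which is exactly what helps on low-memory machines.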
-
Hi All,
Thank you for your amazing work!
Where can we find a list of models that support Structured JSON Generation? Do all the models support that?
We were able to find a list of models in the [HF…
-
### The Feature
Currently, you only support a small number of `json`-format models:
https://docs.litellm.ai/docs/completion/json_mode
### Motivation, pitch
I would need to be able to do the same …
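For reference, requesting JSON mode through LiteLLM looks roughly like the sketch below; the model name is just an example of one that supports `response_format`:
```python
import litellm

# Sketch: ask for JSON-mode output through LiteLLM.
# "gpt-4o-mini" is only an example of a model that supports response_format.
response = litellm.completion(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Return a JSON object with keys 'name' and 'size'."}],
    response_format={"type": "json_object"},
)
print(response.choices[0].message.content)
```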
-
The inference time is way too high; we should try a much smaller model from Ollama:
* dolphin-phi (2.7B uncensored Dolphin model)
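As a rough sketch, switching to the smaller model could look like this, using Ollama's local REST API; the prompt is illustrative:
```python
import requests

# Sketch: call the smaller model through Ollama's local REST API.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "dolphin-phi", "prompt": "Summarize this issue in one line.", "stream": False},
)
print(resp.json()["response"])
```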
-
Hello,
In the supplemental information of the bioRxiv preprint (https://www.biorxiv.org/content/10.1101/2023.02.06.527280v2.supplementary-material), I read that over-prediction in small contigs (small…
-
I'm trying to quantize some Flux models to lower the VRAM requirements, and I get this error:
```
(venv) C:\AI\llama.cpp\build>bin\Debug\llama-quantize.exe "C:\AI\ComfyUI_windows_portable\ComfyUI\models\chec…
```
-
With local deployment, the PRELOAD_MODELS config variable works perfectly:
```
PRELOAD_MODELS='["Systran/faster-whisper-medium.en", "Systran/faster-whisper-small.en"]' MAX_MODELS=2 uvicorn main:a…
```
-
We aim to implement a system that leverages distillation and quantization to create a "child" neural network by combining parameters from two "parent" neural networks. The child network should inherit…
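As a rough sketch of one way to combine two parents (simple parameter interpolation; the scheme actually intended here may differ), assuming PyTorch:
```python
import torch.nn as nn

# Sketch: build a "child" by linearly interpolating two parents' parameters.
def make_net() -> nn.Module:
    return nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))

parent_a, parent_b = make_net(), make_net()
child = make_net()

alpha = 0.5  # hypothetical mixing weight between the two parents
state_b = parent_b.state_dict()
child.load_state_dict({
    name: alpha * p + (1 - alpha) * state_b[name]
    for name, p in parent_a.state_dict().items()
})
# The child could then be distilled against the parents' outputs and
# quantized (e.g. with torch.ao.quantization) to shrink it further.
```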