-
Hello,
Thank you for sharing this work. I've noticed how well these models work with smaller audio files. Are there any plans to release the large models?
Thanks
-
- An advanced type of language model that uses deep learning techniques on large amounts of text data.
- Capable of generating human-like text; used for Q&A and text-to-text tasks.
- Techniques ranging from n-grams to neural networks are used. …
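To make the n-gram idea above concrete, here is a minimal bigram language model sketch in plain Python (the toy corpus and function names are illustrative, not taken from any particular library):

```python
from collections import Counter, defaultdict

# Toy corpus; a real n-gram model would be trained on far more text.
corpus = "the cat sat on the mat the cat ran".split()

# Count bigram transitions: counts[prev][next] = occurrences.
counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

def next_word_probs(word):
    """Return the conditional distribution P(next | word)."""
    c = counts[word]
    total = sum(c.values())
    return {w: n / total for w, n in c.items()}

# After "the", "cat" is twice as likely as "mat" in this corpus.
print(next_word_probs("the"))
```

Neural language models replace these count-based tables with learned parameters, which is what lets them generalize beyond observed n-grams.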
-
Is anything larger than 2.7B cooking? I'm itching to test the larger capacity and how it scales against other small LLMs of comparable size (or comparable resource use).
-
**Open Source**
Unfortunately, most Llama-based and other free models fail to work with the tools defined by `langchain`. They work for single functions, but already at the current complexity of `langsim` …
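For context, the failure mode is usually the structured-call step: the model must emit a well-formed tool invocation that the framework can dispatch. Here is a generic sketch of that loop in plain Python (this is not the actual `langchain` API; the registry, decorator, and JSON shape are hypothetical illustrations):

```python
import json

# Hypothetical tool registry, standing in for a framework's tool abstraction.
TOOLS = {}

def tool(fn):
    """Register a function so the dispatcher can call it by name."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b

def dispatch(model_output: str):
    # The model is assumed to emit JSON like {"tool": "add", "args": {...}};
    # many open-weight models fail to produce this structure reliably.
    call = json.loads(model_output)
    return TOOLS[call["tool"]](**call["args"])

print(dispatch('{"tool": "add", "args": {"a": 2, "b": 3}}'))  # 5
```

If a model cannot consistently produce the structured call, every layer above this loop breaks, regardless of how the tools themselves are defined.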
-
State-space models are too slow in creation and in method calls when the A matrix order is ~1900. I have attached two sets of matrices to reproduce the issue.
[ssm_a.zip](https://github.com/user-attachments/…
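For anyone without the attachment, a synthetic system of the same order can stand in as a rough reproduction (random matrices, assuming the `scipy.signal` state-space interface rather than the attached data):

```python
import numpy as np
from scipy import signal

n = 1900  # A-matrix order reported above
rng = np.random.default_rng(0)
A = rng.standard_normal((n, n)) / np.sqrt(n)  # scaled to keep eigenvalues moderate
B = rng.standard_normal((n, 1))
C = rng.standard_normal((1, n))
D = np.zeros((1, 1))

# Creation is the step reported to be slow at this order.
sys = signal.StateSpace(A, B, C, D)
print(sys.A.shape)  # (1900, 1900)
```

Timing this construction (and subsequent method calls) on synthetic matrices should show whether the slowdown depends on the specific attached data or only on the system order.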
-
I want to quantize the CodeQwen model using a custom dataset, but all sample lengths exceed 512. Why doesn't AWQ support samples longer than 512 tokens? Are there any alternative methods for quan…
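One common workaround, if the quantizer caps calibration samples at 512 tokens, is to split each long sample into chunks before passing them in. A minimal sketch over plain token-id lists (the function name is made up for illustration; whether chunked calibration data is acceptable for AWQ is an assumption to verify):

```python
def chunk_samples(token_ids_list, max_len=512):
    """Split each tokenized sample into pieces of at most max_len tokens."""
    chunks = []
    for ids in token_ids_list:
        for start in range(0, len(ids), max_len):
            chunks.append(ids[start:start + max_len])
    return chunks

# A single 1300-token sample becomes chunks of 512, 512, and 276 tokens.
chunks = chunk_samples([list(range(1300))])
print([len(c) for c in chunks])  # [512, 512, 276]
```

Chunking loses cross-chunk context, so for code datasets it may be worth splitting on natural boundaries (e.g. function ends) rather than at fixed offsets.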
-
Hi all,
I am wondering what the preferred way is to create a model that is too large to fit on a single device.
As a reference starting point, if I use data parallelism, I will first create per-…
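One common answer is tensor (model) parallelism: shard each weight matrix across devices and gather the partial results. A toy NumPy sketch of the column-sharded case, with devices simulated by plain arrays (the names and the two-device split are illustrative, not any framework's API):

```python
import numpy as np

def shard_columns(W, n_devices):
    # Each simulated "device" holds a contiguous slice of the output columns.
    return np.array_split(W, n_devices, axis=1)

def parallel_matmul(x, shards):
    # Every device computes x @ its shard; concatenation plays the
    # role of the all-gather that reassembles the full activation.
    return np.concatenate([x @ W_i for W_i in shards], axis=-1)

W = np.arange(12.0).reshape(3, 4)  # full weight matrix (too big for one device, in spirit)
x = np.ones((1, 3))
out = parallel_matmul(x, shard_columns(W, 2))
print(np.allclose(out, x @ W))  # True
```

The same idea underlies real implementations (e.g. Megatron-style tensor parallelism), where the concatenation becomes a collective communication op between devices.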
-
```
---------------------------------------------------------------------------
NotImplementedError                       Traceback (most recent call last)
Cell In[15], [line 2](vscode-notebook-cell:?e…
```
-
Consider the following:
```
library(rigr)
library(tidyverse)
library(survival)
data(mri)
mri_complete <- mri[complete.cases(mri), ]
mri_tte <- …
Warning: Using formula(x) is deprecated when x is a character vector of length > 1.
Consider formula(paste(x, collapse = " ")) instead.
```
…
adw96 updated
1 month ago
-
e.g., with reference to the DeepSpeed code in the TOFU codebase.
What type of training? (full fine-tuning vs. PEFT, etc.)
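The choice matters for resource planning. A back-of-envelope comparison of trainable parameters for a single weight matrix under full fine-tuning vs. a LoRA-style PEFT adapter (the dimensions and rank here are illustrative, not taken from TOFU):

```python
# Full fine-tuning updates every weight; a LoRA-style adapter only trains
# two low-rank factors A (r x d_in) and B (d_out x r).
d_in, d_out, r = 4096, 4096, 8  # illustrative sizes, typical of a 7B-scale layer

full_params = d_in * d_out        # 4096 * 4096 trainable weights
lora_params = r * (d_in + d_out)  # A and B factors combined

print(full_params, lora_params, f"{lora_params / full_params:.2%}")
```

At rank 8 the adapter trains well under 1% of the matrix's parameters, which is why the full-fine-tuning vs. PEFT distinction dominates the DeepSpeed configuration and memory budget.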