architecture-models Search Results

1000+ results for architecture-models

Best match


huggingface/transformers #34238

GGUF support for BERT architecture

### Feature request I want to add the ability to use GGUF BERT models in transformers. Currently the library does not support this architecture. When I try to load it, I get an error TypeError: Ar…

Dimmension updated 21 hours ago
1
metatensor/metatrain #343

Downloading pre-trained models

A very nice feature would be the ability to download pre-trained models. To implement this, the existing `export` CLI command can be extended. The current API is ``` mtt export \ …

PicoCentauri updated 2 weeks ago
2
xinghaochen/SLAB #7

About RepBN

Thank you for the great work! Why do you add a shortcut to BN in RepBN? Similar to the explanation in RepVGG, is it to construct a multi-branch architecture to make the model an implicit ensemble of nu…

leily578 updated 1 month ago
2
anarchy-ai/LLM-VM #130

Be able to run all models on all architectures.

The wrong architecture for a model shouldn't cause a hang, segfault, or error; it should just be slow.

mmirman updated 7 months ago
1
k2-fsa/sherpa-onnx #1386

[WIP Documentation] German support

This issue is to track how to get German working and what options one needs to consider. # dotnet-examples https://github.com/k2-fsa/sherpa-onnx/tree/master/dotnet-examples - [ ] keyword-spotting-…

GeorgeS2019 updated 2 weeks ago
7
a-r-r-o-w/cogvideox-factory #28

float8 matmul for inference + torchao fp8 training

Torch has support for float8 matmul kernels, and it seems like they are faster than bf16 on Ada and above architectures. TorchAO supports training in fp8. This has been explored in a few newer optimiz…

a-r-r-o-w updated 5 days ago
1
Aryan-Chharia/Computer-Vision-Projects #98

Implement Image Transformation using Cycle GAN

### Issue Title: **Implement Image Transformation using Cycle GAN** ### Issue Description: In this task, the goal is to transform images into different forms using **Cycle GAN**, a type of Ge…

Lonwwolf14 updated 6 days ago
1
unslothai/unsloth #1005

Issues with saving to hub -> Gemma based models

I have been using unsloth daily for quite some time now and have tried every architecture; there seems to be a problem with Gemma-based models when saving to the hub. The tokenizer.model file never seems to g…

Ammar-Alnagar updated 1 month ago
2
automl/NASLib #134

Storing of models or architectures in optimizers

In the `get_candidates` function of the `zerocost` branch optimizers `Bananas` and `Npenas` there is a discrepancy between how candidates in the `next_batch` are stored. If the acquisition function…

jr2021 updated 2 years ago
1
NHirose/learning-language-navigation #2

some question

Recently, I had the pleasure of reading your paper LeLaN, and I must say it is an impressive piece of work. However, I have a few questions that I would appreciate your clarification on: 1. In Figure…

wlll123456 updated 2 days ago
3
