-
### Feature request
Support Musicgen Melody's ONNX exportation with audio prompting.
### Motivation
Currently, Optimum does not support exporting Musicgen Melody models. The current implementation i…
-
Thank you for sharing the code!
Could you please let me know which versions of **_Triton, Torch, causal-conv1d, and mamba-ssm_** you are using? I encountered some weird issues with mamba and causal…
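For anyone reporting these versions, here is a minimal sketch that prints them using only the standard library; the PyPI package names used below are my assumption for the libraries mentioned above:

```python
from importlib.metadata import version, PackageNotFoundError

def pkg_version(name: str) -> str:
    """Return the installed version of a package, or a note if it is missing."""
    try:
        return version(name)
    except PackageNotFoundError:
        return "not installed"

# Assumed PyPI names for the four libraries asked about above.
for pkg in ("triton", "torch", "causal-conv1d", "mamba-ssm"):
    print(f"{pkg}: {pkg_version(pkg)}")
```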
-
Used the following lines for env creation:
```
conda create --name unsloth_env python=3.10
conda activate unsloth_env
conda install pytorch-cuda= pytorch cudatoolkit xformers -c pytorch -c nvi…
```
-
### Motivation
As vLLM supports more and more models and functions, they require different attention, scheduler, executor, and input/output processor implementations. These modules are becoming increasingly com…
-
It would be convenient to allow the encoder [output_size](https://github.com/CUNY-CL/yoyodyne/blob/master/yoyodyne/models/modules/lstm.py#L99) to be different from the TransformerDecoder embedding siz…
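A minimal sketch of one way to decouple the two sizes, assuming a learned linear projection bridges the mismatch (the layer name and sizes here are illustrative, not Yoyodyne's actual implementation):

```python
import torch
import torch.nn as nn

# Illustrative sizes: the encoder output and decoder embedding differ.
ENCODER_OUTPUT_SIZE = 256
DECODER_EMBEDDING_SIZE = 512

# A linear "bridge" maps encoder states into the decoder's embedding space,
# so the two dimensions no longer have to match.
bridge = nn.Linear(ENCODER_OUTPUT_SIZE, DECODER_EMBEDDING_SIZE)

encoder_states = torch.randn(8, 20, ENCODER_OUTPUT_SIZE)  # (batch, seq, dim)
projected = bridge(encoder_states)
print(projected.shape)  # torch.Size([8, 20, 512])
```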
-
When using ORPO to fine-tune mistral-7b-instruct-v0.3-bnb-4bit, the following error message appears after calling orpo_trainer.train():
`-------------------------------------------------…
-
```
{
"name": "CompilationError",
"message": "at 53:4:
loss_ptr += row_idx
logsumexp_ptr += row_idx * N_CHUNKS + chunk_idx
labels_ptr += row_idx
col_offsets = chun…
-
Hi Vik,
Thanks for all the help! It works perfectly with the `cuda` option. Wondering if you have seen this before when using `cpu`.
The model is loaded by:
```
DEVICE = "cpu"
DTYPE = torch.f…
-
### 🐛 Describe the bug
```python
from functools import lru_cache
from torch.nn.attention.flex_attention import flex_attention, create_block_mask
import torch
torch._dynamo.config.cache_s…
-
### Question
Hello,
I have trained a LlavaMistralForCausalLM model based on openchat (**not the MoE version**), but when I use predict.py
I get the following error:
```
File ~/scripts/MoE-LLaVA/…