-
Dear Xiaoyuan Zhang,
I am very interested in your project. This year, my research group published a paper titled "A Hyper-Transformer Model for Controllable Pareto Front Learning with Split Feasibi…
-
![5D190FA6EE718064BEC8DBD812DCF1B3](https://github.com/user-attachments/assets/f7fd1920-6046-46e7-9162-f6b30ee15a8e)
I downloaded siglip-so400m-patch14-384 and filled in the path. What else do I n…
-
I read in the paper that you also used this method on a T5 variant model. How did you treat the embedding layer and the output layer of the model?
-
I would like to record some model activations in an architecture-invariant way.
In PyTorch, we can use [forward hooks](https://pytorch.org/docs/stable/generated/torch.nn.modules.module.register_modul…
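A minimal sketch of the hook-based approach described above: register a forward hook on every leaf module so the recording logic never references a specific architecture. The helper name `record_activations` and the stand-in `nn.Sequential` model are illustrative, not part of any existing codebase.

```python
import torch
import torch.nn as nn

def record_activations(model: nn.Module, inputs: torch.Tensor):
    """Run one forward pass and capture each leaf module's output by name."""
    activations = {}
    handles = []

    def make_hook(name):
        def hook(module, args, output):
            # Module outputs may be tensors or tuples; keep plain tensors only.
            if isinstance(output, torch.Tensor):
                activations[name] = output.detach()
        return hook

    for name, module in model.named_modules():
        # Hook leaf modules only, so container modules don't duplicate captures.
        if len(list(module.children())) == 0:
            handles.append(module.register_forward_hook(make_hook(name)))
    try:
        model(inputs)
    finally:
        # Always remove hooks so repeated calls don't stack them.
        for h in handles:
            h.remove()
    return activations

# Usage with a tiny stand-in model (any nn.Module works the same way):
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))
acts = record_activations(model, torch.randn(2, 8))
```

Because `named_modules()` walks the module tree generically, the same helper works unchanged on a Transformer, a CNN, or any other `nn.Module`.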
-
My model is
```json
{
"mlp_bias": false,
"attn_bias": false,
"rotary_base": 300000,
"rotary_scaling": null,
"residual_mlp": false,
"disable_weight_only_quant_plugin": false,
…
```
-
Error message:
```
Traceback (most recent call last):
File "./graphgpt/eval/run_graphgpt.py", line 244, in
run_eval(args, args.num_gpus)
File "./graphgpt/eval/run_graphgpt.py", line 98, in run_ev…
```
-
Why do the modified transformer model and the original transformer model have the same code in the timing prediction folder?
-
**Description**
Hi, I am trying to test the model with the original settings but ran into this issue. It seems something could be wrong with the embedding model, but I have no idea what. This problem occurs ei…
-
When I use stories15M or stories110M, I get an error.
```
File "D:\_LLM_project\Development\gpt-fast\generate.py", line 114, in speculative_decode
torch.cat([cur_token.view(1), draft_tokens])…
```
-
### Issue Type
Documentation Bug
### Source
source
### Keras Version
2.14
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 22.04
### Python version
3.10
…