mlp Search Results - Githubissues

1000+ results for mlp

lpiccinelli-eth/UniDepth #57

Finetuning

Hi, I am hoping to finetune the UniDepth model to a specific video. I tried finetuning all layers of the decoder but it is still relatively slow. Do you have any recommendations for which layers to fi…

Davidyao99 updated 2 weeks ago
1
KindXiaoming/pykan #38

Is KAN 10X slower per step of training, or does it need 10X …

Hi Ziming, in Section 6 of your paper, you mentioned that KANs are practically 10X slower than MLPs. I am curious what you meant by that. Did you mean a KAN takes 10X as many steps to converge in com…

cmsflash updated 2 months ago
2
LLaVA-VL/LLaVA-NeXT #62

llavaconfig error

File "/home/ma-user/anaconda3/envs/llava/lib/python3.10/site-packages/transformers/configuration_utils.py", line 264, in __getattribute__ return super().__getattribute__(key) AttributeError: '…

homiec updated 1 week ago
5
arcee-ai/mergekit #361

There are still some problems with moe merge qwen with other…

Here is one piece of code in the file mergekit/mergekit/moe/qwen.py: `for model_ref in ( [config.base_model] + [e.source_model for e in config.experts] + [e…

aoyinke updated 36 minutes ago
2
TencentARC/ST-LLM #12

Some weights of the model checkpoint at stllm/output/instruc…

Dear author: when loading the pretrained ckpt, some weights are not used. Is this normal? ``` Loading checkpoint shards: 100%|██████████…

dragen1860 updated 1 month ago
1
xmu-xiaoma666/External-Attention-pytorch #51

MLP Confusion

https://github.com/xmu-xiaoma666/External-Attention-pytorch/blob/2f80b03ef1cdd835d4a2d21eff6f8b3534e5d601/model/attention/CoAtNet.py#L21 Correct me if I am wrong, but isn't an MLP usually a collection…

abhimanyuchadha96 updated 1 year ago
1
yanx27/Pointnet_Pointnet2_pytorch #264

PointNetSetAbstraction question

Hello, I made a small modification in the class PointNetSetAbstraction(nn.Module) by adding 1 to the variable in_channel. After making this change, I encountered an issue in the subsequent code: py…

Lemon2048 updated 3 weeks ago
1
flucoma/flucoma-core #268

MLP should do the actual JSON check

The TODO was a placeholder :) https://github.com/flucoma/flucoma-core/blob/ab9c6501e8de8f118d313260ae02ebc5ba5ee2d2/include/data/FluidJSON.hpp#L406-L412

tremblap updated 2 months ago
2
NVIDIA/TensorRT-LLM #1724

Conditionals seem to be evaluated eagerly

### System Info - TensorRT-LLM version: 0.10.0.dev2024050700 (I doubt any other information is relevant) ### Who can help? @kaiyux ### Information - [ ] The official example scripts - […

CrimsonRadiator updated 18 hours ago
5
SynodicMonth/ChebyKAN #9

Poor generalization

I tried using ChebyKAN to train on signal waveforms, but it showed poor generalization. What might be the reason? ![image](https://github.com/SynodicMonth/ChebyKAN/assets/137387186/dc9a32ee-a567-44f4-aaf…

buptlittlecabbage updated 5 days ago
6
