-
Hi, I am hoping to fine-tune the UniDepth model on a specific video. I tried fine-tuning all layers of the decoder, but it is still relatively slow. Do you have any recommendations for which layers to fi…
-
Hi Ziming,
In Section 6 of your paper, you mentioned that KANs are practically 10X slower than MLPs. I am curious what you meant by it. Did you mean a KAN takes 10X as many steps to converge in com…
-
  File "/home/ma-user/anaconda3/envs/llava/lib/python3.10/site-packages/transformers/configuration_utils.py", line 264, in __getattribute__
    return super().__getattribute__(key)
AttributeError: '…
-
Here is a piece of code from the file mergekit/mergekit/moe/qwen.py:
`for model_ref in (
[config.base_model]
+ [e.source_model for e in config.experts]
+ [e…
-
Dear author,
When loading the pretrained checkpoint, some weights are not used. Is this normal?
```
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████…
```
-
https://github.com/xmu-xiaoma666/External-Attention-pytorch/blob/2f80b03ef1cdd835d4a2d21eff6f8b3534e5d601/model/attention/CoAtNet.py#L21
Correct me if I am wrong, but isn't an MLP usually a collection…
-
Hello, I made a small modification in the class PointNetSetAbstraction(nn.Module) by adding 1 to the variable in_channel. After making this change, I encountered an issue in the subsequent code:
py…
-
The TODO was a placeholder :)
https://github.com/flucoma/flucoma-core/blob/ab9c6501e8de8f118d313260ae02ebc5ba5ee2d2/include/data/FluidJSON.hpp#L406-L412
-
### System Info
- TensorRT-LLM version: 0.10.0.dev2024050700
(I doubt any other information is relevant)
### Who can help?
@kaiyux
### Information
- [ ] The official example scripts
- […
-
I tried using ChebyKAN to train on signal waveforms, but it generalized poorly. What might be the reason?
![image](https://github.com/SynodicMonth/ChebyKAN/assets/137387186/dc9a32ee-a567-44f4-aaf…