-
### Zig Version
0.13.0-dev.266+0b0625ccf
### Steps to Reproduce and Observed Behavior
I am learning Zig by experimenting with writing a simple neural network code and experience a compiler crash. I…
xhook updated
4 weeks ago
-
Hello!
I was playing around with the parallelization of MLP-MDs in mlp-train.
I have used the mlptrain/sampling/tests/test_umbrella.py as a template for my tests. I have notinced that each window in…
-
Thanks for the great work on KAN and for sparking such interest! I haven't seen anyone report this issue but may have missed it:
The MLP training method from the Knot Theory Invariant Colab not…
-
I tried using ChebyKAN to train signal waveforms, but it showed poor generalization. What may be the reason??
![image](https://github.com/SynodicMonth/ChebyKAN/assets/137387186/dc9a32ee-a567-44f4-aaf…
-
During model inference, model weight is frozen and won't change between iterations. CPU prefers special weight layout to accelerate the execution, then we need to prepack the model weight before model…
-
I am trying to predict certain function coefficients (output: a, b) based on its curve (input: frequency_response) with the help of [Kolmogorov-Arnold Network](https://arxiv.org/abs/2404.19756) and yo…
-
Dear team,
I am trying to replicate the results by using the same training script for dataset=perov_5 following instructions here.
When loading the model in a Jupyter notebook, I get the following…
-
How can I modify it so that it can operate on the last dimension like a real MLP?
-
In the file megatron/core/models/gpt/gpt_layer_specs.py line 95, on the line "linear_fc1=TELayerNormColumnParallelLinear if use_te else ColumnParallelLinear" why is it TELayerNormColumnParallelLinear …
-
The the example for [time series classification with transformer](https://keras.io/examples/timeseries/timeseries_classification_transformer/), the the function `build_model()` is defined as:
```
…