-
Hi @Liuhong99 ,
I am a big fan of sophia used it cited it everytime. Just thought of suggesting you a new and less resource intensive experiment.
a) Karpathy updated the nano_gpt2 training [cod…
-
Hi there,
First of all, thank you for sharing your work here! It's been incredibly insightful.
I have a question regarding the use of min-max scaling in the online adaptive learning stage of the…
-
I want to use the npy2ckpt.py to transfer my own resnet50 pre-train model:
the layer name in my pre-train resnet50 model are:
bn4c_branch2c
bn5b_branch2b
res3d_branch2b
res2b_branch2b
…
y-kl8 updated
6 years ago
-
Using mean and standard deviation normalization is a common procedure in standard Computer Vision, and can be applied e.g. by using `torchvision`'s [Normalize](https://pytorch.org/vision/main/generate…
-
### Describe the bug
When `n_features > 1` and `normalization_y` is `False`, the `GaussianProcessRegressor.predict` seems to return bad std and cov results, as it doesn't consider the scale of the di…
-
The new family and approach of BatchNorm-free NN architectures look very perspective due to the lack of BatchNorm training support.
In the paper ["High-Performance Large-Scale Image Recognition Wit…
-
Hello and thank you for your effort
I want to train and test a model using the embedded code
But in both cases, it gives me the following error:
![Screenshot 2024-08-19 141631](https://github.com…
-
### Feature Name
To enhance the Deep Learning Playground's audio data processing capabilities, we aim to integrate the M5 network architecture, inspired by the M5 network. This architecture is crucia…
-
I am using your custom_definition.py as a model for training and I am facing this error when training start.
```
WARNING:tensorflow:Gradients do not exist for variables ['sync_batch_normalizat…
-
### Deep Learning Simplified Repository (Proposing new issue)
:red_circle: **Project Title** : Web Application for Deaf Community
:red_circle: **Aim**: Assisting individuals with hearing impairment…