gradient-activation Search Results

1000+ results
for gradient-activation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

EleutherAI/gpt-neox #1248

batch_input and elapsed time per iteration suddenly slow dow…

# Batch_input and elapsed time per iteration slow down during model training ![微信图片编辑_20240629150957](https://github.com/EleutherAI/gpt-neox/assets/140717408/dae875c7-c01f-47e0-8767-aa8fe53cd476) …

Yuhanleeee updated 1 month ago
4
OpenLMLab/LOMO #47

为什么LOMO并没有火起来呢？

个人感觉全参数FT还是会比LoRA这种Adapter的效果要好的，那为什么LOMO没有火起来呢？个人已经试过2张24GB的显卡用LOMO FT一个7B的BLOOM，感觉整体流程还蛮丝滑的，为什么在各个平台搜不到太多用LOMO的人呢，好奇怪。

Flywolfs updated 9 months ago
5
psi4/psi4 #3077

Erying Equation and Enthalpy of Activation

Hi, Forgive me if this is elementary. I want to compute reaction rates between an open and closed ring in a molecule. If we take a z-matrix of the input and output And pass it throug…

Sulstice updated 11 months ago
2
carpedm20/simulated-unsupervised-tensorflow #30

activation_fn in refiner and discriminator is default None.

In layers.py ```python def conv2d(inputs, num_outputs, kernel_size, stride, layer_dict={}, activation_fn=None, #weights_initializer=tf.random_normal_initializer(0, 0.001), …

shimacos37 updated 5 years ago
1
raghakot/keras-vis #109

Visualising MNIST Filters

Hi I'm trying this visualisation library and ran on a simple MNIST network. I'm comparing activation maximisation used here from the the one described here: https://blog.keras.io/how-convolutional-…

CMCDragonkai updated 5 years ago
1
jax-ml/jax #23085

Performance Discrepancy in `jax.value_and_grad` for Differen…

### Description I'm experiencing a significant performance difference when using `jax.value_and_grad` with different `argnums` values. Specifically, when setting `argnums=0`, the computation is abo…

demon2036 updated 1 month ago
6
mllam/neural-lam #19

Feature Request: Add Functionality to Apply Constraints to P…

I am proposing the addition of a new method to our model class, designed to apply constraints to predictions to ensure that the values fall within specified bounds. This functionality would be useful …

sadamov updated 5 months ago
1
yosinski/deep-visualization-toolbox #41

Question: what does the maximally activate some neuron mean …

How to define the maximally activate some neuron exactly ?

thincal updated 7 years ago
2
t-kalinowski/deep-learning-with-R-2nd-edition-code #6

"object 'optimizer' not found" error when fit() custom model

Hi! It looks like `compile()` ignores an _optimizer_ argument when compiling/training a custom model. When i try this code: `model %>% compile(optimizer = optimizer_rmsprop())` (766th row in the bo…

ggeeoorrgg updated 1 year ago
4
distillpub/post--growing-ca #10

Separating evolution from representation

Hello, First thank you for this amazing post. I tried to modify the model, to separate the evolution form the representation, meaning that I have a function that evolve the state and at the end a fu…

organic-chemistry updated 4 years ago
1

上一页 1...15 16 17 18 19 20 21...100 下一页

1000+ results for gradient-activation

1000+ results
for gradient-activation