synapse-alpha / mirror-neuron

Experiments on bittensor reward models to find exploits
BSD 2-Clause "Simplified" License

Trainable dendrite pool #59

Open steffencruz opened 1 year ago

steffencruz commented 1 year ago

In the neuron source code there is a backward call to the dendrite pool:

        # Pass rewards backward for potential PPO.
        if train_network:
            self.dendrite_pool.backward( 
                forward_calls = forward_calls,
                rewards = rewards,
                timeout = timeout
            )
            bittensor.logging.trace( 'Applied backward to network.' )

What does this call do, and how can we include it? @joeylegere

Possible solution

Make use of fitness values in DummyDendritePool #68 and adjust uid-specific fitness based on rewards in the backward call
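
A minimal sketch of what that could look like, assuming each forward call carries the uid it was sent to. `SketchDendritePool`, `ForwardCall`, `alpha`, and the moving-average update are all hypothetical names and choices made for illustration; they are not the actual `DummyDendritePool` or bittensor API, and only the `backward(forward_calls, rewards, timeout)` keyword names come from the snippet above.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Sequence


@dataclass
class ForwardCall:
    """Hypothetical record of one forward call; only the uid is used here."""
    uid: int


class SketchDendritePool:
    """Illustrative mock pool that keeps one fitness value per uid (assumption)."""

    def __init__(self, uids: List[int], initial_fitness: float = 0.5, alpha: float = 0.1):
        self.fitness: Dict[int, float] = {uid: initial_fitness for uid in uids}
        self.alpha = alpha  # assumed step size for the moving-average update

    def backward(
        self,
        forward_calls: Sequence[ForwardCall],
        rewards: Sequence[float],
        timeout: Optional[float] = None,  # accepted to mirror the call site; unused
    ) -> None:
        if len(forward_calls) != len(rewards):
            raise ValueError("forward_calls and rewards must have the same length")
        # Move each queried uid's fitness toward the reward it just received.
        for call, reward in zip(forward_calls, rewards):
            old = self.fitness[call.uid]
            self.fitness[call.uid] = (1 - self.alpha) * old + self.alpha * float(reward)


pool = SketchDendritePool(uids=[0, 1], initial_fitness=0.5, alpha=0.5)
pool.backward(forward_calls=[ForwardCall(uid=0)], rewards=[1.0], timeout=None)
```

With this update rule only the uids that were actually queried change, so a mock miner's fitness drifts toward the rewards the validator assigns it. Whether that resembles what the real backward call does is exactly the open question in this issue.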

steffencruz commented 1 year ago

@joeylegere An interesting question is: "Does backprop into the network have a beneficial effect on the network, or just on the validator?"

steffencruz commented 1 year ago

This requires some thought and we're not convinced that it's valuable to us.