david-thrower / cerebros-core-algorithm-alpha

The Cerebros package is an ultra-precise Neural Architecture Search (NAS) / AutoML that is intended to much more closely mimic biological neurons than conventional neural network architecture strategies.
Other
27 stars 4 forks source link

try-using-ternary-dense #151

Open david-thrower opened 3 weeks ago

david-thrower commented 3 weeks ago

Kind of issue: enhancement

Issue described: Try using a Ternary operation layer instead of a Dense layer, e.g. Replace each occurrence of tf.keras.layers.Dense with a custom layer like this:

import tensorflow as tf

class TernaryDenseLayer(tf.keras.layers.Layer):
    def __init__(self, units, input_dim, **kwargs):
        super(TernaryDenseLayer, self).__init__(**kwargs)
        self.units = units
        self.input_dim = input_dim
        self.ternary_weights = self.add_weight(name='ternary_weights', 
                                                shape=(input_dim, units),
                                                initializer='glorot_uniform',
                                                trainable=True)

    def build(self, input_shape):
        # Create a trainable weight variable for the bias
        self.bias = self.add_weight(name='bias', 
                                    shape=(self.units,),
                                    initializer='zeros',
                                    trainable=True)

    def call(self, inputs):
        # Apply ternary weights to the input vector
        ternary_inputs = tf.cast(tf.sign(inputs), tf.float32) * tf.abs(inputs)
        output = tf.matmul(ternary_inputs, self.ternary_weights)
        # Add bias and apply activation function
        output = tf.nn.bias_add(output, self.bias)
        output = tf.nn.relu(output)
        return output

Ultimately, it may be worth integrating this with what was done here: https://arxiv.org/pdf/2406.02528