Matrix-Vector Library Designed for Neural Network Construction. cuda (gpu) support, openmp (multithreaded cpu) support, partial support of BLAS, expression template based implementation PTX code generation identical to hand written kernels, and support for auto-differentiation
Binary_Reduction Expressions currently will have the dimensionality/shape of the
the right-value expression, opposed to the left-value
IE
vec += mat // will have dims=2 and shape of the matrix.
This will be evaluated internally and will still return a vector,
This currently does not effect any user code, this is just to make a note of this oddity.
--> Fix would involve creating a specialization of Binary_Expression for Broadcasted-Reductions
Binary_Reduction Expressions currently will have the dimensionality/shape of the the right-value expression, opposed to the left-value
IE
vec += mat // will have dims=2 and shape of the matrix.
This will be evaluated internally and will still return a vector, This currently does not effect any user code, this is just to make a note of this oddity.
--> Fix would involve creating a specialization of Binary_Expression for Broadcasted-Reductions