fastmachinelearning / hls4ml

Machine learning on FPGAs using HLS

https://fastmachinelearning.org/hls4ml

Apache License 2.0

1.18k stars 388 forks source link

Rebased version the PR to add support for ConvTranspose layers #844

Open jmitrevs opened 11 months ago

jmitrevs commented 11 months ago

Description

This is a rebased version of #644. Please look there for details, but in summary it adds support for both io_stream and io_parallel compilation of Conv1DTranspose and Conv2DTranspose.

Type of change

[x] New feature (non-breaking change which adds functionality)

Tests

Still a to-do.

Checklist

[x] I have read the guidelines for contributing.
[ ] I have commented my code, particularly in hard-to-understand areas.
[ ] I have made corresponding changes to the documentation.
[ ] My changes generate no new warnings.
[x] I have installed and run pre-commit on the files I edited or added.
[ ] I have added tests that prove my fix is effective or that my feature works.

vloncar commented 11 months ago

Why do we need multidimensional weights here? Passing pointers doesn't work as expected? I like the idea, we'll need it in the future for Vitis, but the keep_dims approach feels very clunky.

jmitrevs commented 11 months ago

I pushed a first version of a test and tried to make initial fixes, though there are many remaining errors.

jmitrevs commented 11 months ago

Looking at the new test, the results see to be:

y_keras.shape=(10, 7, 7, 4)
y_hls4ml.shape=(10, 100)

and model_default_t w2[1][1][108]; seem to be the weight dimensions. Will need to investigate further.

jmitrevs commented 11 months ago

The output dimensions inconsistency is for padding = "valid". For "same" the output is flattened but there are the correct number of values. However, they still don't match.