yjxiong / temporal-segment-networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
BSD 2-Clause "Simplified" License

How does the framework store the BN statistics? #209

Closed h-bo closed 6 years ago

h-bo commented 6 years ago

Issue from https://github.com/yjxiong/action-detection/issues/55

I rewrote the TSN model in TensorFlow. But when reusing the pretrained model you provide, I found the results worse than expected. I suspect your BN layer might be a little different from simply x = (x - mean_x) / var_x * scale + bias. Is that so? If it is, can you please give an introduction to your BN layer? Thanks.

There are 4 params for the BN layer, and I think the order is scale, offset, mean, variance.
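
For reference, the inference-time transform in the BN paper normalizes by the square root of the variance plus a small epsilon, not by the variance itself. The sketch below contrasts the two for a single channel. It is only an illustration: the function names and the `eps` default are assumptions, and the parameter order (scale, offset, mean, variance) is the guess stated above, which this thread does not confirm.

```python
import math

def batch_norm_inference(x, scale, offset, mean, variance, eps=1e-5):
    """Per-channel BN at inference time, following the BN paper.

    x is a list of activations for one channel. The eps default is an
    assumed value; the framework that produced the weights may use another.
    """
    inv_std = 1.0 / math.sqrt(variance + eps)
    return [scale * (v - mean) * inv_std + offset for v in x]

def divide_by_variance(x, scale, offset, mean, variance):
    # The formula as quoted in the question: divides by the variance itself.
    return [(v - mean) / variance * scale + offset for v in x]

x = [1.0, 3.0, 5.0]
# eps=0.0 here only to keep the arithmetic easy to check by hand.
standard = batch_norm_inference(x, scale=2.0, offset=0.5, mean=3.0, variance=4.0, eps=0.0)
quoted = divide_by_variance(x, scale=2.0, offset=0.5, mean=3.0, variance=4.0)
print(standard)  # [-1.5, 0.5, 2.5]
print(quoted)    # [-0.5, 0.5, 1.5]
```

The two agree only when the variance is 1, so a reimplementation that divides by the stored variance directly would produce different activations even with the same four parameters.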

yjxiong commented 6 years ago

As I said, it is exactly the same implementation as in the BN paper. The only reason for the difference that I can think of is that TensorFlow handles BN statistics differently. But I am not able to help you with that.

If you want to use a modern framework, why not use PyTorch? We have an official PyTorch implementation available, and TensorFlow is not on our plate right now.