Adds feature pyramid attention (FPA) module, resolves #167

mapbox / robosat

Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds

MIT License

2.02k stars 382 forks source link

By now we have https://arxiv.org/abs/1904.11492 which not only compares various attention mechanisms but also comes up with a framework for visual attention and proposal a new global context block in this visual attention framework.

I've implemented

Self-attention (as in SAGAN, BIGGAN, etc.)
Simple self-attention (see paper above)
Global Context block (see paper above)

for my 3d video models in https://github.com/moabitcoin/ig65m-pytorch/blob/706c9e737e42d98086b3af24548fb2bb6a7dc409/ig65m/attention.py#L9-L103

for the 2d segmentation case here we can adapt the 3d code and then e.g. use a couple of global context blocks on top of the last (high level) resnet feature blocks.

attention from https://arxiv.org/abs/1904.11492

mapbox / robosat

Adds feature pyramid attention (FPA) module, resolves #167 #168