junxnone / aiwiki

AI Wiki
https://junxnone.github.io/aiwiki
18 stars 2 forks source link

paper DETR #165

Open junxnone opened 3 years ago

junxnone commented 3 years ago

DETR

Arch

image -- | image image image

Set Prediction loss

Positional Encoding

Detection Output/Prediction FFNs

class_embed = nn.Linear(hidden_dim, num_classes + 1)
bbox_embed = MLP(hidden_dim, hidden_dim, 4, 3)

Segmentation

image image
Original Image Segmentation Output

Prob > 0.9 -- | image

Evaluation

Num Evaluation
1 image
2 image

Reference

junxnone commented 3 years ago

junxnone/tech-io#913