[101] Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics

TL;DR

I read this because.. : multi-task learning with uncertainty!
task : semantic segmentation, instance segmentation, pixel-wise metric depth
problem : 이전의 멀티태스크 접근법은 loss들의 가중합인데 이 가중에 따라 성능이 매우 예민하게 움직인다.
idea : output y에 대해 가우시안으로 가정하고 MLE에 따라 추정하면 $\sigma$에 의해 각 task 자체의 noise와 상대적인 weight를 구할 수 있다. 즉 model weight $W$와 task dependent $\sigma_{task}$를 같이 최적화하자.
architecture : DeepLab V3(ResNet101 -> Atrous Spatial Pyramid Pooling) + 3개 태스크에 맞는 decoder
objective : CE(semantic segmentation), L1(instance segmentation, depth estimation)
baseline : task specific model, weighted multi-task model
data : CityScapes benchmark, depth image는 SGM이라는 모델로 pseudo-label 사용
evaluation : IoU, Instance Mean Error, Inverse Depth Mean Error
result : 3개의 태스크로 학습한게 segmentation, depth 예측에서 sota. instance segmentation은 2개로 학습한 곳에서 sota
contribution : 3 태스크로 학습한 모델이 처음이라고 하넹
limitation / things I cannot understand : 대충 결론적으로 보면 학습 가능한 weight 추가하고 이게 널뛰기 되지 않도록 Regularization term 추가한건데 mle 관점으로 해석되니까 보기에 아름답넹

multi-task loss weight에 따라 성능이 널뛰기 함

Epistemic uncertainty
- model에 의한 uncertainty, training data의 부족으로 인한 Uncertainty
Aleatroic uncertainty
- 데이터에 의한 uncertainty, data가 표현할 수 없는 정보에 대한 uncertainty.
- Data-dependent, Hetroscedatic
  - input data와 모델 아웃풋에 의해 결정되는 uncertainty
- Task-dependent, Homoscedastic
  - input data에 의존하지 않는 uncertainty

뭐라는지 안와닿네.. 어쨌든 이 논문에서는 마지막 task-dependent uncertainty에 대해 측정할거임