Unit 2: Explaining the "residual learning"

johko / computer-vision-course

This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord

MIT License

464 stars 144 forks source link

Unit 2: Explaining the "residual learning" #342

Open 0xD4rky opened 1 month ago

0xD4rky commented 1 month ago

I would like to explain the residual learning, introduced in the official paper, in depth.

I want to explain how learning (h(x)-x) is easier for the model rather than learning h(x) (where h(x) is the function that maps the input and output of the stacked layer).

Hence, allow me to raise a PR for updating the docs and you review the changes!

johko commented 4 weeks ago

Sounds great, feel free to write something up and create a Pr :+1:

0xD4rky commented 4 weeks ago

will do for sure!