Closed faseelmo closed 9 months ago
Think if task having task id as a feature is useful. Find a better way to show mapping on the feature than using Manhattan distance. Learn how XY routing works.
Train on one single task, but a large dataset of different mappings.
use those weights then to train on multiple tasks. Maybe freeze some weights?