[x] Datasets should probably share char2idx as we want unseen chars to be generated as well (Joseph)
[x] Sort through the data points in increaseing video length
[x] Fix any dataset problem (e.g. micro not loading) (Joseph)
[ ] Check [INAUDIBLE] token #13 (Joseph)
[ ] Write something that puts dataloader/model/training/eval/logging/etc. together. Somethine like the current test_better_model.py but more sophisticated? Perhaps with tensorboard support (Joseph)
[x] Finish eval function with loss and CER output (Zhaofeng)
[ ] (optional) Plug in a WER implementation into eval function (Yutong)
Dataset
s should probably sharechar2idx
as we want unseen chars to be generated as well (Joseph)test_better_model.py
but more sophisticated? Perhaps with tensorboard support (Joseph)