Towards an ecosystem of tasks related to Language Technologies. Inspired by Google Research. This repo combines design principles from Kaldi(https://github.com/kaldi-asr/kaldi) and festvox(https://github.com/festvox/festvox) but has quirks of its own.
This repo is mostly for my (and peer group's) learning. If you want SoTAs in NLP, check out NLP Progress
The goal is to make it easier to build and compare against baselines across tasks. Since there are many tasks, it might not be feasible to put all dependencies. Two alternatives: (1) Use a virtual env for each task like AWS (2) Put a docker image
(1) Picking up a task (from below) or one of your choice and adding it. (2) Picking up an issue and working on it.
Tasks:
Concepts:
Paradigms:
Layers -> Modules -> Models
For example,
Conv1d++ class is a layer that enables temporal convolutions during eval.
ResidualDilatedCausalConv1d is a module built on top of Conv1d++
Wavenet is a model built on top of ResidualDilatedCausalConv1d
LSTM++ class is a layer that enables learning initial hidden states based on condition.
VariationalEncoderDecoder is a module built on top of LSTM++
ImageCaptioning is a model built on top of VariationalEncoderDecoder
src.nn hosts all of these.
The directoy 'tasks' contains the individual tasks. Updated a sample speech task. I have other pressing things and so the timeline on this repo looks like end of Summer 2020.