APMplusplus / falkon

Towards an ecosystem of tasks and concepts related to Language Technologies
Apache License 2.0
3 stars 3 forks source link

Depreciated. Falcon is now part of festvox(https://github.com/festvox/festvox)

falkon

Towards an ecosystem of tasks related to Language Technologies. Inspired by Google Research. This repo combines design principles from Kaldi(https://github.com/kaldi-asr/kaldi) and festvox(https://github.com/festvox/festvox) but has quirks of its own.

This repo is mostly for my (and peer group's) learning. If you want SoTAs in NLP, check out NLP Progress

The goal is to make it easier to build and compare against baselines across tasks. Since there are many tasks, it might not be feasible to put all dependencies. Two alternatives: (1) Use a virtual env for each task like AWS (2) Put a docker image

Feel free to contribute. Easiest ways to get started are:

(1) Picking up a task (from below) or one of your choice and adding it. (2) Picking up an issue and working on it.

Tasks:

Concepts:

Paradigms:

Layers -> Modules -> Models

For example,

Conv1d++ class is a layer that enables temporal convolutions during eval.
ResidualDilatedCausalConv1d is a module built on top of Conv1d++
Wavenet is a model built on top of ResidualDilatedCausalConv1d

LSTM++ class is a layer that enables learning initial hidden states based on condition.
VariationalEncoderDecoder is a module built on top of LSTM++
ImageCaptioning is a model built on top of VariationalEncoderDecoder

src.nn hosts all of these.

The directoy 'tasks' contains the individual tasks. Updated a sample speech task. I have other pressing things and so the timeline on this repo looks like end of Summer 2020.