nyu-mll / jiant-v1-legacy

The jiant toolkit for general-purpose text understanding models
MIT License
20 stars 9 forks source link

jiant is an nlp toolkit

CircleCI Code style: black

This is the legacy version of jiant. If starting a new project, we recommend you use the new version of jiant here.

jiant is a software toolkit for natural language processing research, designed to facilitate work on multitask learning and transfer learning for sentence understanding tasks.

A few things you might want to know about jiant:

For a full system overview of jiant version 1.3.0, see https://arxiv.org/abs/2003.02249.

Getting Started

To find the setup instructions for using jiant and to run a simple example demo experiment using data from GLUE, follow this getting started tutorial!

Contributing

Guidelines for contributing to jiant are available here.

Official Documentation

Our official documentation is here: https://jiant.info/documentation#/

Running

To run an experiment, make a config file similar to jiant/config/demo.conf with your model configuration. In addition, you can use the --overrides flag to override specific variables. For example:

python main.py --config_file jiant/config/demo.conf \
    --overrides "exp_name = my_exp, run_name = foobar, d_hid = 256"

will run the demo config, but output to $JIANT_PROJECT_PREFIX/my_exp/foobar. To run the demo config, you will have to set environment variables. The best way to achieve that is to follow the instructions in user_config_template.sh

Suggested Citation

If you use jiant in academic work, please cite it directly:

@misc{wang2019jiant,
    author = {Alex Wang and Ian F. Tenney and Yada Pruksachatkun and Phil Yeres and Jason Phang and Haokun Liu and Phu Mon Htut and and Katherin Yu and Jan Hula and Patrick Xia and Raghu Pappagari and Shuning Jin and R. Thomas McCoy and Roma Patel and Yinghui Huang and Edouard Grave and Najoung Kim and Thibault F\'evry and Berlin Chen and Nikita Nangia and Anhad Mohananey and Katharina Kann and Shikha Bordia and Nicolas Patry and David Benton and Ellie Pavlick and Samuel R. Bowman},
    title = {\texttt{jiant} 1.3: A software toolkit for research on general-purpose text understanding models},
    howpublished = {\url{http://jiant.info/}},
    year = {2019}
}

Papers

jiant has been used in these four papers so far:

To exactly reproduce experiments from the ELMo's Friends paper use the jsalt-experiments branch. That will contain a snapshot of the code as of early August, potentially with updated documentation.

For the edge probing paper and the BERT layer paper, see the probing/ directory.

For the function word probing paper, use this branch and refer to the instructions in the scripts/fwords/ directory.

For the BERT NPI paper follow the instructions in scripts/bert_npi on the blimp-and-npi branch.

Getting Help

Post an issue here on GitHub if you have any problems, and create a pull request if you make any improvements (substantial or cosmetic) to the code that you're willing to share.

Releases

Releases are identified using git tags and distributed via PyPI for pip installation. After passing CI tests and creating a new git tag for a release, it can be uploaded to PyPI by running:

# create distribution
python setup.py sdist bdist_wheel

# upload to PyPI
python -m twine upload dist/*

More details can be found in setup.py.

License

This package is released under the MIT License. The material in the allennlp_mods directory is based on AllenNLP, which was originally released under the Apache 2.0 license.

Acknowledgments