[CLOSED] Adding Sentence Order Prediction [WIP]

nyu-mll / jiant-v1-legacy

The jiant toolkit for general-purpose text understanding models

MIT License

21 stars 9 forks source link

[CLOSED] Adding Sentence Order Prediction [WIP] #1058

Closed jeswan closed 4 years ago

jeswan commented 4 years ago

Issue by pruksmhc Friday Apr 10, 2020 at 21:22 GMT Originally opened as https://github.com/nyu-mll/jiant/pull/1058

Adding Sentence Order Prediction Task for ALBERT What this version of MLM supports: ALBERT embedder. Below are the runs for MLM + SOP + intermediate task (which is how we intend to use SOP).

Task	Task performance (best ALBERT finetuned on task)	MLM perplexity	SOP performance
CCG	0.944	5.123	0.978
QQP	0.8594	4.303	0.8428

pruksmhc included the following code: https://github.com/nyu-mll/jiant/pull/1058/commits

jeswan commented 4 years ago

Comment by pep8speaks Friday Apr 10, 2020 at 21:22 GMT

Hello @pruksmhc! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file jiant/models.py:

Line 177:101: E501 line too long (103 > 100 characters) Line 708:17: W291 trailing whitespace Line 1227:80: W291 trailing whitespace Line 1228:41: W291 trailing whitespace

In the file jiant/modules/simple_modules.py:

Line 190:101: E501 line too long (103 > 100 characters)

You can repair most issues by installing black and running: black -l 100 ./*. If you contribute often, have a look at the 'Contributing' section of the README for instructions on doing this automatically.

Comment last updated at 2020-04-11 01:21:21 UTC

jeswan commented 4 years ago

Comment by sleepinyourhat Friday Apr 10, 2020 at 22:33 GMT

That was fast! Taking a look shortly...

jeswan commented 4 years ago

Comment by sleepinyourhat Friday Apr 10, 2020 at 22:39 GMT

General sanity check before we run anything big/expensive:

Make sure that we're getting better-than-chance SOP accuracy out of the box with ALBERT, before any training.

jeswan commented 4 years ago

Comment by pruksmhc Saturday Apr 11, 2020 at 01:25 GMT

I'm reworking this task entirely (basically not making it inherit from Pairclassificationtask) to make the code most cleaner/clearer, so the game plan is to incorporate comments from here. I'm going to start another PR, since most of these comments will be resolved with this change.