shaigue / pmi_masking

This repository contains code that takes a text corpus and creates a PMI masking vocabulary for it.
MIT License
1 stars 0 forks source link

prunning & sampling strategies to handle large amounts of data #12

Open shaigue opened 1 year ago

shaigue commented 1 year ago

Different techniques for reducing complexity where considered:

Organize and present the different tradeoffs between those, and what was our decision.