This repository contains code that takes a text corpus and creates a PMI masking vocabulary for it.
1
stars
0
forks
source link
try to disentangle the dataset loading from my code, so that anyone could provide it's own dataset. #24
Open
shaigue opened 1 year ago