issues
search
ybracke
/
transnormer
A lexical normalizer for historical spelling variants using a transformer architecture.
GNU General Public License v3.0
6
stars
1
forks
source link
Refactor data loading
#18
Closed
ybracke
closed
1 year ago
ybracke
commented
1 year ago
[x] Add test data
[x] Update tests
[x] Move data loading functions from
models.train_model
to
data.loader
[x] Add an XML data loader (see
https://git.zdl.org/ybracke/DTAEvalCorpus
)
[x] Storing additional information like publication name, year, etc. as additional columns of a dataset
[x] Loading functionality for RIDGES (Bollmann split)
[x] Loading functionality for Leipzig corpora
ybracke
commented
1 year ago
Closed with PR #24
models.train_model
todata.loader