verenablaschke / norwegian-dict-gwt

Web dictionary Norwegian/Bokmål 🇳🇴 > German 🇩🇪. A project for the course "Computational Lexicography on the Web" at the University of Tübingen.

missing entries: inferred from parallel corpus #23

Closed verenablaschke closed 5 years ago

verenablaschke commented 5 years ago

P. Lison and J. Tiedemann (2016). OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016).

verenablaschke commented 5 years ago

Corpus Name: OpenSubtitles
Package: OpenSubtitles in Moses format
Website: http://opus.nlpl.eu/OpenSubtitles-v2018.php
Release: v2018
Release date: Tue Apr 3 23:47:52 EEST 2018
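For context, a corpus in Moses format is a pair of plain-text files in which line i of one file is the translation of line i of the other. The following is only a minimal sketch of how candidate entries could be read off such line-aligned data, using a plain sentence-level co-occurrence score (Dice coefficient) rather than a real word aligner such as GIZA++. The function names `read_parallel` and `candidate_translations` and the toy sentences are hypothetical and are not taken from this project.

```python
from collections import Counter


def read_parallel(src_lines, tgt_lines):
    """Yield (source_tokens, target_tokens) from two line-aligned iterables."""
    for src, tgt in zip(src_lines, tgt_lines):
        src_tokens = src.lower().split()
        tgt_tokens = tgt.lower().split()
        # Skip pairs where either side is empty.
        if src_tokens and tgt_tokens:
            yield src_tokens, tgt_tokens


def candidate_translations(pairs, word, top_n=3):
    """Rank target words by the Dice coefficient of their sentence-level
    co-occurrence with `word` (a crude stand-in for word alignment)."""
    word = word.lower()
    word_count = 0          # sentence pairs whose source side contains `word`
    tgt_counts = Counter()  # sentence pairs containing each target word
    cooc = Counter()        # sentence pairs containing both
    for src_tokens, tgt_tokens in pairs:
        tgt_types = set(tgt_tokens)
        tgt_counts.update(tgt_types)
        if word in src_tokens:
            word_count += 1
            cooc.update(tgt_types)
    scores = {t: 2 * c / (word_count + tgt_counts[t]) for t, c in cooc.items()}
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


# Toy, made-up sentence pairs standing in for the two corpus files.
nb_lines = ["jeg har en hund", "hunden sover", "jeg ser en katt", "en hund bjeffer"]
de_lines = ["ich habe einen hund", "der hund schläft", "ich sehe eine katze", "ein hund bellt"]

top = candidate_translations(read_parallel(nb_lines, de_lines), "hund")
print(top[0][0])  # best-scoring candidate for "hund"
```

With real data, the two lists would be replaced by open file handles for the Norwegian and German sides of the download (opened with `encoding="utf-8"`). Because the score works on surface forms, inflected variants such as "hunden" count as separate words unless the text is lemmatized first.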

verenablaschke commented 5 years ago

@INPROCEEDINGS{och00:isa,
  AUTHOR = {F.~J.~Och and H.~Ney},
  TITLE = {Improved Statistical Alignment Models},
  BOOKTITLE = ACL00,
  PAGES = {440--447},
  ADDRESS = {Hongkong, China},
  MONTH = {October},
  YEAR = 2000
}

@INPROCEEDINGS{och00:aco,
  AUTHOR = {F.~J.~Och and H.~Ney},
  TITLE = {A Comparison of Alignment Models for Statistical Machine Translation},
  BOOKTITLE = COLING00,
  ADDRESS = {Saarbr\"ucken, Germany},
  YEAR = {2000},
  MONTH = {August},
  PAGES = {1086--1090}
}

verenablaschke commented 5 years ago

http://masatohagiwara.net/using-giza-to-obtain-word-alignment-between-bilingual-sentences.html
https://okapiframework.org/wiki/index.php/GIZA%2B%2B_Installation_and_Running_Tutorial

verenablaschke commented 5 years ago

should be improved:

verenablaschke commented 5 years ago

Example: nålemerker

verenablaschke commented 5 years ago

https://github.com/ohenrik/nb_news_ud_sm