BrambleXu / knowledge-graph-learning

A curated list of awesome knowledge graph tutorials, projects and communities.
MIT License
735 stars 120 forks source link

ACL-2018-Subcharacter Information in Japanese Embeddings: When Is It Worth It? #291

Open BrambleXu opened 4 years ago

BrambleXu commented 4 years ago

Summary:

subcharacter information对于中文是有效的,那么日文又如何呢?研究发现subcharacter对于中文的提升效果在日文上并不稳定(我想应该是有片假名和平假名的缘故吧)。但是在一些汉字比较多的场景下,character ngrams效果确实有提高。不过在实验中,发现即使是enhanced skip-gram 也比不上 single-character ngram fasttext。

Resource:

Paper information:

Notes:

image

fastText是subword level model,可以学习character n-grams。

image

Model Graph:

Result:

Thoughts:

Next Reading:

Crescentz commented 3 years ago

请问有开源么