neologd / mecab-ipadic-neologd

Neologism dictionary based on the language resources on the Web for mecab-ipadic
Other
2.7k stars 288 forks source link

Add Left double quotation mark to Regexp.ja #70

Open jumon opened 4 years ago

jumon commented 4 years ago

Motivation

This issue is about Regexp.ja in a wiki.

以下の全角記号は半角記号に置換 /!”#$%&’()*+,−./:;<>?@[¥]^_`{|}

It recommends replacing Right double quotation mark(U+201D) to Quotation mark(U+0022) and not replacing Left double quotation mark(U+201C) to Quotation mark. I prefer both Right and Left double quotation mark to be replaced to Quotation mark in sentences like below.

ダブルクォテーションは日本語では“強調”のために使われる。
→ ダブルクォテーションは日本語では"強調"のために使われる。

Sorry if there is a specific reason why Left double quotation mark is not included in the rule.

Goal

My suggestion might look like this.

以下の全角記号は半角記号に置換
!“”#$%&’()*+,−./:;<>?@[¥]^_`{|}

In addition to adding Left double quotation mark(U+201C), I omitted Slash(U+002F), which is a half-width character, at the head of the line. I guess this is a mistake.

neologd commented 4 years ago

Thank you for reporting. We will report a result after updating.