JMdictProject / JMdictIssues

JMdict Japanese dictionary - lexicographic, etc. issues management
16 stars 1 forks source link

Adding derivative information to adjectival entries #40

Open Marcusjmdict opened 2 years ago

Marcusjmdict commented 2 years ago

Re-posting something I added to the mailing list in 2018 (March 24), that failed to gain traction. Giving it a second go.

Daijisen and Daijirin both list the derivatives -げ, -さ, -み and -がる at the bottom of adjectival entries, as (派生), e.g. for 痛い: daijs:[派生] -が・る ( 動ラ五[四] ) -げ ( 形動 ) -さ ( 名 ) -み ( 名 ) daijr: [派生]いたがる[動ラ五]いたげ[形動]いたさ[名]

Would it not be useful to add something like that to entries in JMdict?

Preferably the derivatives shouldn't be bound to any particular sense but be "entry-wide". If it's technically possibly, they could perhaps also take an optional xref value, as we actually have entries for a handful of these derivatives.

on JMDict's 痛い entry, then, these tags could perhaps look like this: [der-mi=1587300] [der-sa] [der-ge] [der-garu=2035530]

(and should, for formatting purposes, preferably appear at the very bottom of the entry)

(there are no entries for 痛さ or 痛げ in jmdict)

I'm thinking a dictionary app could then choose to display it something like this: https://i.imgur.com/GwEAmcg.png (mock-up)

EDIT: we could also maybe add -的 [der-teki] for the various nouns etc. that take it reasonably often, though on the other hand I guess unlike the adjectival derivatives, it can be applied to basically any noun without it being grammatically incorrect, so it'd probably be less useful than the others)

JMdictProject commented 2 years ago

I think that sort of thing would be very useful, and I flagged it in the "New Generation" page at http://www.edrdg.org/wiki/index.php/JMdict:_Next_Generation#Entry-wide_Inflection_Pattern_Elements As I wrote there "The format of the element has yet to be decided." I think we need to get the container first.