globalwordnet / english-wordnet

The Open English WordNet
https://en-word.net/
Other
469 stars 57 forks source link

Highly similar adjectives and their satellites #35

Open jmccrae opened 5 years ago

jmccrae commented 5 years ago

There are some adjectives where a satellite adjective is marked as being similar to a standard adjective however the definitions are nearly indistinguishable. These should probably be merged into a single adjective synset

The synset:

<Synset id="ewn-00799892-s" ili="i4384" partOfSpeech="s" dc:subject="adj.all">
  <Definition>characteristic of acting or a stage performance; often affected</Definition>
  <SynsetRelation relType="similar" target="ewn-00799602-a"/>
  <Example>&quot;histrionic gestures&quot;</Example>
  <Example>&quot;an attitude of melodramatic despair&quot;</Example>
  <Example>&quot;a theatrical pose&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-00799602-a" ili="i4383" partOfSpeech="a" dc:subject="adj.all">
  <Definition>suited to or characteristic of the stage or theater</Definition>
  <SynsetRelation relType="similar" target="ewn-00799892-s"/>
  <SynsetRelation relType="similar" target="ewn-00800146-s"/>
  <SynsetRelation relType="similar" target="ewn-00800270-s"/>
  <Example>&quot;a theatrical pose&quot;</Example>
  <Example>&quot;one of the most theatrical figures in public life&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-01905041-s" ili="i10411" partOfSpeech="s" dc:subject="adj.all">
  <Definition>careful in regard to your own interests</Definition>
  <SynsetRelation relType="similar" target="ewn-01904196-a"/>
  <Example>&quot;the prudent use and development of resources&quot;</Example>
  <Example>&quot;wild squirrels are provident&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-01904196-a" ili="i10408" partOfSpeech="a" dc:subject="adj.all">
  <Definition>careful and sensible; marked by sound judgment</Definition>
  <SynsetRelation relType="also" target="ewn-00309819-a"/>
  <SynsetRelation relType="also" target="ewn-01900825-a"/>
  <SynsetRelation relType="also" target="ewn-02003477-a"/>
  <SynsetRelation relType="also" target="ewn-02579686-a"/>
  <SynsetRelation relType="similar" target="ewn-01904557-s"/>
  <SynsetRelation relType="similar" target="ewn-01904789-s"/>
  <SynsetRelation relType="similar" target="ewn-01905041-s"/>
  <SynsetRelation relType="similar" target="ewn-01905234-s"/>
  <Example>&quot;a prudent manager&quot;</Example>
  <Example>&quot;prudent rulers&quot;</Example>
  <Example>&quot;prudent hesitation&quot;</Example>
  <Example>&quot;more prudent to hide than to fight&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-01089902-s" ili="i5965" partOfSpeech="s" dc:subject="adj.all">
  <Definition>crammed with food</Definition>
  <SynsetRelation relType="similar" target="ewn-01086845-a"/>
  <SynsetRelation relType="exemplifies" target="ewn-07089193-n"/>
  <Example>&quot;a full stomach&quot;</Example>
  <Example>&quot;I feel stuffed&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-01086845-a" ili="i5947" partOfSpeech="a" dc:subject="adj.all">
  <Definition>containing as much or as many as is possible or normal</Definition>
  <SynsetRelation relType="attribute" target="ewn-14475377-n"/>
  <SynsetRelation relType="similar" target="ewn-01087442-s"/>
  <SynsetRelation relType="similar" target="ewn-01087711-s"/>
  <SynsetRelation relType="similar" target="ewn-01087779-s"/>
  <SynsetRelation relType="similar" target="ewn-01087985-s"/>
  <SynsetRelation relType="similar" target="ewn-01088174-s"/>
  <SynsetRelation relType="similar" target="ewn-01088263-s"/>
  <SynsetRelation relType="similar" target="ewn-01088332-s"/>
  <SynsetRelation relType="similar" target="ewn-01088588-s"/>
  <SynsetRelation relType="similar" target="ewn-01088785-s"/>
  <SynsetRelation relType="similar" target="ewn-01088855-s"/>
  <SynsetRelation relType="similar" target="ewn-01088956-s"/>
  <SynsetRelation relType="similar" target="ewn-01089130-s"/>
  <SynsetRelation relType="similar" target="ewn-01089350-s"/>
  <SynsetRelation relType="similar" target="ewn-01089537-s"/>
  <SynsetRelation relType="similar" target="ewn-01089626-s"/>
  <SynsetRelation relType="similar" target="ewn-01089731-s"/>
  <SynsetRelation relType="similar" target="ewn-01089807-s"/>
  <SynsetRelation relType="similar" target="ewn-01089902-s"/>
  <SynsetRelation relType="similar" target="ewn-01090028-s"/>
  <SynsetRelation relType="similar" target="ewn-01090142-s"/>
  <Example>&quot;a full glass&quot;</Example>
  <Example>&quot;a sky full of stars&quot;</Example>
  <Example>&quot;a full life&quot;</Example>
  <Example>&quot;the auditorium was full to overflowing&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-01078146-s" ili="i5895" partOfSpeech="s" dc:subject="adj.all">
  <Definition>containing salt</Definition>
  <SynsetRelation relType="similar" target="ewn-01077510-a"/>
  <Example>&quot;a saline substance &quot;</Example>
  <Example>&quot;salty tears&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-01077510-a" ili="i5892" partOfSpeech="a" dc:subject="adj.all">
  <Definition>containing or filled with salt</Definition>
  <SynsetRelation relType="also" target="ewn-02403682-a"/>
  <SynsetRelation relType="similar" target="ewn-01077750-s"/>
  <SynsetRelation relType="similar" target="ewn-01078023-s"/>
  <SynsetRelation relType="similar" target="ewn-01078146-s"/>
  <SynsetRelation relType="similar" target="ewn-01078270-s"/>
  <Example>&quot;salt water&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-00302292-s" ili="i1671" partOfSpeech="s" dc:subject="adj.all">
  <Definition>may be computed or estimated</Definition>
  <SynsetRelation relType="similar" target="ewn-00302053-a"/>
  <Example>&quot;a calculable risk&quot;</Example>
  <Example>&quot;computable odds&quot;</Example>
  <Example>&quot;estimable assets&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-00302053-a" ili="i1670" partOfSpeech="a" dc:subject="adj.all">
  <Definition>able to be calculated or estimated</Definition>
  <SynsetRelation relType="also" target="ewn-00742137-a"/>
  <SynsetRelation relType="similar" target="ewn-00302292-s"/>
  <SynsetRelation relType="similar" target="ewn-00302449-s"/>
  <Example>&quot;a calculable risk&quot;</Example>
  <Example>&quot;calculable odds&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-00566245-s" ili="i3147" partOfSpeech="s" dc:subject="adj.all">
  <Definition>connecting or tending to connect</Definition>
  <SynsetRelation relType="similar" target="ewn-00565913-a"/>
  <Example>&quot;connective remarks between chapters&quot;</Example>
  <Example>&quot;connective tissue in animals&quot;</Example>
  <Example>&quot;conjunctive tissue in plants&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-00565913-a" ili="i3145" partOfSpeech="a" dc:subject="adj.all">
  <Definition>serving or tending to connect</Definition>
  <SynsetRelation relType="similar" target="ewn-00566054-s"/>
  <SynsetRelation relType="similar" target="ewn-00566245-s"/>
</Synset>

The synset:

<Synset id="ewn-01965069-s" ili="i10722" partOfSpeech="s" dc:subject="adj.all">
  <Definition>holding appropriate documentation and officially on record as qualified to perform a specified function or practice a specified skill</Definition>
  <SynsetRelation relType="similar" target="ewn-01964837-a"/>
  <Example>&quot;a registered pharmacist&quot;</Example>
  <Example>&quot;a registered hospital&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-01964837-a" ili="i10721" partOfSpeech="a" dc:subject="adj.all">
  <Definition>listed or recorded officially</Definition>
  <SynsetRelation relType="similar" target="ewn-01965069-s"/>
  <SynsetRelation relType="similar" target="ewn-01965322-s"/>
  <Example>&quot;record is made of `registered mail&apos; at each point on its route to assure safe delivery&quot;</Example>
  <Example>&quot;registered bonds&quot;</Example>
</Synset>

The synset:

<Synset id="ewn-00355287-s" ili="i1965" partOfSpeech="s" dc:subject="adj.all">
  <Definition>altered or revised by rephrasing or by adding or deleting material</Definition>
  <SynsetRelation relType="similar" target="ewn-00355201-a"/>
  <Example>&quot;the amended bill passed easily&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-00355201-a" ili="i1964" partOfSpeech="a" dc:subject="adj.all">
  <Definition>of legislation</Definition>
  <SynsetRelation relType="similar" target="ewn-00355287-s"/>
</Synset>

The synset:

<Synset id="ewn-00835235-s" ili="i4571" partOfSpeech="s" dc:subject="adj.all">
  <Definition>lacking in schooling</Definition>
  <SynsetRelation relType="similar" target="ewn-00834256-a"/>
  <Example>&quot;untaught people whose verbal skills are grossly deficient&quot;</Example>
  <Example>&quot;an untutored genius&quot;</Example>
  <Example>&quot;uneducated children&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-00834256-a" ili="i4567" partOfSpeech="a" dc:subject="adj.all">
  <Definition>not having a good education</Definition>
  <SynsetRelation relType="also" target="ewn-01424552-a"/>
  <SynsetRelation relType="also" target="ewn-00835739-a"/>
  <SynsetRelation relType="also" target="ewn-00413644-a"/>
  <SynsetRelation relType="also" target="ewn-00887528-a"/>
  <SynsetRelation relType="also" target="ewn-01311380-a"/>
  <SynsetRelation relType="similar" target="ewn-00834520-s"/>
  <SynsetRelation relType="similar" target="ewn-00834764-s"/>
  <SynsetRelation relType="similar" target="ewn-00835142-s"/>
  <SynsetRelation relType="similar" target="ewn-00835235-s"/>
  <SynsetRelation relType="similar" target="ewn-00835442-s"/>
</Synset>

The synset:

<Synset id="ewn-01950363-s" ili="i10640" partOfSpeech="s" dc:subject="adj.all">
  <Definition>exhibiting native good judgment</Definition>
  <SynsetRelation relType="similar" target="ewn-01950023-a"/>
  <Example>&quot;arrive home at a reasonable hour&quot;</Example>
  <Example>&quot;commonsense scholarship on the foibles of a genius&quot;</Example>
  <Example>&quot;unlearned and commonsensical countryfolk were capable of solving problems that beset the more sophisticated&quot;</Example>
</Synset>

Should be merged with:

<Synset id="ewn-01950023-a" ili="i10639" partOfSpeech="a" dc:subject="adj.all">
  <Definition>showing reason or sound judgment</Definition>
  <SynsetRelation relType="also" target="ewn-00959321-a"/>
  <SynsetRelation relType="also" target="ewn-01432984-a"/>
  <SynsetRelation relType="also" target="ewn-01931771-a"/>
  <SynsetRelation relType="also" target="ewn-02508521-a"/>
  <SynsetRelation relType="attribute" target="ewn-05167957-n"/>
  <SynsetRelation relType="similar" target="ewn-01950363-s"/>
  <SynsetRelation relType="similar" target="ewn-01950705-s"/>
  <SynsetRelation relType="similar" target="ewn-01951109-s"/>
  <Example>&quot;a sensible choice&quot;</Example>
  <Example>&quot;a sensible person&quot;</Example>
</Synset>
jmccrae commented 4 years ago

ewn-00488278-s: (s) average%5:00:00:common:01, ordinary%5:00:02:common:01 lacking special distinction, rank, or status; commonly encountered "average people" "the ordinary (or common) man in the street"

ewn-01677162-a: (a) ordinary%3:00:00:: not exceptional in any way especially in quality or ability or size or degree "ordinary everyday objects" "ordinary decency" "an ordinary day" "an ordinary wine"

jmccrae commented 4 years ago

Another example is:

ewn-00490844-s (Interlingual Index: i2751) (s) unusual

not commonly encountered "two-career families are no longer unusual"

ewn-00492026-a (Interlingual Index: i2757) (a) unusual

not usual or common or ordinary

jmccrae commented 4 years ago

Another example: ewn-01016876-a (Interlingual Index: i5545) (a) last

coming after all others in time or space or degree or being the only one remaining "the last time I saw Paris" "the last day of the month" "had the last word" "waited until the last minute" "he raised his voice in a last supreme call" "the last game of the season" "down to his last nickel"

ewn-01013868-s (Interlingual Index: i5532) (s) final, terminal, concluding, last

occurring at or forming an end or termination "his concluding words came as a surprise" "the final chapter" "the last days of the dinosaurs" "terminal leave"

fcbond commented 4 years ago

I agree that these should be merged.

arademaker commented 4 years ago

Good to have @fcbond opinion. It is hard to check these cases without the words in the XML fragments.

arademaker commented 4 years ago

In the case of last https://github.com/globalwordnet/english-wordnet/issues/35#issuecomment-569257337 like #238, one is relational and the other is descriptive.

jmccrae commented 4 years ago

I think we need to review this issue carefully, there seem to be very many cases of highly similar adjectives.

arademaker commented 4 years ago

What was your approach to find these similarities? I think we can try to judge the method instead of all cases trying to be less ad-hoc.

ekaf commented 3 years ago

"histrionic" carries a nuance of excess and exageration, which would be lost if it is merged with "theatrical".

jmccrae commented 3 years ago

We are not going to fix this for the 2021 release.... the more I look at it the more it seems to be a huge problem, let's see about next year.