JMdictProject / JMdictIssues

JMdict Japanese dictionary - lexicographic, etc. issues management
16 stars 1 forks source link

Add interface to submit example sentences #110

Closed devorair790 closed 7 months ago

devorair790 commented 7 months ago

I don't think there's any interface to submit example sentences for JMdict

JMdictProject commented 7 months ago

The example sentences uses by various apps and systems in conjunction with the JMdict entries come from the Tatoeba project. See https://www.edrdg.org/wiki/index.php/Main_Page#The_Tanaka_Corpus

devorair790 commented 7 months ago

But there's still no interface to submit indexed sentences

JMdictProject commented 7 months ago

But there's still no interface to submit indexed sentences

There is, within the Tatoeba project. Any Japanese sentence can have a set of indices attached. Note that the project usually expects that sentence submissions and amendments are carried out by native speakers.

furanzoni commented 5 months ago

There is, within the Tatoeba project. Any Japanese sentence can have a set of indices attached. Note that the project usually expects that sentence submissions and amendments are carried out by native speakers.

I don't see this interface to add sentences with indices or modify existing sentences' indices within the Tatoeba website. Would be useful so we could help fixing #121.

JMdictProject commented 5 months ago

I don't see this interface to add sentences with indices or modify existing sentences' indices within the Tatoeba website.

https://tatoeba.org/en/sentence_annotations/show/98515 (example)

You need to be logged in.

stephenmk commented 5 months ago

If the JMdictDB interface could at least display the number of sentences and priority-tagged sentences that are indexed to entries and senses, I think that would do a great deal to remind people to update the sentence indices when big changes are made.

Just in the past month we've forgotten to update the sentences for 貿易上 and チュッ, for example.

JMdictProject commented 5 months ago

The sentences and their indices are quite separate from the dictionary database. What might be more achievable is to have a separate function available which could identify the relationship, if any, between surface forms and sentence indices.

Wwwjdic has code and data files for associating surface forms with the sentences. I could potentially use that to provide such things as: