Tatoeba / tatoeba2

Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
https://tatoeba.org
GNU Affero General Public License v3.0
714 stars 132 forks source link

Create a way to link sentences in the same language with the same meaning. #3084

Open ckjpn opened 1 year ago

ckjpn commented 1 year ago

The Reasons

History

We used to create links to sentences in the same language that had the exact same meaning using the existing linking system.

With the introduction of the new layout, the heading "Translations" was added above directly-linked sentences.

TRANG then asked us not to create links between sentences in the same language since these wouldn't be considered translations.

Related Discussion

https://github.com/Tatoeba/tatoeba2/issues/2107 It's inaccurate to label same language sentences as "Translations"

To anyone working on this.

I, and perhaps a few others, have added many comments starting with "Same meaning:" following by 2 linefeeds. After creating a method to link such sentences, you could harvest all these links from comments and link the sentences together. You can download all the comments here: https://downloads.tatoeba.org/exports/comments.tar.bz2

As comments, these are not so useful for the average user, since a user needs to actually be on the page with the comment to see the link. Also, usually these comments are only on one of the sentences and not both.

Screen Shot 2023-10-07 at 9 56 41
DJ-Saidez commented 1 year ago

Would this functionality apply to the download files as well (a sentence will also display equivalent sentences), or only as displayed on the Tatoeba page? This would change to what extent you'd like the change to be.