Tatoeba / tatoeba2

Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
https://tatoeba.org
GNU Affero General Public License v3.0
679 stars 131 forks source link

Automatically add tags when translating quotes #3059

Open Guybrush88 opened 1 year ago

Guybrush88 commented 1 year ago

Story or Context I was translating some sentences from the advanced search, and I saw that one of them was a quote (with the proper tag) only when I opened the individual page of it, so I opened my translations to add proper tags to show they were derived from a quote, This is the sentence I translated: https://tatoeba.org/it/sentences/show/5860182

Idea If a sentence contains the tag showing it's a quote (this one, in the context I described: https://tatoeba.org/it/tags/show_sentences_with_tag/1185), this tag should be automatically added to its translations when added, so they can automatically comply to the attribution license.

jiru commented 11 months ago

Or maybe we should have tags that apply to the whole sentence group, as opposed to a single sentence?

Edit (note to self): However, with a mechanism like that, if the translation is manually linked, or ends up linked because of deduplication. there could be sentences initially not added as a quote ending up being tagged as quote just because they happen to be a possible translation of a quote in a different language. Is that okay?

Guybrush88 commented 11 months ago

However, with a mechanism like that, if the translation is manually linked, or ends up linked because of deduplication. there could be sentences initially not added as a quote ending up being tagged as quote just because they happen to be a possible translation of a quote in a different language. Is that okay?

In such cases, maybe we can have notifications or automated comments to have user discussions (if this is considered to be a decent idea to improve quality of tags)

Or maybe we should have tags that apply to the whole sentence group, as opposed to a single sentence?

In my opinion, this can be applied depending on how such tags are reused (both internally to Tatoeba and to external projects), in order to see what's more effective to preserve the license. If there are translations, they're more likely to be considered as derivative work, so maybe single sentences have to be tagged

jiru commented 11 months ago

How about having two tags, one for the original quote and one for translations of these quotes? For example existing "by XXX" and "quote translation" for all other sentences of the group. This way it makes easier to see which sentences are derivative and which are not.

Guybrush88 commented 11 months ago

How about having two tags, one for the original quote and one for translations of these quotes? For example existing "by XXX" and "quote translation" for all other sentences of the group. This way it makes easier to see which sentences are derivative and which are not.

I agree with that.

is "quote translation" enough for license attribution, though? Just asking to be sure that everything is fine license-wise. Maybe having both tags in derivative sentences?

jiru commented 11 months ago

Well, we already display the sentence a translation originates from on the sentence page.