cu-mkp / m-k-manuscript-data

Text of BnF Ms Fr 640 in multiple formats, metadata about the manuscript, and derived data
13 stars 5 forks source link

Double check occitan terms #588

Closed thuchacz closed 4 years ago

thuchacz commented 5 years ago

ISSUE: lack of <oc> tags in the DCE There are +/- two dozen words in the Glossary that have been flagged as Occitan or having Occitan connections (all listed below with their full glossary entry). However, very few of those terms appear to have been tagged consistently with <oc> tags across the DCE.

TASK: 1) determine whether the words in the list below actually merit being marked up as occitan 2) search for those words in the TCN add tags across all three versions (TC/TCN/TL). 3) Add translation using editorial comment


All words with an Occitan connection:

TillmannTaape commented 5 years ago

I also changed the TL on 13v, from "Candlemakers never make good candle when the Autan wind blows because it is always melty" to "Candlemakers never make good candles when the Autan wind blows because the candle always tends to melt" – the "it" in the previous TL was ambiguous in the English, whereas in the French it's clear from the grammatical gender that it refers to the candle (therefore no sup tags required). And "melty" just sounds too informal.

thuchacz commented 5 years ago

The only issue there is to, perhaps, NOT tag that pseudo-supped "candle" as a tool—as we always do with every instance of candle—since we can't consistently tag across all three versions. I'm sure that problem partly motivated the "it is always melty" translation.

On Wed, May 22, 2019 at 10:03 PM Tillmann Taape notifications@github.com wrote:

I also changed the TL on 13v, from "Candlemakers never make good candle when the Autan wind blows because it is always melty" to "Candlemakers never make good candles when the Autan wind blows because the candle always tends to melt" – the "it" in the previous TL was ambiguous in the English, whereas in the French it's clear from the grammatical gender that it refers to the candle (therefore no sup tags required). And "melty" just sounds too informal.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AFTLQATNURPP32BFDA2YREDPWWRKRA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODWAFLLA#issuecomment-494949804, or mute the thread https://github.com/notifications/unsubscribe-auth/AFTLQAUWTFBELIJYXOGXNADPWWRKRANCNFSM4HMLKRHQ .

TillmannTaape commented 5 years ago

Good point, I won't tag it as tl.

TillmannTaape commented 5 years ago

Re crusol – I don't think it's worth reverting all 52 instances in the TL to "crusol", with a note explaining that it's Occitan for crucible. As MHS notes in the glossary, the ms has hybrid French/Occitan spellings as well, which raises the question how we should treat those. We could treat crusol as a variant spelling of creuset etc., i.e. list it in the glossary table as an alternative spelling, with (Occitan) in parentheses, but not tag it. On the other hand, that would mean that someone pulling out all the oc tags would miss the fact that he uses the Occitan version of the word a lot. But I'm still resisting the idea of keeping crusol every time in the TL and adding a note to the first instance in each entry; it seems very clunky.

thuchacz commented 5 years ago

Yeah. I see the dilemma. Since he doesn't use this term exclusively to refer to a crucible, I think we should tag it. I agree that the editorial comment would be clunky, so we should go the route we went with "crocum" and "crocum ferri"—i.e., give foreign language tags and rendering, but no editorial comments, only a glossary entry.

On Wed, May 22, 2019 at 10:59 PM Tillmann Taape notifications@github.com wrote:

Re crusol – I don't think it's worth reverting all 52 instances in the TL to "crusol", with a note explaining that it's Occitan for crucible. As MHS notes in the glossary, the ms has hybrid French/Occitan spellings as well, which raises the question how we should treat those. We could treat crusol as a variant spelling of creuset etc., i.e. list it in the glossary table as an alternative spelling, with (Occitan) in parentheses, but not tag it. On the other hand, that would mean that someone pulling out all the oc tags would miss the fact that he uses the Occitan version of the word a lot. But I'm still resisting the idea of keeping crusol every time in the TL and adding a note to the first instance in each entry; it seems very clunky.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AFTLQAQJNZNOEHJAOAV7YXLPWWXZPA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODWAKOJY#issuecomment-494970663, or mute the thread https://github.com/notifications/unsubscribe-auth/AFTLQAQB4XKHS6SZR722ZHLPWWXZPANCNFSM4HMLKRHQ .

TillmannTaape commented 5 years ago

That sounds good. And leave it translated in the TL?

thuchacz commented 5 years ago

Anything with OC tags will be italicized, as a rule. So, we shouldn't translate this. Just leave it as crusol and trust that the reader will look it up. It may be a common, pseudo-French variant, but if we want people to notice he's using 2 words, one of which is more occitan than the other, we need to simply treat it as occitan. I'm sure this is not really satisfying...

On Wed, May 22, 2019 at 11:27 PM Tillmann Taape notifications@github.com wrote:

That sounds good. And leave it translated in the TL?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AFTLQAQMJDVA7VIYNVXWVFLPWW3F3A5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODWAM2DI#issuecomment-494980365, or mute the thread https://github.com/notifications/unsubscribe-auth/AFTLQAS7LJXGFMQMMW52V43PWW3F3ANCNFSM4HMLKRHQ .

TillmannTaape commented 5 years ago

Not really, but I see the logic and don't have a better idea. @ps2270 ?

TillmannTaape commented 5 years ago

For words which are derived from Occitan but Frenchified, such as pomole and sernaille, I assume we want to keep it in the TL as writ in the TC/TCN, with an editorial note to explain the meaning and that it comes from an occitan word. But, should we tag it as fr, since it is quite far from the original occitan? Or ignore this and tag as occitan so that it comes up when you pull out oc-tagged terms? I think probably the latter but not sure.

thuchacz commented 5 years ago

Another good question. Since the OC terms (technical or otherwise) are comparatively few, I'm inclined to tag. And, if you think it's a weird enough case, we COULD give both OC and FR tags. The consequence (as above) is that they will be italicized in all three versions. So far, we have decided to provide editorial comments for all but a handful of widely recurring terms (which will be explained in the glossary).

Unsolicited opinion: frankly, I think it is silly of us NOT to give editorial comments for "crocum (ferri)" and "crusol." But that's a discussion for a later date.

On Wed, May 22, 2019 at 11:54 PM Tillmann Taape notifications@github.com wrote:

à l'enssus: Should we tag it oc, but keep the translation "on the top" in the TL since it's just an adverbial phrase and not a technical term?

For words which are derived from Occitan but Frenchified, such as pomole and sernaille, I assume we want to keep it in the TL as writ in the TC/TCN, with an editorial note to explain the meaning and that it comes from an occitan word. But, should we tag it as fr, since it is quite far from the original occitan? Or ignore this and tag as occitan so that it comes up when you pull out oc-tagged terms? I think probably the latter but not sure.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AFTLQARUSHD7GERHFF3NSLLPWW6IFA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODWAOZYQ#issuecomment-494988514, or mute the thread https://github.com/notifications/unsubscribe-auth/AFTLQATLCTKQSHAGJEY6RDTPWW6IFANCNFSM4HMLKRHQ .

ps2270 commented 5 years ago

I'm trying to close the loop here on this issue, and the question referred to me:

Another good question. Since the OC terms (technical or otherwise) are comparatively few, I'm inclined to tag. And, if you think it's a weird enough case, we COULD give both OC and FR tags. The consequence (as above) is that they will be italicized in all three versions. So far, we have decided to provide editorial comments for all but a handful of widely recurring terms (which will be explained in the glossary). DECISION (if still needed): I think this makes sense to do it this way.

Unsolicited opinion: frankly, I think it is silly of us NOT to give editorial comments for "crocum (ferri)" and "crusol." But that's a discussion for a later date. PHS: I would agree with that, if we can automate it somehow... …

TillmannTaape commented 5 years ago

I have tagged all terms listed above and their cognates in the TL as <oc>, however, I have ONLY reinserted the original word in the TL, with an ed note containing the translation, for the terms ticked in the above list – pending a decision on whether to translate these or not.

ps2270 commented 5 years ago

I forgot to bring this up yesterday. Tillmann and I were discussing this a few days ago, and came up with the following idea about <oc> and <po> words: We translate them into English, and direct Nick to italicize all words in tl, and potentially tcn that have these two tag sets. This seems a good idea b/c Some oc words are repeated many, many times (crusol, viron, biron) They are always single words, unlike the Latin passages or Italian strings of words. They are different than the gk and de (German) words, which, although single, are untranslatable. The italicization would signal to the reader that there is something up with this word and they could go to the glossary to read about it. And, of course, the italicization would be noted in a ReadMe document.

Does anyone see any potential problems with this solution? @thuchacz @TillmannTaape @Pantagrueliste @njr2128 ???

TillmannTaape commented 5 years ago

I agree we should translate occitan and poitevin tags but tag them in all three versions. Italicising them in the TL is unproblematic, but I'm not sure it's a good idea in the TCN or TC – this is perhaps a question for Marc.

Alternatively, we could flag these words with a superscript O/P, or "Occ."/"Poit", or some other mark.

Pantagrueliste commented 5 years ago

I do not see any problem with this solution in the TL. I would not apply this in the TC, however.

ps2270 commented 5 years ago

DECISION: We translate all <oc> and <po> terms into English, and direct Nick to italicize in the tl ONLY words with these tag sets. If anyone has strong objections, now is the time to state them. If we don't hear today, @TillmannTaape can you please go back and retranslate whatever you have reverted previously!

thuchacz commented 5 years ago

This is a bit different than what we had decided previously, which was to treat Occitan as the separate language that it is (i.e., use <oc> tags, have those words render in italics, give their meaning in the glossary and, if they appear with limited frequency, give the translation in an editorial comment). In this way, we would treat Occitan just as we do Latin, German, Italian, and Spanish. As Poitevin is a French dialect, we decided we would tag all those terms with <po> tags, though not have those tags impact rendering (i.e., no italics), note the Poitevin origin of all those words in the glossary, and we would translate the term as we would any other French word. In other words, <po> tags would be silent tags (like most of our semantic markup) that would be there for the purposes of analysis but not rendering.

In what you are proposing now, you are demoting Occitan from the status of a separate language and promoting Poitevin to something more than French.

thuchacz commented 5 years ago

The issue with Occitan is politically charged, esp. given the French government's active oppression of minority languages (e.g., Occitan, Breton) well into the 20th century. I think it should be treated as a language.

ps2270 commented 5 years ago

Thanks. Well, we can not treat poitevin in this way, if that seems a solution to the language issue. My decision can be interpreted not to mean that it's not a separate language, but the fact that these are individual words, not strings of words, or passages, as in other cases (except putative German words). The pages will look very cluttered with all those asterisks for ed comms for biron/viron and crusol.

On Wed, Jul 17, 2019 at 9:28 AM thuchacz notifications@github.com wrote:

The issue with Occitan is politically charged, esp. given the French government's active oppression of minority languages (e.g., Occitan, Breton) well into the 20th century. I think it should be treated as a language.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AF75UYCRKVMCVSGEZFFFOALP74M7NA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2EGB3Y#issuecomment-512254191, or mute the thread https://github.com/notifications/unsubscribe-auth/AF75UYCN67UQ7W7EDEBAC7LP74M7NANCNFSM4HMLKRHQ .

-- Pamela H. Smith Seth Low Professor of History http://history.columbia.edu/faculty/smith-pamela-h/ The Making and Knowing Project http://makingandknowing.org

Director, Center for Science and Society http://scienceandsociety.columbia.edu/ Chair, Presidential Scholars in Society and Neuroscience https://presidentialscholars.columbia.edu/ Columbia University 605 Fayerweather Hall, MC 2516 1180 Amsterdam Avenue New York, N.Y. 10027 (212) 854-7662-Phone (212) 851-5963-Fax ps2270@columbia.edu

Entangled Itineraries: Materials, Practices, and Knowledges across Eurasia https://www.upress.pitt.edu/books/9780822965770/ has just been released! The Body of the Artisan http://press.uchicago.edu/ucp/books/book/chicago/B/bo3618964.html has been reissued in paperback and electronic editions. The Business of Alchemy https://press.princeton.edu/titles/10868.html is available in a new paperback.

Pantagrueliste commented 5 years ago

Setting nationalist considerations aside, Occitan is technically a language. But a language already in decline in the 16th century, especially after the Ordinance of Villers-Cotterêts in 1539. Our manuscript bears witness of this decline, with relatively few Occitan words for a text originated in early modern Toulouse. If signalling Occitan words results in a messy and cluttered rendering of the text, then I have no objection in making all regional tags silent.

ps2270 commented 5 years ago

silent, or italicized in the case of Occitan?

On Wed, Jul 17, 2019 at 11:03 AM Clément Godbarge notifications@github.com wrote:

Setting nationalist considerations aside, Occitan is technically a language. But a language already in decline at the end of the 16th century, especially after the Ordinance of Villers-Cotterêts in 1539. Our manuscript bears witness of this decline, with relatively few Occitan words for a text originated in early modern Toulouse. If signalling Occitan words results in a messy and cluttered rendering of the text, then I have no objection in making all regional tags silent.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AF75UYD46CLUKKFXYTDQ7WDP74YCZA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2EQCKA#issuecomment-512295208, or mute the thread https://github.com/notifications/unsubscribe-auth/AF75UYFVMIICBVVJWTFXBLLP74YCZANCNFSM4HMLKRHQ .

-- Pamela H. Smith Seth Low Professor of History http://history.columbia.edu/faculty/smith-pamela-h/ The Making and Knowing Project http://makingandknowing.org

Director, Center for Science and Society http://scienceandsociety.columbia.edu/ Chair, Presidential Scholars in Society and Neuroscience https://presidentialscholars.columbia.edu/ Columbia University 605 Fayerweather Hall, MC 2516 1180 Amsterdam Avenue New York, N.Y. 10027 (212) 854-7662-Phone (212) 851-5963-Fax ps2270@columbia.edu

Entangled Itineraries: Materials, Practices, and Knowledges across Eurasia https://www.upress.pitt.edu/books/9780822965770/ has just been released! The Body of the Artisan http://press.uchicago.edu/ucp/books/book/chicago/B/bo3618964.html has been reissued in paperback and electronic editions. The Business of Alchemy https://press.princeton.edu/titles/10868.html is available in a new paperback.

Pantagrueliste commented 5 years ago

I think we either silence all regional tags (including Occitan) or we italicize them (including Occitan).

TillmannTaape commented 5 years ago

I completely agree re the political implications of recognising Occitan as a language, but my sense is that the AP is not using it as a language in the same way he does Latin or Italian. He may very well have no knowledge of Occitan but use a few terms which have made their way into daily French language in the South – hence also his various degrees of gallicisation of these words. So I think we're ok to flag them as Occitan or Occitan-derived words used in French without doing injustice to Occitan as a language. In some ways, we might be doing it a greater injustice by implying that throwing in the odd word into a French sentence constitutes proper use of Occitan as a language.

I'm in favour of italicising in the TL, and even in all three versions if that seems reconcilable with our transcription protocol (has Marc pronounced on this?).

ps2270 commented 5 years ago

Marc has not yet been asked, but I think I have everyone opinions at the moment, so I'll ask him now.

On Wed, Jul 17, 2019 at 3:30 PM Tillmann Taape notifications@github.com wrote:

I completely agree re the political implications of recognising Occitan as a language, but my sense is that the AP is not using it as a language in the same way he does Latin or Italian. He may very well have no knowledge of Occitan but use a few terms which have made their way into daily French language in the South – hence also his various degrees of gallicisation of these words. So I think we're ok to flag them as Occitan or Occitan-derived words used in French without doing injustice to Occitan as a language. In some ways, we might be doing it a greater injustice by implying that throwing in the odd word into a French sentence constitutes proper use of Occitan as a language.

I'm in favour of italicising in the TL, and even in all three versions if that seems reconcilable with our transcription protocol (has Marc pronounced on this?).

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AF75UYDUJEFHI64K5CNVZ4LP75XM5A5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2GK3DA#issuecomment-512535948, or mute the thread https://github.com/notifications/unsubscribe-auth/AF75UYGXMMVDOWBKU4PF3KDP75XM5ANCNFSM4HMLKRHQ .

-- Pamela H. Smith Seth Low Professor of History http://history.columbia.edu/faculty/smith-pamela-h/ The Making and Knowing Project http://makingandknowing.org

Director, Center for Science and Society http://scienceandsociety.columbia.edu/ Chair, Presidential Scholars in Society and Neuroscience https://presidentialscholars.columbia.edu/ Columbia University 605 Fayerweather Hall, MC 2516 1180 Amsterdam Avenue New York, N.Y. 10027 (212) 854-7662-Phone (212) 851-5963-Fax ps2270@columbia.edu

Entangled Itineraries: Materials, Practices, and Knowledges across Eurasia https://www.upress.pitt.edu/books/9780822965770/ has just been released! The Body of the Artisan http://press.uchicago.edu/ucp/books/book/chicago/B/bo3618964.html has been reissued in paperback and electronic editions. The Business of Alchemy https://press.princeton.edu/titles/10868.html is available in a new paperback.

TillmannTaape commented 5 years ago

I've reverted to the English word in the TL, except for visaube which we are leaving in the original because we're not entirely sure what it is. I've removed comment anchors and marked the comment for deletion in the comment tracking sheet, except where the comment was fulfilling some additional function besides merely translating the Occitan word.

ps2270 commented 5 years ago

Marc's answer: " I rather agree those words should be made as unobtrusive as possible for the reader. There is no clear-cut linguistic boundary there, and the author wouldn't have considered those words foreign, they are seamlessly incorporated into his own language. Some words are names of local things ("millas"), others are local words that can actually be of Occitan origin or just pronounced and written in a more or less southern way. So they are interesting to annotate as cultural evidence and clues to the author's linguistic profile, but hardly more than that. Poitevin of course is even less distinct from standard French (whatever that was at the time)." And he says he would not italicize in the tcn.

DECISION: We'll translate into English and ask Nick to italicize <oc> tags in the TL only. I see Tillmann has already acted on this decision. Thanks. @TillmannTaape have you also changed the Comment Tracking Sheet?

ps2270 commented 5 years ago

@TillmannTaape Meant to say: If you haven't changed comment tracking, let me know. I'm deep in it anyway, so I can do it, if not.

TillmannTaape commented 5 years ago

@ps2270 Yes, in the tracking sheet I've marked as "delete" those comments which were only the translation from occitan, and taken out the comment anchors in the files.

ps2270 commented 5 years ago

great. thanks.

On Thu, Jul 18, 2019 at 10:22 AM Tillmann Taape notifications@github.com wrote:

@ps2270 https://github.com/ps2270 Yes, in the tracking sheet I've marked as "delete" those comments which were only the translation from occitan, and taken out the comment anchors in the files.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/cu-mkp/m-k-manuscript-data/issues/588?email_source=notifications&email_token=AF75UYFSQFH7GVAAEQA3B6LQAB4DLA5CNFSM4HMLKRH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2IUOUQ#issuecomment-512837458, or mute the thread https://github.com/notifications/unsubscribe-auth/AF75UYGRNVDTGRGOVEEJMSLQAB4DLANCNFSM4HMLKRHQ .

-- Pamela H. Smith Seth Low Professor of History http://history.columbia.edu/faculty/smith-pamela-h/ The Making and Knowing Project http://makingandknowing.org

Director, Center for Science and Society http://scienceandsociety.columbia.edu/ Chair, Presidential Scholars in Society and Neuroscience https://presidentialscholars.columbia.edu/ Columbia University 605 Fayerweather Hall, MC 2516 1180 Amsterdam Avenue New York, N.Y. 10027 (212) 854-7662-Phone (212) 851-5963-Fax ps2270@columbia.edu

Entangled Itineraries: Materials, Practices, and Knowledges across Eurasia https://www.upress.pitt.edu/books/9780822965770/ has just been released! The Body of the Artisan http://press.uchicago.edu/ucp/books/book/chicago/B/bo3618964.html has been reissued in paperback and electronic editions. The Business of Alchemy https://press.princeton.edu/titles/10868.html is available in a new paperback.

TillmannTaape commented 5 years ago

Can we close this issue? Has Nick been told to italicise oc tags in TL only, or is this noted somewhere else?

ps2270 commented 5 years ago

@njr2128 If your next meeting with Nick is this week, can you please let him know our decision, or I can make an issue about it in his repo?

ps2270 commented 5 years ago

@njr2128 Can we close this issue?

ps2270 commented 4 years ago

Final Decision: No italicization of <oc> tag in any version. TT, CAG, & MHS write up a paragraph on the use of these in the ms.

njr2128 commented 4 years ago

This has been updated in https://github.com/cu-mkp/making-knowing-edition/issues/235 and a new issue created for the explanation text to be written: #1145