nkprasad12 / morcus-net

Source code for Morcus Latin Tools.
https://morcus.net
GNU General Public License v3.0

Smarter handling of emphasized text in LS #234

Open nkprasad12 opened 1 year ago

nkprasad12 commented 1 year ago

We would ideally like to stick to the rule of: bold = English definition

nkprasad12 commented 1 year ago

Also lots of things connected to adj. that need to be split and tagged separately:

2 <= adj., Malian
2 <= neutr. indecl.
2 <= pertaining to C. Marius, Marian
2 <= comp. adj.
2 <= part. perf. subst.
2 <= adj., Moschian
2 <= Pass. reflex.
2 <= Neutr. plur
2 <= adj., Nelean
2 <= adj., Neronian
2 <= adj., Nicene
2 <= a nest.
2 <= a.;
2 <= adj., Nonacrian

2 <= adj. f., Nysæan
2 <= adj., Nysæan
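
A minimal sketch of how spans like the ones above could be split, assuming the grammar abbreviation always comes first and is followed by an optional comma. The names `EmphasisPart`, `GRAMMAR_PREFIXES`, and `splitEmphasized` are hypothetical and are not taken from the morcus-net code base; the prefix list is a small illustrative sample.

```typescript
// Hypothetical shape for one piece of a split emphasized span.
interface EmphasisPart {
  kind: "grammar" | "definition";
  text: string;
}

// Assumed sample of grammar prefixes. Longer prefixes come first so that
// "adj. f." is matched before "adj.".
const GRAMMAR_PREFIXES: string[] = ["adj. f.", "adj."];

// Splits a span such as "adj., Malian" into a grammar tag and an English gloss.
// Spans without a known prefix are returned unchanged as a single definition.
function splitEmphasized(span: string): EmphasisPart[] {
  for (const prefix of GRAMMAR_PREFIXES) {
    if (!span.startsWith(prefix)) continue;
    // Drop one optional separator and any whitespace after the prefix.
    const rest = span.slice(prefix.length).replace(/^[,;]?\s*/, "");
    const parts: EmphasisPart[] = [{ kind: "grammar", text: prefix }];
    if (rest.length > 0) parts.push({ kind: "definition", text: rest });
    return parts;
  }
  return [{ kind: "definition", text: span }];
}
```

With this, `splitEmphasized("adj. f., Nysæan")` would yield a grammar part `adj. f.` and a definition part `Nysæan`, so only the second needs to stay bold.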
nkprasad12 commented 1 year ago

Similarly, we have bits like

2 <= init.—Sup.

with init. and fin. that need to be separated so we can tag them correctly. There's also med.
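
One possible way to separate these, sketched under the assumption that the pieces are joined by an em dash (U+2014) and that init., med., and fin. are the only location markers of interest. `LOCATION_MARKERS`, `TaggedChunk`, and `splitOnDash` are illustrative names, not existing code.

```typescript
// Assumed set of location markers to tag separately.
const LOCATION_MARKERS = new Set<string>(["init.", "med.", "fin."]);

// Hypothetical result type: one chunk of the original span plus a flag.
interface TaggedChunk {
  text: string;
  isLocation: boolean;
}

// Splits a span on em dashes and flags the chunks that are location markers.
function splitOnDash(span: string): TaggedChunk[] {
  return span
    .split("\u2014")
    .map((chunk) => chunk.trim())
    .filter((chunk) => chunk.length > 0)
    .map((chunk) => ({ text: chunk, isLocation: LOCATION_MARKERS.has(chunk) }));
}
```

Each chunk can then be tagged on its own, so the location marker and the abbreviation next to it no longer share one emphasized span.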

nkprasad12 commented 1 year ago

Sometimes we have names like:

2 <= Ju.
2 <= Hesperian.
2 <= Eut.
nkprasad12 commented 1 year ago

Similarly lots of things with adv. that have text attached:

2 <= adv., with a distinction
2 <= adv., separately, distinctly
2 <= adv., separately
2 <= term. ad quem
2 <= adv., clearly

We should separate these.

nkprasad12 commented 1 year ago

We also have sub fin., which needs to be investigated. It is probably a location reference ("below the end"), similar to ad fin.

nkprasad12 commented 1 year ago

We also have things like P. a. no., which should only be in regular italics:

Cf. devotus, <hi rend="ital">P. a. no.</hi> B.

since it's referring to a section.

nkprasad12 commented 1 year ago

What is piur. ?

nkprasad12 commented 1 year ago

Is Art. an abbreviation for a name?

nkprasad12 commented 1 year ago

N. cr.

??? seems to be the name of a work.

nkprasad12 commented 1 year ago

It is unclear what in lemm. means.

nkprasad12 commented 1 year ago

We have 28 instances of P. a. fin, where the P. a. should be split from the fin.

nkprasad12 commented 1 year ago

23 of: ext.; ???

nkprasad12 commented 1 year ago

in carm. should not be bold.

nkprasad12 commented 1 year ago

2 <= Act., to turn should be split

nkprasad12 commented 1 year ago

th of Oct. should not be bold

nkprasad12 commented 1 year ago

patron., dim., etc. should probably not be highlighted.

nkprasad12 commented 1 year ago

Handle alter - it sometimes represents the Latin word and sometimes the English one. When it is English, it is always preceded by "to change", so we can flag these manually.
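
A rough sketch of how the alter cases could be flagged, assuming "preceded by" means that "to change" comes directly before alter, with only whitespace or punctuation in between. `AlterHit`, `isEnglishAlter`, and `classifyAlters` are hypothetical helpers, not part of the existing code.

```typescript
// Hypothetical record for one occurrence of "alter" in an entry's text.
interface AlterHit {
  index: number;
  english: boolean;
}

// Returns true if the text just before `index` ends with "to change",
// ignoring trailing whitespace, commas, and semicolons (an assumption).
function isEnglishAlter(text: string, index: number): boolean {
  const before = text.slice(0, index).replace(/[\s,;]+$/, "");
  return before.endsWith("to change");
}

// Finds every standalone "alter" and classifies it as English or Latin.
function classifyAlters(text: string): AlterHit[] {
  const hits: AlterHit[] = [];
  const pattern = /\balter\b/g;
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(text)) !== null) {
    hits.push({ index: match.index, english: isEnglishAlter(text, match.index) });
  }
  return hits;
}
```

Occurrences classified as English could then be collected into a list for manual review rather than being retagged automatically.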

nkprasad12 commented 1 year ago

// Also want "in", but only if it is after "With". Otherwise it's an English word.
// Also want "object" but only if preceding "- "
// Also want to connect "of thing;" with its preceding word

nkprasad12 commented 1 year ago

This is partially fixed by https://github.com/nkprasad12/morcus-net/pull/229, which gives most grammar terms their own special highlighting. We should then go through and try to make sure that only English definitions are bold, and catch some remaining edge cases.