Open nkprasad12 opened 1 year ago
Also lots of things connected to adj.
that need to be split and tagged separately:
2 <= adj., Malian
2 <= neutr. indecl.
2 <= pertaining to C. Marius, Marian
2 <= comp. adj.
2 <= part. perf. subst.
2 <= adj., Moschian
2 <= Pass. reflex.
2 <= Neutr. plur
2 <= adj., Nelean
2 <= adj., Neronian
2 <= adj., Nicene
2 <= a nest.
2 <= a.;
2 <= adj., Nonacrian
2 <= adj. f., Nysæan
2 <= adj., Nysæan
Similarly, we have bits like
2 <= init.—Sup.
with init. and fin. that need to be separated so we can tag them correctly. There's also med.
Sometimes we have names like:
2 <= Ju.
2 <= Hesperian.
2 <= Eut.
Similarly lots of things with adv. that have text attached:
2 <= adv., with a distinction
2 <= adv., separately, distinctly
2 <= adv., separately
2 <= term. ad quem
2 <= adv., clearly
we should separate this
We also have sub fin.
which needs to be investigated, probably a location reference (below the end)
similarly ad fin.
We also have things like: P. a. no.
which should only be regular italicized:
Cf. devotus, <hi rend="ital">P. a. no.</hi> B.
since it's referring to a section.
What is piur. ?
Is Art. an abbrevuiation for a name?
N. cr.
??? seems to be the name of a work.
unclear what in lemm.
means
We have 28 P. a. fin
which should be broken up from the fin
23 of: ext.;
???
in carm.
should not be bold.
2 <= Act., to turn
should be split
th of Oct.
should not be bold
patron.
, dim.
etc should probably be not highlighted
Handle alter
- it sometimes represents the Latin word and sometimes the English.
In the English, it is always preceded by to change
so we can flag these manually.
// Also want "in", but only if it is after "With". Otherwise it's an English word. // Also want "object" but only if preceding "- " // Also want to connect "of thing;" with its preceding word
This is partially fixed by: https://github.com/nkprasad12/morcus-net/pull/229 This handles making most grammar terms specially highlighted. We should then go through and try to make it so that only English definitions are bold, and to catch some remaining edge cases.
We would ideally like to stick to the rule of: bold = English definition