I've observed in several contexts (including with my own corpus) that case may be significant in lemmata, usually to distinguish proper nouns (for instance, Sarrasin for the name of the people or of M. Sarrasin, by opposition to 'sarrasin' as a kind of corn…). Could we not lowercase lemma during processing ?
This might create a problem with generate approach, but not necessarily with label…
I've observed in several contexts (including with my own corpus) that case may be significant in lemmata, usually to distinguish proper nouns (for instance, Sarrasin for the name of the people or of M. Sarrasin, by opposition to 'sarrasin' as a kind of corn…). Could we not lowercase lemma during processing ? This might create a problem with generate approach, but not necessarily with label…