Closed inikulin closed 10 months ago
In the current code there are "rewritable units", "tokens", and "lexemes". Did we stop midway through such a rename?
@nox no, these are all different things. The intention of this ticket is to simplify the API and get rid of the term "lexeme". Currently the full parser produces lexemes; if a selector captures them, they are converted to "tokens", which expose the rewriting API. In most cases a token is the same thing as a rewritable unit, with the exception of `Element`, which is a combination of a start tag token, an end tag token, and the inner content.
The idea is to rename "lexemes" to "tokens" to make things more conventional, and to call "rewritable units" what is currently named "tokens", with "Element" just being a compound rewritable unit.
The current terminology might be a bit confusing. So, let's rename tokens to rewritable units (`StartTag` will still be a rewritable unit, just not exposed in the public API). This allows us to rename `lexeme` to `token`.
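
To make the proposed naming concrete, here is a minimal sketch of how the two layers could relate after the rename. All type and function names below (`Token`, `RewritableUnit`, `capture`, `capture_element`) are hypothetical and chosen only for illustration; they are not the library's actual definitions, and the real parser and selector matching are far more involved.

```rust
// Hypothetical sketch of the proposed terminology. None of these names are
// taken from the real code base.

/// What the parser emits: called "lexeme" today, "token" after the rename.
#[derive(Debug, Clone, PartialEq)]
enum Token {
    StartTag { name: String },
    EndTag { name: String },
    Text(String),
}

/// What the rewriting API works with once a selector captures something:
/// called "token" today, "rewritable unit" after the rename.
#[derive(Debug, Clone, PartialEq)]
enum RewritableUnit {
    StartTag { name: String },
    EndTag { name: String },
    Text(String),
    /// Compound unit: a start tag, the inner content, and an end tag.
    Element {
        start_tag: Box<RewritableUnit>,
        inner: Vec<RewritableUnit>,
        end_tag: Box<RewritableUnit>,
    },
}

/// One-to-one conversion of a captured token into a rewritable unit.
fn capture(token: Token) -> RewritableUnit {
    match token {
        Token::StartTag { name } => RewritableUnit::StartTag { name },
        Token::EndTag { name } => RewritableUnit::EndTag { name },
        Token::Text(text) => RewritableUnit::Text(text),
    }
}

/// Combines a token run of the form [start tag, ..., end tag] into a compound
/// `Element`. Returns `None` if the run is not delimited by matching tags.
fn capture_element(tokens: Vec<Token>) -> Option<RewritableUnit> {
    if tokens.len() < 2 {
        return None;
    }
    let mut units: Vec<RewritableUnit> = tokens.into_iter().map(capture).collect();
    let end_tag = units.pop()?;
    let start_tag = units.remove(0);

    let tags_match = matches!(
        (&start_tag, &end_tag),
        (
            RewritableUnit::StartTag { name: open },
            RewritableUnit::EndTag { name: close },
        ) if open == close
    );
    if !tags_match {
        return None;
    }

    Some(RewritableUnit::Element {
        start_tag: Box::new(start_tag),
        inner: units,
        end_tag: Box::new(end_tag),
    })
}
```

In this sketch, every simple unit maps one-to-one onto a token, while `Element` is the only variant assembled from several tokens, which mirrors the exception described above.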