cybersemics / em

A beautiful, minimalistic note-taking app for personal sensemaking.
Other
282 stars 107 forks source link

Improve Lexeme hashing of punctuation #1720

Closed raineorshine closed 1 year ago

raineorshine commented 1 year ago

Distinguish the characters -+/:., next to numbers:

(Continue to hash match those adjoining letters, e.g. sense-making and sensemaking)

Distinguish - at whitespace or beginning/end of line:

Distinguish @#<>:

Match &, +, and and:

Match ö and o:

raineorshine commented 1 year ago

Fixed in:

a3a5f4cbda normalizeThought: Ignore diacritics. 0d8929d603 normalizeThought: Transform ampersand to 'and'. 10b009f5b0 normalizeThought: Always distinguish hashtags and mentions. af13251894 normalizeThought: Boolean expressions and ending punctuation. 0ea8506644 normalizeThought: Remove colons preceding whitespace. 9be85568fb normalizeThought: Selectively remove hyphens and dashes. d82b28f6c8 Add normalizeThought tests.