Closed joshed-io closed 7 years ago
We could use this: https://www.npmjs.com/package/emoji-strip
only strip the emoji that are 2+ bytes, other ones work perfectly with Algolia
Thanks for the tip Mr. @Haroenv! So then... how to detect those chars... is there a regex that matches >2 byte chars?
https://github.com/mathiasbynens/emoji-regex and strip those w length over 2
Stripping all of them for right now using that regex, but welcome to a better solution for preserving the single-byte
We get some weird unicode characters in the results, from emojis that were indexed. ���
The best thing to do for now might be to remove emojis from the tweets at indexing time. Anyway, it will make the rating emoji stand out :)