catleeball / tmnt_wikipedia_bot

Find Wikipedia titles that can be sung to the Teenage Mutant Ninja Turtles themesong.
MIT License
143 stars 23 forks source link

Match whole banned words rather than substrings #10

Closed catleeball closed 5 years ago

catleeball commented 5 years ago

Currently if a string in the ban list exists anywhere in the title, including as a substring of a longer word, the title is discarded.

Update this to discard titles where the whole word matches exactly, but not as a substring of a longer word.

Example:

BANNED = ['dog']
title = 'my big dog' # should be discarded
title = 'a dogged load shark' # should not discard
catleeball commented 5 years ago

This is resolved by commit acc48a8