minimaxir / download-tweets-ai-text-gen

Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.
MIT License

Refine regex / reduce duplicate blank spaces #1

Open minimaxir opened 4 years ago

minimaxir commented 4 years ago

The cleaning regex pattern could be optimized.

minimaxir commented 4 years ago

The regex is fine (the weirdness was due to stripping user tag defaults), but it may be worth using a second regex to dedupe spaces.
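
A minimal sketch of what the second pass could look like. The names `CLEAN_PATTERN`, `SPACE_PATTERN`, and `clean_tweet` are hypothetical, and the first-pass pattern here is only a stand-in that strips URLs and user tags; it is not the script's actual cleaning pattern.

```python
import re

# Hypothetical first-pass pattern (an assumption for illustration only):
# removes URLs and @user tags, which can leave runs of spaces behind.
CLEAN_PATTERN = re.compile(r"https?://\S+|@\w+")

# Second pass: match any run of two or more spaces.
SPACE_PATTERN = re.compile(r" {2,}")


def clean_tweet(text: str) -> str:
    """Strip URLs/user tags, then collapse the leftover duplicate spaces."""
    cleaned = CLEAN_PATTERN.sub("", text)
    # Replace each run of spaces with a single space, then trim the ends.
    return SPACE_PATTERN.sub(" ", cleaned).strip()


if __name__ == "__main__":
    print(clean_tweet("@user  check this https://t.co/abc out"))
```

Keeping the dedupe step as a separate pattern leaves the cleaning regex untouched, so the two can be tuned independently.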