cebtenzzre / tumblr-utils

A fork of tumblr-utils with Python 3 support, bug fixes, and lots of features I found useful.
GNU General Public License v3.0
39 stars 7 forks source link

Feature Request: Keep Reblogs With Owner Response #13

Open lucible opened 11 months ago

lucible commented 11 months ago

Hello! Thanks for your fork of tumblr-utils - all the additional features are very much appreciated.

While the --no-reblog and --only-reblog options are really helpful for narrowing down content on large blogs, I'd love an option to keep only original posts and reblogs where the original blog adds to the reblog. For example, if I'm backing up "alittleblog" and they post a text post then later reblog their own post and add new text underneath, the --no-reblog option will exclude the reblog where alittleblog added new text.

Hope that makes sense -- I took a look at the is_reblog module and it's a bit over my head without having dug into the post HTML so I'm not sure if this is really possible? but would be appreciated if it was. Let me know if there's any other info I could provide that would help.

cebtenzzre commented 11 months ago

This is not straightforward, because AFAIK the Tumblr API doesn't clearly differentiate between reblogs with added content and "pure" reblogs. I think a best-effort solution that works in simple cases such as the one you're describing would involve some basic HTML parsing, implying a BeautifulSoup dependency.