Flickr-Foundation / flickypedia

A tool to copy CC-licensed images from Flickr to Wikimedia Commons
https://www.flickr.org/tools/flickypedia/
Apache License 2.0
9 stars 2 forks source link

Would it be possible to prioritize backfilling of BHL pictures? #572

Open lubianat opened 5 days ago

lubianat commented 5 days ago

Hi, Alex, I found the code :)

First, kudos for the nice structure and readability; it is a pleasing code to read.

I am digging to see where I could help providing some list of BHL files on Commons missing the Flickypedia SDC.

I see there is a Flickr ID finder for the wikitext:

https://github.com/Flickr-Foundation/flickypedia/blob/1488f35237238c0c10d5579848831394854555d1/src/flickypedia/backfillr/flickr_matcher.py#L78-L80

There are about 150k pictures that User:Fae uploaded from the BiodivLibrary that I believe came from Flickr (maybe @JJDear83 can confirm?), but do not have a FlickrID:

QUESTION: Would it make sense to you to add the Flickr ids retroactively from such images? It should be possible to get the Flickr IDs from the BHL page IDs.

At the same time, there are about 7k pictures that are from BHL, have been reviewed by FlickrevieweR but are still missing a P12120 (Flickr ID) property.

I guess these are on target for the Backfillrbot, right? I'd be happy to provide this list in any format that is more suitable for pushing the info.

If there is anything at all I can do to help, just let me know.

Cheers, Tiago

lubianat commented 1 day ago

Update here: Alex kindly replied by mail. The conversation is going on.

JJDear83 commented 1 day ago

HI @lubianat I think Fae loaded those but he cites 303K images that he loaded up for us on his user project page.https://commons.m.wikimedia.org/wiki/User:F%C3%A6/Project_list. Scroll down and you will see BHL. I tried to get in touch but it seems he has ghosted so i couldn't figure out his process.