alltheplaces / alltheplaces

A set of spiders and scrapers to extract location information from places that post their location on the internet.
https://www.alltheplaces.xyz
Other
636 stars 214 forks source link

contact:facebook repeated 200 times and more - so likely not POI-specific #10963

Open matkoniecz opened 1 month ago

matkoniecz commented 1 month ago
matkoniecz commented 1 month ago

Maybe image or phone and similar tags that is repeated over 10 times should be thrown out automatically? Without throwing it out manually by changing spider?

Are there any more keys where detecting repetition and making issue like this would be useful?

RedAuburn commented 1 month ago

If a generic fix is made, the fix in https://github.com/alltheplaces/alltheplaces/pull/11138 can be removed