flairNLP / fundus

A very simple news crawler with a funny name
MIT License

Update LD Selector #514

addie9800 closed this 2 weeks ago

addie9800 commented 2 weeks ago

When working with Fundus, I found that far too much JSON is extracted when computing the self.precomputed elements. This causes multiple issues:

MaxDall commented 2 weeks ago

To be honest, I would rather keep the generous LD selector and catch JSONDecodeErrors during parsing. You can fix the author for FreiePresse by using get_value_by_key_path instead of bf_search.
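To make the suggestion concrete, here is a minimal standalone sketch of both ideas, using only the Python standard library. It is not Fundus's implementation: `LDScriptCollector`, `parse_ld_blocks`, `get_by_key_path` and `breadth_first_find` are hypothetical names invented for illustration, and they only approximate what a generous selector with tolerant parsing, a key-path lookup and a breadth-first search might look like.

```python
import json
from collections import deque
from html.parser import HTMLParser


class LDScriptCollector(HTMLParser):
    """Hypothetical helper: collects the raw text of every ld+json script tag."""

    def __init__(self):
        super().__init__()
        self._in_ld = False
        self._buf = []
        self.raw_blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_ld = True
            self._buf = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_ld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_ld:
            self.raw_blocks.append("".join(self._buf))
            self._in_ld = False


def parse_ld_blocks(html):
    """Select generously, then skip any block that is not valid JSON."""
    collector = LDScriptCollector()
    collector.feed(html)
    collector.close()
    parsed = []
    for raw in collector.raw_blocks:
        try:
            parsed.append(json.loads(raw))
        except json.JSONDecodeError:
            continue  # malformed block: ignore it instead of failing the article
    return parsed


def get_by_key_path(data, key_path, default=None):
    """Follow an explicit list of keys through nested dicts."""
    current = data
    for key in key_path:
        if not isinstance(current, dict) or key not in current:
            return default
        current = current[key]
    return current


def breadth_first_find(data, key, default=None):
    """Return the value of the first matching key found level by level."""
    queue = deque([data])
    while queue:
        node = queue.popleft()
        if isinstance(node, dict):
            if key in node:
                return node[key]
            queue.extend(node.values())
        elif isinstance(node, list):
            queue.extend(node)
    return default
```

With a document such as `{"publisher": {"name": ...}, "author": {"name": ...}}`, a breadth-first search for `name` returns whichever same-depth match comes first, while an explicit path like `["author", "name"]` is unambiguous. Whether that is the exact cause of the FreiePresse author problem is not stated in this thread, so treat it as one plausible illustration.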