Closed Benjamin2107 closed 2 weeks ago
@Benjamin2107 When I run the code snippet above, only the filter specified in the Kicker
publisher enum works as intended, but that got only fixed recently with #459. Could you confirm if this is also the case for you?
You can get the debugging logging messages enabled with
import logging
from fundus.logging import set_log_level
set_log_level(logging.DEBUG)
Yes, it is working now. Thanks :)
Describe the bug
While working on #464 I had trouble filtering some regex in the url_filter of PublisherSpec.
All unit tests are working fine but after testing the crawler myself I recognized videos and slideshows from my selected newspaper don't get filtered.
Is this a bug or is this my fault?
How to reproduce
Expected behavior.
Only "*/article*" urls should be shown. Instead there are urls containing "*/video*" or "*/slideshow*". (Depending on if the last 20 news are even containing videos or slideshows=
Logs and Stack traces
No response
Screenshots
No response
Additional Context
No response
Environment