adbar / htmldate

Fast and robust date extraction from web pages, with Python or on the command-line
https://htmldate.readthedocs.io
Apache License 2.0

The script does not find the date (Russian) #156

Open PetroffSky opened 2 days ago

PetroffSky commented 2 days ago

The script does not find the date (Russian):

from htmldate import find_date

url = "https://kamaz.ru/press/releases/kamaz_i_skolkovo_sozdadut_ekologicheski_chistyy_gruzovik/"

print(find_date(url, extensive_search=True))   # Returns None
print(find_date(url, extensive_search=False))  # Returns None

XPath selector for the date on the page: //div[contains(text(), 'July 30, 2015')]
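Until this is handled in the library, a stopgap could be to parse the text of that div directly. The following is only a rough sketch, not htmldate code: `parse_text_date`, `MONTHS`, and both patterns are hypothetical names, and the Russian "30 июля 2015" form is an assumption about how the date might appear in the raw page.

```python
import re
from datetime import date
from typing import Optional

# Hypothetical lookup table, not part of htmldate.
# English month names plus Russian genitive forms (assumed page format).
MONTHS = {
    "january": 1, "february": 2, "march": 3, "april": 4,
    "may": 5, "june": 6, "july": 7, "august": 8,
    "september": 9, "october": 10, "november": 11, "december": 12,
    "января": 1, "февраля": 2, "марта": 3, "апреля": 4,
    "мая": 5, "июня": 6, "июля": 7, "августа": 8,
    "сентября": 9, "октября": 10, "ноября": 11, "декабря": 12,
}

# "July 30, 2015" (month day, year)
EN_PATTERN = re.compile(r"\b([A-Za-z]+) (\d{1,2}), (\d{4})\b")
# "30 июля 2015" (day month year), assumed Russian layout
RU_PATTERN = re.compile(r"\b(\d{1,2}) ([а-яё]+) (\d{4})\b", re.IGNORECASE)


def parse_text_date(text: str) -> Optional[str]:
    """Return the first valid date found in text as YYYY-MM-DD, else None."""
    candidates = []
    for m in EN_PATTERN.finditer(text):
        candidates.append((m.group(1), m.group(2), m.group(3)))
    for m in RU_PATTERN.finditer(text):
        candidates.append((m.group(2), m.group(1), m.group(3)))

    for month_name, day, year in candidates:
        month = MONTHS.get(month_name.lower())
        if month is None:
            continue  # word is not a known month name
        try:
            return date(int(year), month, int(day)).isoformat()
        except ValueError:
            continue  # impossible date such as February 31
    return None


print(parse_text_date("<div>July 30, 2015</div>"))
print(parse_text_date("<div>30 июля 2015</div>"))
```

The helper returns the same YYYY-MM-DD shape that find_date produces by default, so it could serve as a fallback when find_date returns None.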