Closed Frankie-Figz closed 4 years ago
@Frankie-Figz Great work! Do you think you could add support for those events that don't have a Register link on their individual page, e.g. this one? I ask because without this we're missing 3/8 of the events, at least for this current snapshot. And it shouldn't be too difficult with the Google Calendar / ICS links there. Those probably contain all the data you need if the page itself doesn't.
Yes, I think so. Shouldn't be a problem if I manage to overcome my laziness.
On Thu, Feb 27, 2020, 8:29 PM Scott McAllister notifications@github.com wrote:
@Frankie-Figz (https://github.com/Frankie-Figz) Great work! Do you think you could add support for those events that don't have a Register link on their individual page, e.g. this one (https://www.lfwa.org/events/free-trees-from-strangling-vines-7kfa7-ws676-7zf9e-nklh7-bst6e)? I ask because without this we're missing 3/8 of the events, at least for this current snapshot. And it shouldn't be too difficult with the Google Calendar / ICS links there. Those probably contain all the data you need if the page itself doesn't.
I added handling for events without a registration link by falling back to the ICS link. This requires the ics and pytz modules, so please add the following to the requirements.txt file:
ics==0.7
pytz==2018.9
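For anyone reviewing the fallback, here is a rough sketch of the idea of pulling event fields out of an ICS payload. It is not the code in this PR: it uses only the standard library instead of the ics module, the helper names `parse_ics_event` and `parse_utc` are made up for illustration, and the sample date and location are invented.

```python
from datetime import datetime, timezone


def parse_ics_event(ics_text):
    """Return the properties of the first VEVENT in an ICS string as a dict."""
    # Unfold continuation lines: a line starting with a space or tab
    # continues the previous line (RFC 5545 line folding).
    unfolded = []
    for raw in ics_text.splitlines():
        if raw[:1] in (" ", "\t") and unfolded:
            unfolded[-1] += raw[1:]
        else:
            unfolded.append(raw)

    fields = {}
    in_event = False
    for line in unfolded:
        if line == "BEGIN:VEVENT":
            in_event = True
        elif line == "END:VEVENT":
            break
        elif in_event and ":" in line:
            name, value = line.split(":", 1)
            # Drop parameters such as ";TZID=..." from the property name.
            fields[name.split(";", 1)[0].upper()] = value
    return fields


def parse_utc(stamp):
    """Parse a UTC timestamp such as '20200301T150000Z' into an aware datetime."""
    return datetime.strptime(stamp, "%Y%m%dT%H%M%SZ").replace(tzinfo=timezone.utc)


# Invented sample data; only the event title echoes the page linked above.
SAMPLE_ICS = "\r\n".join([
    "BEGIN:VCALENDAR",
    "VERSION:2.0",
    "BEGIN:VEVENT",
    "SUMMARY:Free Trees from Strangling Vines",
    "DTSTART:20200301T150000Z",
    "DTEND:20200301T180000Z",
    "LOCATION:Example Park",
    "END:VEVENT",
    "END:VCALENDAR",
])

event = parse_ics_event(SAMPLE_ICS)
start = parse_utc(event["DTSTART"])
print(event["SUMMARY"], start.isoformat())
```

Converting `start` to the local time zone is where pytz would come in; that step is left out here to keep the sketch dependency-free.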
I also added a for loop that iterates over the article tags of type "event". This way we capture events that are listed at multiple venues and avoid events without a registration link.
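To make the loop easier to picture, here is a small sketch of walking the event articles on a listing page and choosing a link per event. Everything about the markup is an assumption: the `class="event"`, `class="register"` and `class="ics"` attributes, the `EventArticleParser` name, and the sample HTML are hypothetical, and the standard-library parser stands in for whatever HTML library the scraper actually uses.

```python
from html.parser import HTMLParser


class EventArticleParser(HTMLParser):
    """Collect the register/ICS links found inside each <article class="event">."""

    def __init__(self):
        super().__init__()
        self.events = []      # one dict per event article, in page order
        self._current = None  # the article being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "article" and "event" in classes:
            self._current = {"register": None, "ics": None}
        elif tag == "a" and self._current is not None:
            # Hypothetical class names marking the two kinds of link.
            if "register" in classes:
                self._current["register"] = attrs.get("href")
            elif "ics" in classes:
                self._current["ics"] = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "article" and self._current is not None:
            self.events.append(self._current)
            self._current = None


# Invented listing page: one event with a Register link, one with ICS only.
SAMPLE_HTML = """
<article class="event">
  <a class="register" href="/register/tree-planting">Register</a>
  <a class="ics" href="/events/tree-planting.ics">ICS</a>
</article>
<article class="event">
  <a class="ics" href="/events/vine-removal.ics">ICS</a>
</article>
<article class="news"><a class="register" href="/ignored">x</a></article>
"""

parser = EventArticleParser()
parser.feed(SAMPLE_HTML)

sources = []
for event in parser.events:
    # Prefer the registration link; fall back to the ICS link when it is absent.
    sources.append(event["register"] or event["ics"])
print(sources)
```

Iterating per article rather than per page means each event is handled on its own, so one event lacking a Register link does not affect the others.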
In addition, the status is now read correctly via "status_code".