StephenOTT / OttawaSIREScrape

Scrape using FMiner to scrape content from the City of Ottawa's SIRE platform (Council Voting and Minutes)
1 stars 0 forks source link

Meeting Agenda has a Active email URL in it #21

Open StephenOTT opened 11 years ago

StephenOTT commented 11 years ago

Currently i am using a //a xpath query to capture all URLs on the Agenda HTML page.

But as we see with this page: http://app05.ottawa.ca/sirepub/mtgviewer.aspx?meetid=2409&doctype=AGENDA

Screen Shot 2013-02-26 at 1 23 20 AM

This is a issue with the original Word Document. The word document should be adjustmed to remove this email address as a active URL.

The reasoning for removing it is that every other link are links that activate the video and right hand panes.

Having a active email address is not connected with the user expectations.

@giggey

StephenOTT commented 11 years ago

If we are unable to remove the active email address i can modify the xpath to look for a http in the url.

giggey commented 11 years ago

I'm pretty sure they'll want to keep the email as an active URL b/c they would think it's easier for the general public to click on that to contact the person vs. trying to look them up in some directory that they may or may not know exists.

If the xpath can be modified to go around this I'm thinking it's better?

StephenOTT commented 11 years ago

@giggey its the only document with a email addresses as a active link in it. Must be a oversight/mistake?

giggey commented 11 years ago

I see. Should I ask if they can remove still?

StephenOTT commented 11 years ago

Would like to know why it is a link in the first place. If it was done on purpose or if it was a mistake.

All of the links in agendas appear to be related to right hand (voting information) boxes. But then there is this email.

Removal would be good. But if they have it there as a active link on purpose i think we can accommodate that.

giggey commented 11 years ago

never got to this. should I still ask Clerks?

StephenOTT commented 11 years ago

@giggey ya would be good to know. No rush for it. But something to keep in mind.

giggey commented 11 years ago

will do and will follow-up.