carpentries-incubator / lc-webscraping

Introduction to web scraping
https://carpentries-incubator.github.io/lc-webscraping/
Other
37 stars 28 forks source link

Deal with content from Additions heading in episode 02 #57

Open ndporter opened 1 year ago

ndporter commented 1 year ago

Accompanies PR #56 . This section includes multiple FIXME headings that lead me to assume it wasn't meant to be in the public version of the lesson.

Maintainers should determine whether to add corrected/complete information on some/all of these topics or remove them entirely. Merging the PR would remove it from the published lesson. At that point this issue can be used to discuss longer-term solutions.

Additions

FIXME: add more XPath functions such as concat() and normalize-space(). FIXME: mention XPath Checker for Firefox FIXME: Firefox sometime cleans up the HTML of a page before displaying it, meaning that the DOM tree we can access through the console might not reflect the actual source code. <tbody> elements are typically not reliable. The Scrapy documentation has more on the topic.