CivicActions / edscrapers

US Department of Education Data Scraping Kit; see https://us-ed-scraping.ckan.io/dataset
GNU Affero General Public License v3.0

(New Office) Institute of Education Sciences: create scraper for this office (Phase 1) #158

Closed higorspinto closed 4 years ago

higorspinto commented 4 years ago

Description

The Institute of Education Sciences (IES) is one of the new offices whose datasets need to be ingested into the data portal. Our prior scraping run already included an NCES scraper, and NCES is a suboffice (sub-organisation) of IES. We therefore need to update the current NCES scraper so that it covers IES and all of its suboffices, including NCES.

https://ies.ed.gov/

Acceptance Criteria

Tasks

Jira Card

georgiana-b commented 4 years ago

Solved in https://github.com/CivicActions/edscrapers/pull/173.

We ended up adapting the existing NCES parser to also handle the results on https://ies.ed.gov/pubsearch/index.asp, since its page structure is very similar to that of https://nces.ed.gov/pubsearch/index.asp.
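
For readers who want a feel for the approach, here is a minimal sketch of one parser shared by both search pages. It is not the code from the pull request. The `pubsinfo.asp` link marker, the `PublicationLinkParser` class, and the `parse_pubsearch` function are all hypothetical names and assumptions about the markup, chosen only to show how a single parser can be pointed at either office.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Search pages named in this thread; the markup handled below is an assumption.
PUBSEARCH_PAGES = {
    "nces": "https://nces.ed.gov/pubsearch/index.asp",
    "ies": "https://ies.ed.gov/pubsearch/index.asp",
}


class PublicationLinkParser(HTMLParser):
    """Collect (title, href) pairs for anchors that look like publication links."""

    def __init__(self, link_marker="pubsinfo.asp"):
        super().__init__()
        self.link_marker = link_marker  # hypothetical marker for result links
        self.links = []
        self._href = None  # href of the anchor currently being read, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            if self.link_marker in href:
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def parse_pubsearch(html, office):
    """Parse one results page and resolve its links against the office's search URL."""
    parser = PublicationLinkParser()
    parser.feed(html)
    parser.close()
    base = PUBSEARCH_PAGES[office]
    return [
        {"office": office, "title": title, "url": urljoin(base, href)}
        for title, href in parser.links
    ]
```

Because only the base URL differs between the two offices, calling `parse_pubsearch(html, "ies")` or `parse_pubsearch(html, "nces")` reuses the same extraction logic and changes only how relative links are resolved.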