issues
search
CivicActions
/
edscrapers
US Department of Education Data Scraping Kit; see https://us-ed-scraping.ckan.io/dataset
GNU Affero General Public License v3.0
15
stars
9
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Doc improvements
#127
osahon-okungbowa
closed
4 years ago
0
Dashboard fixes & code portability
#126
osahon-okungbowa
closed
4 years ago
1
Datajson sanitisation
#125
osahon-okungbowa
closed
4 years ago
0
Parser fixes 2
#124
osahon-okungbowa
closed
4 years ago
0
Improvement & Fixes based on QA feedback
#123
osahon-okungbowa
closed
4 years ago
1
[parser-fixes][m]: fixed & improved FSA parser based on QA feedback
#122
osahon-okungbowa
closed
4 years ago
0
[oese-parser-improvements][m]: improved parser metadata & content quality
#121
osahon-okungbowa
closed
4 years ago
0
[opepd-parser-improvements][m]: improved metadata & content quality
#120
osahon-okungbowa
closed
4 years ago
0
[osers-parser-fix][fix]: handle cases where non-text content are passed to parser
#119
osahon-okungbowa
closed
4 years ago
0
Add the sanitized dataset properties to the resulting data.json file
#118
nightsh
closed
4 years ago
0
Migrate statistics and dashboard to the new infra
#117
nightsh
opened
4 years ago
0
[stub] Scrape and harvest dataset documentation
#116
nightsh
opened
4 years ago
0
Scrape and harvest collections / sources - PROTOTYPE [P1(OCR)]
#115
nightsh
closed
4 years ago
0
data.json Schema changes
#114
nightsh
opened
4 years ago
0
Rag transform 2: improved RAG computation
#113
osahon-okungbowa
closed
4 years ago
0
Improve stats through better RAG computation
#112
osahon-okungbowa
closed
4 years ago
1
Filter ZIP files by contents
#111
nightsh
opened
4 years ago
0
Sanitize transform: apply data sanitizing steps
#110
osahon-okungbowa
closed
4 years ago
1
Sanitising Datasets
#109
osahon-okungbowa
closed
4 years ago
1
Deduplicate transform improvements
#108
osahon-okungbowa
closed
4 years ago
1
Improve Deduplication of Datasets
#107
osahon-okungbowa
closed
4 years ago
1
[nces-parser-improvements][improvement]: improved regex on dataset title
#106
osahon-okungbowa
closed
4 years ago
0
osers-parser-phase3: improved parser content & metadata quality
#105
osahon-okungbowa
closed
4 years ago
0
[Parsing] Extract level of data from resource name
#104
nightsh
closed
4 years ago
2
Fix regex middleware to allow empty `allowed_domains` spider property
#103
nightsh
closed
4 years ago
0
Edgov parser phase3
#102
osahon-okungbowa
closed
4 years ago
1
Phase 3 - Improve P8 (ed.gov) Parser metadata quality
#101
osahon-okungbowa
closed
4 years ago
0
Nces parser phase3
#100
osahon-okungbowa
closed
4 years ago
0
Change data quality assessment to measure datajson output
#99
nightsh
closed
4 years ago
0
Define architecture & infrastructure
#98
nightsh
closed
4 years ago
0
Phase 3 - Improve metadata quality P9 (NCES) parser
#97
osahon-okungbowa
closed
4 years ago
0
[oela-parser-2][s]: improve metadata gathering quality of P4 parser
#96
osahon-okungbowa
closed
4 years ago
0
Automate the scraping pipelines
#95
nightsh
closed
4 years ago
0
Fix OSERS parser not importing base helpers
#94
nightsh
closed
4 years ago
0
Change import statements for all parsers
#93
nightsh
closed
4 years ago
0
Ope parser 2
#92
osahon-okungbowa
closed
4 years ago
0
[octae-parser-2][s]: improved parser metadata
#91
osahon-okungbowa
closed
4 years ago
0
Scraper for FSA
#90
nightsh
closed
4 years ago
0
Ocr parser 2c
#89
osahon-okungbowa
closed
4 years ago
0
Uniform I/O between transformers, tools and dashboard
#88
nightsh
opened
4 years ago
0
Collect headers for resources
#87
nightsh
closed
4 years ago
0
Add FSA scraper
#86
nightsh
closed
4 years ago
1
RAG summary
#85
nightsh
closed
4 years ago
0
Improved dashboard
#84
nightsh
closed
4 years ago
0
nces-parser-04: final parser for nces variant page structure
#83
osahon-okungbowa
closed
4 years ago
0
Update Tech Spec to reflect metrics and dashboard changes
#82
osahon-okungbowa
closed
4 years ago
1
dash-insights-03: created the dash 'insights' page
#81
osahon-okungbowa
closed
4 years ago
1
v1- Report / Visualise Gathered Metrics
#80
osahon-okungbowa
closed
4 years ago
1
Dash insights: created dashboard 'insights' page
#79
osahon-okungbowa
closed
4 years ago
2
nces parser 03
#78
osahon-okungbowa
closed
4 years ago
0
Previous
Next