issues
search
edgi-govdata-archiving
/
web-monitoring
Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
Creative Commons Attribution Share Alike 4.0 International
105
stars
17
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
GSoC Report for Week 3 of Phase 3
#69
janakrajchadha
closed
7 years ago
3
Data analysis to look for patterns in insignificant changes
#68
janakrajchadha
closed
5 years ago
1
Important Change Identification Road Map/Plan
#67
janakrajchadha
closed
5 years ago
1
Update Trello onboarding with newer videos
#66
lightandluck
closed
6 years ago
7
GSoC Report for Week 2 of Phase 3
#65
janakrajchadha
closed
7 years ago
3
DevOps issues from slack/mr0grog
#64
lightandluck
closed
5 years ago
3
GSoC Report for Week 1 of Phase 3
#63
janakrajchadha
closed
7 years ago
8
Determine how to trigger processing/analysis when new versions are added to DB
#62
Mr0grog
closed
5 years ago
4
GSoC Report for Week 4 of Phase 2
#61
janakrajchadha
closed
7 years ago
2
GSoC Report for Week 3 of Phase 2
#60
janakrajchadha
closed
7 years ago
2
Implement caching to reduce loading time while working with diff files
#59
janakrajchadha
closed
6 years ago
10
GSoC Report for Week 2 of Phase 2
#58
janakrajchadha
closed
7 years ago
2
GSoC Report for Week 1 of Phase 2
#57
janakrajchadha
closed
7 years ago
2
GSoC Report for Week 5 of Phase 1
#56
janakrajchadha
closed
7 years ago
5
Onboarding: Add a database API review step
#55
patcon
closed
5 years ago
4
GSoC Report for Week 4 Phase 1
#54
janakrajchadha
closed
7 years ago
12
Add functionality to get cabinet ID of a specific URL
#53
janakrajchadha
closed
7 years ago
1
Create functions that ID/characterize page elements for later use in filtration
#52
suchthis
closed
7 years ago
2
Determine key features in diffs that could be used for filtration
#51
suchthis
closed
7 years ago
7
Update training set (the ~300 changes)
#50
suchthis
closed
5 years ago
1
Clean the set of 300 “significant” changes in prep for model training
#49
suchthis
closed
5 years ago
1
Create experimental methods for computing differences
#48
suchthis
closed
5 years ago
4
Add README reference to Trello onboarding process
#47
patcon
closed
7 years ago
1
Document the differences in the data format of the different sources (PageFreezer, Versionista).
#46
titaniumbones
closed
1 year ago
25
Point README to youtube video rather than dropbox.
#45
patcon
closed
7 years ago
0
Comments on Trello board: EDGI: Web Monitoring Project - Onboarding
#44
KrzysztofMadejski
closed
5 years ago
7
Essential Diffing Tools for v0
#43
danielballan
closed
7 years ago
5
Migrate analyst training video to YouTube
#42
patcon
closed
7 years ago
9
remove global sprint
#41
titaniumbones
closed
7 years ago
0
QuickStart: What I need to do to start monitoring website X
#40
KrzysztofMadejski
opened
7 years ago
4
Move Sprint-specific info to a HackPad
#39
titaniumbones
closed
7 years ago
1
Include web-monitoring-db in Mozsprint README
#38
Mr0grog
closed
7 years ago
0
Analyst training video is 15, not 50 minutes.
#37
patcon
closed
7 years ago
2
Create a service to diff PDF files
#36
Mr0grog
opened
7 years ago
28
Update README to include web-monitoring-ui for Mozsprint
#35
lightandluck
closed
7 years ago
1
Add CC License
#34
titaniumbones
closed
7 years ago
6
Add Mozilla Global Sprint Welcome message
#33
dcwalk
closed
7 years ago
2
Add a link and overview of web-monitoring-versionista-scraper
#32
Mr0grog
closed
7 years ago
0
Hidden changes and Derived changes
#31
ChaiBapchya
closed
5 years ago
2
Tagging
#30
danielballan
closed
6 years ago
1
Discussion on Content Moderation
#29
ChaiBapchya
closed
5 years ago
12
Added environmental corpus
#28
ChaiBapchya
closed
7 years ago
6
Add metadata for Versionista to schema
#27
Mr0grog
closed
7 years ago
2
Add 1.0 target label?
#26
titaniumbones
closed
7 years ago
14
Question: Does IA expose their raw WARCs?
#25
danielballan
closed
5 years ago
10
Pull versions from sentry for diffing
#24
danielballan
closed
5 years ago
3
☂ Pull Versions from IA for diffing
#23
danielballan
closed
5 years ago
15
Discussion on Architecture diagram
#22
ChaiBapchya
closed
5 years ago
10
Designing Tentative Database Schema
#21
ChaiBapchya
closed
7 years ago
4
Incorporate Cluster in the schema
#20
danielballan
opened
7 years ago
13
Previous
Next