ecologylab / BigSemanticsWrapperRepository

Repository of wrappers used by the BigSemantics project.
Apache License 2.0
3 stars 9 forks source link

automatically detect web page structure changes #5

Closed quyin closed 9 years ago

quyin commented 11 years ago

1) overall process: run extraction on example_urls, and compare with precached extraction results.

2) builds upon a structure that caches and updates html pages and extraction results. this structure can also be used for unit testing extraction.

quyin commented 9 years ago

This should be merged into https://github.com/ecologylab/BigSemanticsJava/issues/11