machawk1 / wail

Web Archiving Integration Layer: One-Click User Instigated Preservation
https://matkelly.com/wail
MIT License

Integrate OutbackCDX into WAIL for more efficient indexing #521

Open machawk1 opened 3 years ago

machawk1 commented 3 years ago

https://github.com/nla/outbackcdx

OutbackCDX requires Java 8 or 11. That requirement will be covered by #345 once it is merged, since the newest Heritrix also requires a Java version newer than 7.

machawk1 commented 3 years ago

Per the OutbackCDX README, since we are still using OpenWayback for replay, the Wayback config can point its resource index at OutbackCDX with:

<property name="resourceIndex">
  <bean class="org.archive.wayback.resourceindex.RemoteResourceIndex">
    <property name="searchUrlBase" value="http://localhost:8080/myindex" />
  </bean>
</property>
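
For the WAIL side, here is a minimal Python sketch of how CDX records might be pushed into that index and looked up again. It assumes OutbackCDX is running locally on port 8080 with a collection named `myindex` (matching `searchUrlBase` above), and that, as I understand the OutbackCDX README, records are loaded by POSTing CDX lines to the collection URL and queried with a `url` parameter. The function names and `INDEX_URL` are hypothetical and not part of WAIL.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: local OutbackCDX instance and collection name taken from the
# searchUrlBase value in the OpenWayback config above.
INDEX_URL = "http://localhost:8080/myindex"


def build_ingest_request(cdx_path, index_url=INDEX_URL):
    """Build a POST request whose body is the raw contents of a CDX file."""
    with open(cdx_path, "rb") as f:
        body = f.read()
    return Request(index_url, data=body, method="POST")


def build_query_url(url, index_url=INDEX_URL):
    """Build the lookup URL for captures of `url` in the collection."""
    return index_url + "?" + urlencode({"url": url})


def ingest(cdx_path, index_url=INDEX_URL):
    """Send the CDX file to the index and return the HTTP status code."""
    with urlopen(build_ingest_request(cdx_path, index_url)) as resp:
        return resp.status


def query(url, index_url=INDEX_URL):
    """Return the index's response for `url` as text."""
    with urlopen(build_query_url(url, index_url)) as resp:
        return resp.read().decode("utf-8")
```

Something like `ingest("archive.cdx")` after each crawl, followed by `query("example.org")` as a sanity check, would replace the current step of regenerating and sorting a flat CDX file. The network calls are kept separate from the request builders so the latter can be checked without a running server.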