apache / incubator-stormcrawler

A scalable, mature and versatile web crawler based on Apache Storm
https://stormcrawler.apache.org/
Apache License 2.0
883 stars 260 forks source link

Update version of StormCrawler in READMEs and archetypes #1317

Open jnioche opened 1 week ago

jnioche commented 1 week ago

The website also needs fixing

rzo1 commented 1 week ago

@kunalpal97 would you like to check the website as well for similar occurences?

kunalpal97 commented 1 week ago

@rzo1 I’d be happy to help! Could you clarify which sections or pages of the website you'd like me to review, or should I go through the entire site?

rzo1 commented 1 week ago

The Link by @jnioche contains references to the archetype and the Storm version. There might be other occurences as well, maybe a search in an IDE will reveal other locations as well.

jnioche commented 1 week ago

For the website here are my notes for the pre-Apache releases, the paths will need changing

git checkout gh-pages

point 
[_includes/header.html](https://github.com/DigitalPebble/storm-crawler/edit/gh-pages/_includes/header.html) to latest release

Modify “getting started” as well

[getting-started/index.html](https://github.com/DigitalPebble/storm-crawler/edit/gh-pages/getting-started/index.html)

For the READMEs in the repo

 README.md
 ./external/warc/README.md  
 ./external/opensearch/README.md

I used to grep -r on the previous version number to find occurrences.

Thanks @kunalpal97 for offering to contribute