mam10eks / search-homepage-of-university-leipzig

A seachengine to retrieve informations from the homepage of the university leipzig. Created during the information retrieval internship in winter semester 2017.
1 stars 1 forks source link

Index all currently available segments #4

Closed mam10eks closed 6 years ago

mam10eks commented 6 years ago

After the description of sebastian you should create an index out of all segments that we currently have so that the search engine works on real data

mam10eks commented 6 years ago

I will do this until next monday (20.11.2017)

mam10eks commented 6 years ago

The index could be created by leveraging the script 'index_data.sh' within the project https://github.com/mam10eks/nutch_tools/ in the directory 'index_lucene'.

But unfortunately I could not upload files bigger than 100MB here so I could not share the directly. This will be solved in #12