Hey, I've indexed the dataset. When I use the topics you provided for retrieval, an error occurs for the training/dev set (test set works well):
2019-12-11 21:57:43,459 INFO [main] search.SearchCollection (SearchCollection.java:212) - Reading index at /data/index/lucene-index.car17v2.0.pos+docvectors+rawdocs
2019-12-11 21:57:43,659 INFO [main] search.SearchCollection (SearchCollection.java:239) - Use Bag of Terms query
java.lang.IllegalArgumentException: URLDecoder: Illegal hex characters in escape (%) pattern - Error at index 0 in: "%2"
at java.base/java.net.URLDecoder.decode(URLDecoder.java:232)
at java.base/java.net.URLDecoder.decode(URLDecoder.java:142)
at io.anserini.search.topicreader.CarTopicReader.read(CarTopicReader.java:47)
at io.anserini.search.topicreader.TopicReader.read(TopicReader.java:58)
at io.anserini.search.SearchCollection.runTopics(SearchCollection.java:373)
at io.anserini.search.SearchCollection.main(SearchCollection.java:559)
Exception in thread "main" java.lang.IllegalArgumentException: Unable to load topic reader: Car
at io.anserini.search.SearchCollection.runTopics(SearchCollection.java:376)
at io.anserini.search.SearchCollection.main(SearchCollection.java:559)
I guess the problem is the different format between train/dev and test set since anserini fails to parae the train/ dev queries.
The top 10 lines of dev set is
Hey, I've indexed the dataset. When I use the topics you provided for retrieval, an error occurs for the training/dev set (test set works well):
I guess the problem is the different format between train/dev and test set since anserini fails to parae the train/ dev queries. The top 10 lines of dev set is
while the top 10 lines of test set is
could you help with this? thank you!