patrickfrey / strusAnalyzer

Library for document analysis (segmentation, tokenization, normalization, aggregation) with the goal to get a set of items that can be inserted into a strus storage. Also some functions for analysing tokens or phrases of the strus query are provided.
http://www.project-strus.net
Mozilla Public License 2.0
3 stars 0 forks source link

How to index the filename? #54

Open andreasbaumann opened 7 years ago

andreasbaumann commented 7 years ago

This would be cool because I can then set something like:

docid: bigxmlfile.xml/17

with bigxmlfile.xml containing:

<DOCS>
  <DOC>
    <DOCID>17</DOCID>
...