kbss-cvut / s-pipes

Tool for execution of RDF-based pipelines.
GNU Lesser General Public License v3.0

Excel/HTML files should be processed directly in Tabular module #228

Open blcham opened 11 months ago

blcham commented 11 months ago

In the Tabular module, we transform HTML and Excel files into CSV and then process them using the supercsv library. We need to process these files directly, without the intermediate conversion to CSV. To do that, we need to refactor the code first.

There should be a unified interface for processing tabular data, using (I guess) the adapter pattern. Please make small commits so that the refactoring proceeds incrementally.
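To make the idea concrete, here is a minimal sketch of what such a unified interface could look like. All names (`TabularReader`, `CsvTextReader`, `GridReader`, `TabularProcessor`) are hypothetical and do not exist in s-pipes. The CSV adapter uses a naive split instead of supercsv, and `GridReader` stands in for an Excel or HTML adapter that would wrap a real parser.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical unified interface: every source format exposes rows of string cells.
interface TabularReader {
    /** Returns the next row, or null when the input is exhausted. */
    List<String> readRow();
}

// Adapter for comma-separated text. Assumption: naive split, no quoting support;
// a real adapter would delegate to the CSV library instead.
final class CsvTextReader implements TabularReader {
    private final String[] lines;
    private int next = 0;

    CsvTextReader(String text) {
        this.lines = text.isEmpty() ? new String[0] : text.split("\\R");
    }

    @Override
    public List<String> readRow() {
        if (next >= lines.length) {
            return null;
        }
        // Limit -1 keeps trailing empty cells.
        return Arrays.asList(lines[next++].split(",", -1));
    }
}

// Adapter over an already-parsed grid of cells. Assumption: this stands in for
// an Excel sheet or HTML table adapter that would wrap the respective parser.
final class GridReader implements TabularReader {
    private final List<List<String>> grid;
    private int next = 0;

    GridReader(List<List<String>> grid) {
        this.grid = grid;
    }

    @Override
    public List<String> readRow() {
        return next < grid.size() ? grid.get(next++) : null;
    }
}

// Processing code depends only on the interface, not on the source format.
final class TabularProcessor {
    static List<String> describeRows(TabularReader reader) {
        List<String> out = new ArrayList<>();
        List<String> row;
        int index = 0;
        while ((row = reader.readRow()) != null) {
            out.add("row " + index++ + ": " + String.join("|", row));
        }
        return out;
    }
}
```

With this shape, the existing CSV path could be moved behind the interface first, and the Excel and HTML adapters added in later commits without touching the processing code.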

A/C: