KevM / tikaondotnet

Use the Java Tika text extraction library on the .NET platform
http://kevm.github.io/tikaondotnet/
Apache License 2.0
195 stars 73 forks source link

Building TextExtraction #121

Open cvalde opened 6 years ago

cvalde commented 6 years ago

Hello, I´m trying to build TikaOnDotnet.TextExtraction myself to be able to debug it and learn a bit. However, I get 'C:\projects\tikaondotnet-master\src\TikaOnDotnet.TextExtractor....\SolutionInfo.cs' not found.

What is this file? Is it something created before the build runs? Can I build from the GUI? I don't understand if build.cmd should be used instead of the VS GUI. I'm using VS 2015. I downloaded the zip and tried to generate the project. Thanks.

KevM commented 6 years ago

Thanks for checking us out. Take a look at the Developer guide. It should help you get going. https://github.com/KevM/tikaondotnet/blob/master/Developers.md

cvalde commented 6 years ago

Yes, I read the developers guide but finally took a simpler route. Ignored the missing SoliutionInfo.cs and built anyway from the VS GUI. I'm not creating a package but only a basic application to be able to debug TextExtractor. I made some modifications to observe the effect in the output. Apparently it's not possible to suppress the output of the "meta" lines like this one

but I made the example output xml instead of plain text. For me it's a good start, thanks.

KevM commented 6 years ago

Hmm the guide should have helped you through the SolutionInfo.cs problem. ☹️