Open GoogleCodeExporter opened 8 years ago
Ok. I'll recompile the source and check to make sure that this issue is
resolved. Sorry for not getting to this sooner.
Gully
Original comment by GullyBu...@gmail.com
on 8 May 2013 at 5:36
Latest version is available online. I do not know what the issue is here since
we could not reproduce it and the string edu.isi.bmkeg.pdf.model.ChunkBlock
only occurred in two examples of old rule files. If you were attempting to use
those rule files to run extraction this would not have worked and might have
caused the errors you described.
Is this still an issue?
Original comment by GullyBu...@gmail.com
on 22 May 2013 at 2:29
Thanks a lot. I can confirm that this is working for me now, with version
1.7.2-SNAPSHOT. Indeed, I think I was using an old rule file.
Your help and support is very much appreciated.
Original comment by javb...@gmail.com
on 22 May 2013 at 4:58
Hello. I have such similar issue even though i am using C:\Program Files
(x86)\lapdftext\lapdftext-1.7.2-SNAPSHOT-jar-with-dependencies.jar
the java script is as below:
File pdf = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm11.pdf");
File outXmlFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm1.xml");
File ruleFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm1.drl");
//File ruleFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\general.drl");
LapdfEngine engine = new LapdfEngine();
LapdfDocument lapdf = engine.blockifyPdfFile(pdf);
engine.classifyDocument(lapdf, ruleFile);
engine.writeSectionsToOpenAccessXmlFile(lapdf, outXmlFile);
I attached the files. Indeed the pdf file is plos biology article which is
given as an example in lapdftext site. drl file is also epoch_7Jun_8.drl. So as
a result of my code I got an erro as below:
java.lang.ClassNotFoundException: Unable to find class 'ChunkBlock'
at org.drools.base.ClassTypeResolver.resolveType(ClassTypeResolver.java:215)
at org.drools.compiler.PackageBuilder.mergePackage(PackageBuilder.java:897)
at org.drools.compiler.PackageBuilder.newPackage(PackageBuilder.java:871)
at org.drools.compiler.PackageBuilder.addPackage(PackageBuilder.java:578)
at org.drools.compiler.PackageBuilder.addPackageFromDrl(PackageBuilder.java:267)
at org.drools.compiler.PackageBuilder.addKnowledgeResource(PackageBuilder.java:459)
at org.drools.builder.impl.KnowledgeBuilderImpl.add(KnowledgeBuilderImpl.java:28)
at edu.isi.bmkeg.lapdf.classification.ruleBased.RuleBasedChunkClassifier.<init>(RuleBasedChunkClassifier.java:83)
at edu.isi.bmkeg.lapdf.controller.LapdfEngine.classifyDocument(LapdfEngine.java:334)
at lapdftext.FirstTrial.<init>(FirstTrial.java:25)
at lapdftext.FirstTrial.main(FirstTrial.java:45)
ERROR: edu.isi.bmkeg.lapdf.classification.ruleBased.RuleBasedChunkClassifier -
GlobalError: chunkError importing :
'edu.isi.bmkeg.pdf.features.ChunkFeatures'Error importing :
'edu.isi.bmkeg.pdf.model.ChunkBlock'
INFO : edu.isi.bmkeg.lapdf.controller.LapdfEngine - Writing block-classified
XML in OpenAccess format C:\LayoutAwarePDFText\lapdftext-master\dnm1.xml
What might be the problem?
Thanks in advance.
Kindest regards,
Mehmet
Original comment by malee1...@gmail.com
on 5 Feb 2015 at 10:16
Attachments:
Original issue reported on code.google.com by
javb...@gmail.com
on 4 Apr 2013 at 6:03