Ignorato / lapdftext

Automatically exported from code.google.com/p/lapdftext

ClassNotFoundException: Unable to find class 'ChunkBlock' #6

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
Run extractFullText

What is the expected output? What do you see instead?
I get a ClassNotFoundException for the class
edu.isi.bmkeg.pdf.model.ChunkBlock.
However, the class included in the snapshot jar
lapdftext-1.7.1-SNAPSHOT-jar-with-dependencies.jar is instead:
edu/isi/bmkeg/lapdf/model/ChunkBlock.class
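
One way to confirm which package a class actually lives in is to list the jar's entries. The following is a minimal sketch using the standard java.util.jar API; the class name JarClassLocator and its locate method are hypothetical helpers, not part of lapdftext.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;

class JarClassLocator {
    /**
     * Returns the package-qualified names of all classes in the jar whose
     * simple name matches. Classes in the default package are not matched.
     */
    static List<String> locate(String jarPath, String simpleName) throws IOException {
        List<String> found = new ArrayList<>();
        String suffix = "/" + simpleName + ".class";
        try (JarFile jar = new JarFile(jarPath)) {
            Enumeration<JarEntry> entries = jar.entries();
            while (entries.hasMoreElements()) {
                String name = entries.nextElement().getName();
                if (name.endsWith(suffix)) {
                    // Turn "a/b/C.class" into "a.b.C".
                    String withoutExt = name.substring(0, name.length() - ".class".length());
                    found.add(withoutExt.replace('/', '.'));
                }
            }
        }
        return found;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: JarClassLocator <jar> <SimpleClassName>");
            return;
        }
        for (String qualifiedName : locate(args[0], args[1])) {
            System.out.println(qualifiedName);
        }
    }
}
```

Run against the snapshot jar with ChunkBlock as the second argument, this prints each package-qualified match, which can then be compared with the name in the exception.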

What version of the product are you using? On what operating system?
1.7.1-SNAPSHOT, Mac OS X Mountain Lion

Please provide any additional information below.

Original issue reported on code.google.com by javb...@gmail.com on 4 Apr 2013 at 6:03

GoogleCodeExporter commented 8 years ago
Ok. I'll recompile the source and check to make sure that this issue is 
resolved. Sorry for not getting to this sooner. 

Gully

Original comment by GullyBu...@gmail.com on 8 May 2013 at 5:36

GoogleCodeExporter commented 8 years ago
The latest version is available online. I do not know what the issue is here, 
since we could not reproduce it, and the string edu.isi.bmkeg.pdf.model.ChunkBlock 
only occurs in two old example rule files. If you were using those rule files 
to run extraction, it would not have worked and could have caused the errors 
you described. 
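
To check whether a rule file still carries the old package name, its import lines can be scanned for the edu.isi.bmkeg.pdf. prefix. A minimal sketch follows; the class StaleRuleImports is hypothetical, and the replacement prefix edu.isi.bmkeg.lapdf. is an assumption based only on the ChunkBlock path reported in the jar, so other classes may have moved elsewhere.

```java
import java.util.ArrayList;
import java.util.List;

class StaleRuleImports {
    // Prefix seen in the failing class name reported in this thread.
    static final String OLD_PREFIX = "edu.isi.bmkeg.pdf.";
    // Assumed replacement, inferred from edu/isi/bmkeg/lapdf/model/ChunkBlock.class.
    static final String NEW_PREFIX = "edu.isi.bmkeg.lapdf.";

    /** Returns the trimmed import lines that still use the old prefix. */
    static List<String> findStaleImports(String drlText) {
        List<String> stale = new ArrayList<>();
        for (String line : drlText.split("\\R")) {
            String trimmed = line.trim();
            if (trimmed.startsWith("import ") && trimmed.contains(OLD_PREFIX)) {
                stale.add(trimmed);
            }
        }
        return stale;
    }

    /** Suggests a rewritten import line under the assumed new prefix. */
    static String suggestFix(String importLine) {
        return importLine.replace(OLD_PREFIX, NEW_PREFIX);
    }

    public static void main(String[] args) {
        // Hypothetical rule-file header, for illustration only.
        String sample = String.join("\n",
                "import edu.isi.bmkeg.pdf.model.ChunkBlock;",
                "import edu.isi.bmkeg.pdf.features.ChunkFeatures;",
                "import java.util.List;");
        for (String line : findStaleImports(sample)) {
            System.out.println(line + "  ->  " + suggestFix(line));
        }
    }
}
```

Any suggested rewrite should be verified against the classes actually present in the jar before editing the rule file.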

Is this still an issue?

Original comment by GullyBu...@gmail.com on 22 May 2013 at 2:29

GoogleCodeExporter commented 8 years ago
Thanks a lot. I can confirm that this is working for me now, with version 
1.7.2-SNAPSHOT. Indeed, I think I was using an old rule file.
Your help and support are very much appreciated.

Original comment by javb...@gmail.com on 22 May 2013 at 4:58

GoogleCodeExporter commented 8 years ago
Hello. I have a similar issue even though I am using
C:\Program Files (x86)\lapdftext\lapdftext-1.7.2-SNAPSHOT-jar-with-dependencies.jar

The Java code is as below:

    File pdf = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm11.pdf");
    File outXmlFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm1.xml");
    File ruleFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\dnm1.drl");
    //File ruleFile = new File("C:\\LayoutAwarePDFText\\lapdftext-master\\general.drl");

    LapdfEngine engine = new LapdfEngine();
    LapdfDocument lapdf = engine.blockifyPdfFile(pdf);

    engine.classifyDocument(lapdf, ruleFile);
    engine.writeSectionsToOpenAccessXmlFile(lapdf, outXmlFile);

I attached the files. The PDF file is the PLoS Biology article given as an 
example on the lapdftext site, and the drl file is epoch_7Jun_8.drl. 
Running my code produces the error below:

java.lang.ClassNotFoundException: Unable to find class 'ChunkBlock'
    at org.drools.base.ClassTypeResolver.resolveType(ClassTypeResolver.java:215)
    at org.drools.compiler.PackageBuilder.mergePackage(PackageBuilder.java:897)
    at org.drools.compiler.PackageBuilder.newPackage(PackageBuilder.java:871)
    at org.drools.compiler.PackageBuilder.addPackage(PackageBuilder.java:578)
    at org.drools.compiler.PackageBuilder.addPackageFromDrl(PackageBuilder.java:267)
    at org.drools.compiler.PackageBuilder.addKnowledgeResource(PackageBuilder.java:459)
    at org.drools.builder.impl.KnowledgeBuilderImpl.add(KnowledgeBuilderImpl.java:28)
    at edu.isi.bmkeg.lapdf.classification.ruleBased.RuleBasedChunkClassifier.<init>(RuleBasedChunkClassifier.java:83)
    at edu.isi.bmkeg.lapdf.controller.LapdfEngine.classifyDocument(LapdfEngine.java:334)
    at lapdftext.FirstTrial.<init>(FirstTrial.java:25)
    at lapdftext.FirstTrial.main(FirstTrial.java:45)
ERROR: edu.isi.bmkeg.lapdf.classification.ruleBased.RuleBasedChunkClassifier - 
GlobalError: chunkError importing : 
'edu.isi.bmkeg.pdf.features.ChunkFeatures'Error importing : 
'edu.isi.bmkeg.pdf.model.ChunkBlock'
INFO : edu.isi.bmkeg.lapdf.controller.LapdfEngine - Writing block-classified 
XML in OpenAccess format C:\LayoutAwarePDFText\lapdftext-master\dnm1.xml

What might be the problem?

Thanks in advance.

Kindest regards,
Mehmet

Original comment by malee1...@gmail.com on 5 Feb 2015 at 10:16
