allenai / pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.
http://pdffigures2.allenai.org/
Apache License 2.0

sbt assembly #56

Open ThymeSLH opened 1 year ago

ThymeSLH commented 1 year ago

When I run the sbt assembly command, I get the following errors:

[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:3:20: object common is not a member of package org.allenai
[error] import org.allenai.common.testkit.UnitSpec
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:9:37: not found: type UnitSpec
[error] class TestExtractionFilters extends UnitSpec {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:28:26: value should is not a member of String
[error] "Page boundary filter" should "filter out bad extractions" in {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:43:27: value should is not a member of String
[error] "Graphics split filter" should "filter out bad extractions" in {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:56:13: value should is not a member of String
[error] "Figures" should "all be extracted" in {
[error] ^
[error] 5 errors found
[error] (Test / compileIncremental) Compilation failed
[error] Total time: 0 s, completed Sep 26, 2023, 12:33:34 PM
sbt:pdffigures2>
2023-09-26 23:57:13,624 shutdown-hooks-run-all ERROR No Log4j 2 configuration file found. Using default configuration (logging only errors to the console), or user programmatically provided configurations. Set system property 'log4j2.debug' to show Log4j 2 internal initialization logging. See https://logging.apache.org/log4j/2.x/manual/configuration.html for instructions on how to configure Log4j 2
<Trial 17763026 worker_0> pdffigures2 $ sbt
[info] Loading settings for project pdffigures2-build from plugins.sbt ...
[info] Loading project definition from /opt/tiger/user/songlinghan/pdffigures2/project
[info] Loading settings for project root from build.sbt ...
[info] Set current project to pdffigures2 (in build file:/opt/tiger/user/songlinghan/pdffigures2/)
[info] sbt server started at local:///home/tiger/.sbt/1.0/server/07fa3a092e10c1aa3ac2/sock
sbt:pdffigures2> assembly
[warn] Multiple main classes detected. Run 'show discoveredMainClasses' to see the list
[info] Compiling 1 Scala source to /opt/tiger/user/songlinghan/pdffigures2/target/scala-2.12/test-classes ...
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:3:20: object common is not a member of package org.allenai
[error] import org.allenai.common.testkit.UnitSpec
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:9:37: not found: type UnitSpec
[error] class TestExtractionFilters extends UnitSpec {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:28:26: value should is not a member of String
[error] "Page boundary filter" should "filter out bad extractions" in {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:35:20: value === is not a member of Iterable[org.allenai.pdffigures2.Figure]
[error] assert(figures === Nil)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:43:27: value should is not a member of String
[error] "Graphics split filter" should "filter out bad extractions" in {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:50:20: value === is not a member of Iterable[org.allenai.pdffigures2.Figure]
[error] assert(figures === Nil)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:56:13: value should is not a member of String
[error] "Figures" should "all be extracted" in {
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:63:27: value === is not a member of Int
[error] assert(figures.length === 2)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:64:31: value === is not a member of org.allenai.pdffigures2.FigureType.FigureType
[error] assert(figures(0).figType === FigureType.Table)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:65:28: value === is not a member of String
[error] assert(figures(0).name === "1")
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:66:28: value === is not a member of Int
[error] assert(figures(0).page === 4)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:68:26: value === is not a member of String
[error] figures(0).caption === "Table 1: Over a set of ten relations, TEXTRUNNER achieved a 33% lower error rate than KNOWITALL, while finding approximately as many correct extractions."
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:70:31: value === is not a member of org.allenai.pdffigures2.FigureType.FigureType
[error] assert(figures(1).figType === FigureType.Figure)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:71:28: value === is not a member of String
[error] assert(figures(1).name === "1")
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:72:28: value === is not a member of Int
[error] assert(figures(1).page === 4)
[error] ^
[error] /opt/tiger/user/songlinghan/pdffigures2/src/test/scala/org/allenai/pdffigures2/TestExtractionFilters.scala:74:26: value === is not a member of String
[error] figures(1).caption === "Figure 1: Overview of the tuples extracted from 9 million Web page corpus. 7.8 million well-formed tuples are found having probability ≥ 0.8. Of those, TEXTRUNNER finds 1 million concrete tuples with arguments grounded in particular real-world entities, 88.1% of which are correct, and 6.8 million tuples reflecting abstract assertions, 79.2% of which are correct."
[error] ^
[error] 16 errors found
[error] (Test / compileIncremental) Compilation failed

Is it because of the sbt version?

GabrieleGioetto commented 1 year ago

I had the same errors; using this pull request solved the issue for me.

OrianeN commented 12 months ago

I also had similar errors when running sbt assembly (probably not exactly the same ones, but they also looked like syntax errors), and the PR mentioned by @GabrieleGioetto worked perfectly (git clone the fork, then run sbt assembly).

Thank you @GabrieleGioetto for linking the PR, and I hope it gets merged soon!
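
For anyone who cannot use the fork: every error in the log above comes from the test sources (src/test/.../TestExtractionFilters.scala) and starts with the unresolved import of org.allenai.common.testkit.UnitSpec. A possible workaround, sketched here as an assumption and not as what the linked PR does, is to tell sbt-assembly not to run the tests when building the jar. The fragment below assumes the project's sbt-assembly version still runs tests as part of assembly (the log shows test classes being compiled during assembly, which suggests it does) and that the sbt version accepts slash syntax.

```scala
// build.sbt (root project) -- hypothetical addition, not part of the upstream build.
// Override the test task in the scope of `assembly` with a no-op, so that
// `sbt assembly` no longer compiles and runs src/test before packaging.
assembly / test := {}
```

The same setting can be tried for a single session without editing the build by running sbt "set assembly / test := {}" assembly. This only sidesteps the failing test compilation; the tests themselves stay broken until the missing UnitSpec dependency is restored or replaced.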