dalab / web2text

Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18
MIT License
Exception in thread "main" java.util.NoSuchElementException: None.get #11

Closed lukaspistelak closed 3 years ago

lukaspistelak commented 4 years ago

I get this error in Scala when I try to run:

https://www.americaninno.com/boston/funding-boston/vecna-robotics-brings-in-50-million-in-series-b/

ml@ml-System-Product-Name:/media/data/AuthorExtractor$ python3 src/main/python/author_extractor.py --predict_from_html LOCAL-DIR/www.americaninno.com/boston/funding-boston/vecna-robotics-brings-in-50-million-in-series-b/index.html 
2020-02-17 13:45:22.617549: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer.so.6'; dlerror: libnvinfer.so.6: cannot open shared object file: No such file or directory
2020-02-17 13:45:22.617627: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer_plugin.so.6'; dlerror: libnvinfer_plugin.so.6: cannot open shared object file: No such file or directory
2020-02-17 13:45:22.617640: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:30] Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
[info] Loading project definition from /media/data/AuthorExtractor/project
[info] Loading settings for project authorextractor from build.sbt ...
[info] Set current project to AuthorExtractor (in build file:/media/data/AuthorExtractor/)
[warn] Multiple main classes detected.  Run 'show discoveredMainClasses' to see the list
[info] running (fork) ch.ethz.dalab.web2text.ExtractPageFeatures LOCAL-DIR/www.americaninno.com/boston/funding-boston/vecna-robotics-brings-in-50-million-in-series-b/index.html /media/data/AuthorExtractor/public/inference/test
[error] Exception in thread "main" java.util.NoSuchElementException: None.get
[error]     at scala.None$.get(Option.scala:347)
[error]     at scala.None$.get(Option.scala:345)
[error]     at ch.ethz.dalab.web2text.cdom.CDOM$.fromBody(CDOM.scala:142)
[error]     at ch.ethz.dalab.web2text.cdom.CDOM$.fromHTML(CDOM.scala:57)
[error]     at ch.ethz.dalab.web2text.cdom.CDOM$.apply(CDOM.scala:60)
[error]     at ch.ethz.dalab.web2text.ExtractPageFeatures$.extractPageFeatures(ExtractPageFeatures.scala:55)
[error]     at ch.ethz.dalab.web2text.ExtractPageFeatures$.main(ExtractPageFeatures.scala:41)
[error]     at ch.ethz.dalab.web2text.ExtractPageFeatures.main(ExtractPageFeatures.scala)
[error] Nonzero exit code returned from runner: 1
[error] (Compile / runMain) Nonzero exit code returned from runner: 1
[error] Total time: 2 s, completed Feb 17, 2020 1:45:29 PM

tvogels commented 3 years ago

Looks like this pull request was a call for help rather than something intended to merge. Sorry for not responding in February. Hope you figured it out. I'll close this now.
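
For anyone landing here with the same trace: `java.util.NoSuchElementException: None.get` is what Scala throws when `.get` is called on an empty `Option`. The frames directly below the exception show where that happened in the project code, here `CDOM$.fromBody` at `CDOM.scala:142`. Why the value was empty for this particular page is not established in this thread. The following is only a minimal sketch for pulling the first project frame out of a long sbt log; `parse_frames` and `first_project_frame` are hypothetical helper names, not part of web2text.

```python
import re

# Matches JVM stack frames as printed by sbt, for example:
#   [error]     at scala.None$.get(Option.scala:347)
FRAME_RE = re.compile(r"at\s+([\w.$]+)\(([\w.]+):(\d+)\)")


def parse_frames(log_text):
    """Return (qualified_method, source_file, line) tuples in trace order."""
    return [
        (m.group(1), m.group(2), int(m.group(3)))
        for m in FRAME_RE.finditer(log_text)
    ]


def first_project_frame(frames, library_prefixes=("scala.", "java.")):
    """Return the first frame outside the given library packages, or None.

    The default prefixes are an assumption: frames in the Scala or Java
    standard libraries are skipped so the first application frame surfaces.
    """
    for frame in frames:
        if not frame[0].startswith(library_prefixes):
            return frame
    return None


# Excerpt of the trace from the report above.
trace = """\
[error] Exception in thread "main" java.util.NoSuchElementException: None.get
[error]     at scala.None$.get(Option.scala:347)
[error]     at scala.None$.get(Option.scala:345)
[error]     at ch.ethz.dalab.web2text.cdom.CDOM$.fromBody(CDOM.scala:142)
[error]     at ch.ethz.dalab.web2text.cdom.CDOM$.fromHTML(CDOM.scala:57)
"""

frames = parse_frames(trace)
culprit = first_project_frame(frames)
print(culprit)
```

The frame it reports is the place to start reading: whatever expression at that line produces an `Option` came back empty for this input file.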