kermitt2 / grobid

A machine learning software for extracting information from scholarly documents
https://grobid.readthedocs.io
Apache License 2.0
3.43k stars 443 forks

Could the java documentation and the process of embedding grobid into Java project be updated? #577

Closed lucaspada894 closed 4 years ago

lucaspada894 commented 4 years ago

Following the instructions on the grobid site, I cannot embed grobid into my Java project because the instructions for Gradle and Maven are poor. Also, I do not know how to use the APIs because the Java documentation lists different parameters for the methods. Specifically, fullTextToTei takes different arguments than what is shown in the Java docs.

lucaspada894 commented 4 years ago

Honestly, the entire process of using the API is outdated on the site.

kermitt2 commented 4 years ago

Hello @lucaspada894 !

We've just made a new release, so all the documentation and the grobid modules/demos in the other repo have not been tested and updated yet, it will take a few days/weeks... sorry (this is a side project for us, everything is slow slow pace here :).

In the meantime, you can look at the class grobid/grobid-core/src/main/java/org/grobid/core/engines/Engine.java, which is basically the Java API to use.

lucaspada894 commented 4 years ago

For some reason the function fullTextToTei hangs indefinitely for me. I am currently using your API to build a pipeline that converts coronavirus PDFs into a structure that my university needs.

lfoppiano commented 4 years ago

@lucaspada894 could you share the snippet / code you're using?

lucaspada894 commented 4 years ago

I call the function from the Engine instance created by GrobidFactory.getInstance().createEngine()

lucaspada894 commented 4 years ago

I will send it to you later today. I haven't slept and am exhausted.

lucaspada894 commented 4 years ago

What is your timeline?

lfoppiano commented 4 years ago

No worries, anytime is good. I'll try to answer quickly but allow some time ;-)

lfoppiano commented 4 years ago

Meanwhile, have you looked at the grobid-example sample project? https://github.com/kermitt2/grobid-example

kermitt2 commented 4 years ago

@lucaspada894 maybe try this https://github.com/kermitt2/article-dataset-builder and you will get a very high rate of OA PDFs processed by Grobid without pain. For instance, for the same article list, you will get more fully structured full texts than the official CORD-19 dataset, and significantly better/richer structuring with the latest version of Grobid (CORD-19 also relies on Grobid conversion, but not the latest version of Grobid from what I have seen).

From my experience, when it comes to pipelines for scientific articles, using web services is much more convenient.

lucaspada894 commented 4 years ago

try {
    String pGrobidHome = "lib/grobid/grobid-home";
    String pGrobidProperties = "D:\\Projects\\pdf_to_boac_schema\\lib\\grobid\\grobid-home\\config\\grobid.properties";

    // The GrobidHomeFinder can be instantiated without parameters to verify the grobid home
    // in the standard locations (classpath, ../grobid-home, ../../grobid-home).
    // If the location is customised:
    GrobidHomeFinder grobidHomeFinder = new GrobidHomeFinder(Arrays.asList(pGrobidHome));
    GrobidProperties.getInstance(grobidHomeFinder);

    // The GrobidProperties needs to be instantiated using the correct grobidHomeFinder,
    // or it will use the default locations.
    System.out.println(">>>>>>>> GROBID_HOME=" + GrobidProperties.get_GROBID_HOME_PATH());

    Engine engine = GrobidFactory.getInstance().createEngine();

    // Biblio object for the result
    BiblioItem resHeader = new BiblioItem();
    String tei = engine.processHeader("lib/PDF_FILES/2020.03.23.004580v1.full.pdf", 1, resHeader);
} catch (Exception e) {
    // The pasted snippet had no catch block; print the error so the try compiles.
    e.printStackTrace();
}

lucaspada894 commented 4 years ago

This is an attempt at testing it out with the same code provided in the example project.

lucaspada894 commented 4 years ago

I also do not know how to use the GrobidAnalysisConfig class and set it up for the Engine.

lucaspada894 commented 4 years ago

Sorry to be bothering you again, guys, but I need a solution urgently. I need grobid to be packaged into my project. This means I cannot simply open a terminal and start a Grobid service and then run my application afterwards. I need it to be done seamlessly. Without understanding the APIs I cannot do this.
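One possible way around the manual terminal step, sketched under assumptions rather than taken from the GROBID documentation: the application launches the service itself as a child process and waits until GET /api/isalive (an endpoint listed in the server log further down, on port 8070) answers with 200. The start command is hypothetical and is passed in by the caller; the class and method names are illustrative only.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.function.BooleanSupplier;

public class EmbeddedServiceSketch {

    // Assumption: the service listens on localhost:8070, as in the server log.
    static final String ALIVE_URL = "http://localhost:8070/api/isalive";

    /** Returns true if GET /api/isalive answers with HTTP 200. */
    static boolean isAlive() {
        try {
            HttpURLConnection conn = (HttpURLConnection) new URL(ALIVE_URL).openConnection();
            conn.setConnectTimeout(1000);
            conn.setReadTimeout(1000);
            return conn.getResponseCode() == 200;
        } catch (IOException e) {
            return false; // not reachable yet
        }
    }

    /** Polls the probe until it returns true or the attempts are exhausted. */
    static boolean waitUntil(BooleanSupplier probe, int attempts, long delayMillis) {
        for (int i = 0; i < attempts; i++) {
            if (probe.getAsBoolean()) {
                return true;
            }
            try {
                Thread.sleep(delayMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return false;
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 0) {
            System.out.println("usage: EmbeddedServiceSketch <command that starts the service>");
            return;
        }
        // Hypothetical: the start command comes from the caller, not from GROBID's docs.
        Process service = new ProcessBuilder(args).inheritIO().start();
        // Stop the child process when the application exits.
        Runtime.getRuntime().addShutdownHook(new Thread(service::destroy));
        if (waitUntil(EmbeddedServiceSketch::isAlive, 60, 1000)) {
            System.out.println("Service is up; the pipeline can start.");
        } else {
            System.out.println("Service did not come up; giving up.");
            service.destroy();
        }
    }
}
```

The probe is injected into waitUntil so the waiting logic can be exercised without a running service.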

kermitt2 commented 4 years ago

The simplest and most efficient way to integrate grobid in an application is to use the service: it provides multithreading, robustness, good documentation, docker, etc. There's a Java client here.

Using the Java API would be justified, I think, only if you need some low-level data structures and functionalities, but for processing usual scientific articles this should not be necessary. You would then need to understand the API, implement your own parallelization, and so on. This is a very big effort for something that already exists elsewhere.

Having said that, if you stay on the Java API integration, everything is now updated, including the javadoc. Can you run the grobid-example sample project? If not, could you provide some error trace and info about your environment and JDK version?
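For readers who want to see what the service route looks like without the dedicated client, here is a minimal sketch using only the JDK. It posts one PDF to /api/processFulltextDocument (the path and port 8070 appear in the server log later in this thread) as a multipart form with the file in a field named "input", which is how the GROBID service documentation describes the request as far as I know. The class and method names are made up for illustration, and this is not the code of the official Java client.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class GrobidServiceSketch {

    // Assumption: a local service on the application port shown in the server log.
    static final String ENDPOINT = "http://localhost:8070/api/processFulltextDocument";

    /** Builds a multipart/form-data body with a single file part named "input". */
    static byte[] multipartBody(String boundary, String fileName, byte[] pdf) {
        byte[] head = ("--" + boundary + "\r\n"
                + "Content-Disposition: form-data; name=\"input\"; filename=\"" + fileName + "\"\r\n"
                + "Content-Type: application/pdf\r\n\r\n").getBytes(StandardCharsets.UTF_8);
        byte[] tail = ("\r\n--" + boundary + "--\r\n").getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(head, 0, head.length);
        out.write(pdf, 0, pdf.length);
        out.write(tail, 0, tail.length);
        return out.toByteArray();
    }

    /** Posts one PDF and returns the response body; throws on any non-200 status. */
    static String processFulltext(Path pdf) throws IOException {
        String boundary = "----grobid-sketch-" + System.nanoTime();
        byte[] body = multipartBody(boundary, pdf.getFileName().toString(), Files.readAllBytes(pdf));

        HttpURLConnection conn = (HttpURLConnection) new URL(ENDPOINT).openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true);
        conn.setRequestProperty("Content-Type", "multipart/form-data; boundary=" + boundary);
        try (OutputStream os = conn.getOutputStream()) {
            os.write(body);
        }
        int status = conn.getResponseCode();
        if (status != 200) {
            throw new IOException("Service returned HTTP " + status + " for " + pdf);
        }
        try (InputStream is = conn.getInputStream()) {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            byte[] chunk = new byte[8192];
            int n;
            while ((n = is.read(chunk)) != -1) {
                buf.write(chunk, 0, n);
            }
            return new String(buf.toByteArray(), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 0) {
            System.out.println("usage: GrobidServiceSketch <file.pdf>");
            return;
        }
        System.out.println(processFulltext(Paths.get(args[0])));
    }
}
```

This handles one file at a time; parallel requests, timeouts and retries are left out to keep the sketch short.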

lfoppiano commented 4 years ago

@lucaspada894 did you solve your problem, or do you still need help?

lucaspada894 commented 4 years ago

Hello,

I went on and used it as a service. However, as of yesterday, I ran into a new issue. The Grobid service sometimes crashes when it deals with certain table formats or corrupted PDFs. I am trying to process 44,000 files and would like to know if there is a way for Grobid to simply skip a file that results in an error. I'm at the very last step of the process and this came up as a big problem.

Any help would be greatly appreciated,

Gian Spadafora

kermitt2 commented 4 years ago

@lucaspada894 In principle the server does not crash in any of these kinds of cases (it ran over 12M PDFs without any crash, and 44K is not a lot at all), but it could come from a recent update, or from overloading the server by not waiting after receiving a 503 response. Are you using Windows? How much memory do you have, and how many concurrent queries are you sending? Which version of GROBID are you using? Can you provide the server logs for the crashes? Optionally, if you have a shareable problematic PDF, it can help.
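The two points raised here, waiting after a 503 and skipping a file that keeps failing, can be sketched on the client side roughly as follows. This is an illustration under assumptions, not how the official Java client is implemented: the `call` function is a hypothetical stand-in for whatever sends one file to the service and returns the HTTP status, and the retry count and wait time are arbitrary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.ToIntFunction;

public class SkipAndRetrySketch {

    /** Outcome of a batch: files that succeeded and files that were skipped. */
    static final class Result {
        final List<String> done = new ArrayList<>();
        final List<String> skipped = new ArrayList<>();
    }

    /**
     * Sends each file through `call`, which returns an HTTP status code.
     * 200 means done; 503 means wait and retry, up to maxRetries times;
     * any other status or a runtime exception means skip the file and move on.
     */
    static Result processAll(List<String> files, ToIntFunction<String> call,
                             int maxRetries, long waitMillis) {
        Result result = new Result();
        for (String file : files) {
            boolean ok = false;
            for (int attempt = 0; attempt <= maxRetries && !ok; attempt++) {
                int status;
                try {
                    status = call.applyAsInt(file);
                } catch (RuntimeException e) {
                    break; // client-side failure: skip this file
                }
                if (status == 200) {
                    ok = true;
                } else if (status == 503 && attempt < maxRetries) {
                    sleep(waitMillis * (attempt + 1)); // server busy: back off, then retry
                } else {
                    break; // other error, or retries exhausted: skip this file
                }
            }
            (ok ? result.done : result.skipped).add(file);
        }
        return result;
    }

    private static void sleep(long millis) {
        try {
            Thread.sleep(millis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) {
        // Fake service for illustration: busy.pdf answers 503 once, bad.pdf always 500.
        int[] busyCalls = {0};
        ToIntFunction<String> fake = f -> {
            if (f.equals("bad.pdf")) return 500;
            if (f.equals("busy.pdf") && busyCalls[0]++ == 0) return 503;
            return 200;
        };
        Result r = processAll(Arrays.asList("a.pdf", "busy.pdf", "bad.pdf"), fake, 3, 10);
        System.out.println("done=" + r.done + " skipped=" + r.skipped);
        // prints: done=[a.pdf, busy.pdf] skipped=[bad.pdf]
    }
}
```

Keeping the list of skipped files makes it easy to share the problematic PDFs afterwards.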

lucaspada894 commented 4 years ago

Yes, I will send you the logs and PDFs in a few minutes.

lucaspada894 commented 4 years ago

Sorry for the delay,

I have included one of the error logs. The PDFs I included resulted in the same error logs. I am calling the grobid service using the Java client you provide to make best use of concurrency and as recommended on the grobid website.

INFO  [2020-05-21 11:00:18,400] org.eclipse.jetty.setuid.SetUIDListener: Opened application@28e0e464{HTTP/1.1,[http/1.1]}{0.0.0.0:8070}
INFO  [2020-05-21 11:00:18,400] org.eclipse.jetty.setuid.SetUIDListener: Opened admin@1d98daa0{HTTP/1.1,[http/1.1]}{0.0.0.0:8071}
INFO  [2020-05-21 11:00:18,402] org.eclipse.jetty.server.Server: jetty-9.4.18.v20190429; built: 2019-04-29T20:42:08.989Z; git: e1bc35120a6617ee3df052294e433f3a25ce7097; jvm 1.8.0_202-release-1483-b03
WARN  [2020-05-21 11:00:18,576] org.glassfish.jersey.internal.inject.Providers: A provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility registered in SERVER runtime does not implement any provider interfaces applicable in the SERVER runtime. Due to constraint configuration problems the provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility will be ignored. 
INFO  [2020-05-21 11:00:18,886] io.dropwizard.jersey.DropwizardResourceConfig: The following paths were found for the configured resources:
POST    /api/annotatePDF (org.grobid.service.GrobidRestService)
POST    /api/citationPatentAnnotations (org.grobid.service.GrobidRestService)
GET     /api/grobid (org.grobid.service.GrobidRestService)
GET     /api/health (org.grobid.service.resources.HealthResource)
GET     /api/isalive (org.grobid.service.GrobidRestService)
POST    /api/processAffiliations (org.grobid.service.GrobidRestService)
PUT     /api/processAffiliations (org.grobid.service.GrobidRestService)
POST    /api/processCitation (org.grobid.service.GrobidRestService)
PUT     /api/processCitation (org.grobid.service.GrobidRestService)
POST    /api/processCitationNames (org.grobid.service.GrobidRestService)
PUT     /api/processCitationNames (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentPDF (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentST36 (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentTXT (org.grobid.service.GrobidRestService)
POST    /api/processDate (org.grobid.service.GrobidRestService)
PUT     /api/processDate (org.grobid.service.GrobidRestService)
POST    /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
POST    /api/processFulltextDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderDocument (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderNames (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderNames (org.grobid.service.GrobidRestService)
POST    /api/processReferences (org.grobid.service.GrobidRestService)
PUT     /api/processReferences (org.grobid.service.GrobidRestService)
POST    /api/referenceAnnotations (org.grobid.service.GrobidRestService)
GET     /api/version (org.grobid.service.GrobidRestService)

INFO [2020-05-21 11:00:18,887] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@42d9e8d2{/,null,AVAILABLE}
INFO [2020-05-21 11:00:18,890] io.dropwizard.setup.AdminEnvironment: tasks =

POST    /tasks/log-level (io.dropwizard.servlets.tasks.LogConfigurationTask)
POST    /tasks/gc (io.dropwizard.servlets.tasks.GarbageCollectionTask)

INFO [2020-05-21 11:00:18,894] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@4d69d288{/,null,AVAILABLE}
INFO [2020-05-21 11:00:18,907] org.eclipse.jetty.server.AbstractConnector: Started application@28e0e464{HTTP/1.1,[http/1.1]}{0.0.0.0:8070}
INFO [2020-05-21 11:00:18,910] org.eclipse.jetty.server.AbstractConnector: Started admin@1d98daa0{HTTP/1.1,[http/1.1]}{0.0.0.0:8071}
INFO [2020-05-21 11:00:18,910] org.eclipse.jetty.server.Server: Started @2290ms
127.0.0.1 - - [21/May/2020:11:00:19 +0000] "GET /api/isalive HTTP/1.1" 200 4 "-" "Java/13.0.1" 70
21 May 2020 06:00.19 [INFO ] GrobidService - Cheking Grobid service...
21 May 2020 06:00.19 [INFO ] GrobidService - Grobid service is ok and can be used.
127.0.0.1 - - [21/May/2020:11:00:19 +0000] "GET /api/isalive HTTP/1.1" 200 4 "-" "Java/1.8.0_241" 1
27993 PDF files
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10023767.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10028170.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10024427.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10063534.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10037086.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10073413.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10073719.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10074948.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10074934.pdf
21 May 2020 06:00.19 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10076517.pdf
wait for thread completion
INFO [2020-05-21 11:00:19,765] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\affiliation-address\model.wapiti (size: 2699936)
INFO [2020-05-21 11:00:19,981] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\header\model.wapiti (size: 2225578)
INFO [2020-05-21 11:00:20,163] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\citation\model.wapiti (size: 440148)
INFO [2020-05-21 11:00:20,200] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\header\model.wapiti (size: 36094028)
INFO [2020-05-21 11:00:23,991] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\date\model.wapiti (size: 102435)
INFO [2020-05-21 11:00:24,000] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\citation\model.wapiti (size: 16412787)
INFO [2020-05-21 11:00:25,671] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\fulltext\model.wapiti (size: 22836546)
INFO [2020-05-21 11:00:27,864] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\segmentation\model.wapiti (size: 17807323)
INFO [2020-05-21 11:00:29,677] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\reference-segmenter\model.wapiti (size: 4921245)
INFO [2020-05-21 11:00:30,104] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\figure\model.wapiti (size: 422671)
INFO [2020-05-21 11:00:30,136] org.grobid.core.jni.WapitiModel: Loading model: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\table\model.wapiti (size: 1202011)
INFO [2020-05-21 11:00:30,234] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10
INFO [2020-05-21 11:00:30,234] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 2/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 3/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 4/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 5/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 6/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 7/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 8/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 9/10
INFO [2020-05-21 11:00:30,235] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
WARN [2020-05-21 11:00:31,432] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,433] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,435] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,436] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,438] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,440] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
WARN [2020-05-21 11:00:31,441] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\language-detection\cybozu\profiles
21 May 2020 06:01.04 [INFO ] GrobidWorker - ..\PDF_FILES\10074934.pdf processed.
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-8 End. :45152 ms
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10084516.pdf
127.0.0.1 - - [21/May/2020:11:01:04 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 28282 "-" "Java/1.8.0_241" 45129
INFO [2020-05-21 11:01:04,581] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
21 May 2020 06:01.04 [INFO ] GrobidWorker - ..\PDF_FILES\10063534.pdf processed.
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-5 End. :45527 ms
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10192046.pdf
21 May 2020 06:01.04 [INFO ] GrobidWorker - ..\PDF_FILES\10076517.pdf processed.
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-10 End. :45582 ms
21 May 2020 06:01.04 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10204695.pdf
127.0.0.1 - - [21/May/2020:11:01:04 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 53538 "-" "Java/1.8.0_241" 45191
INFO [2020-05-21 11:01:04,934] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
127.0.0.1 - - [21/May/2020:11:01:04 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 57660 "-" "Java/1.8.0_241" 45570
21 May 2020 06:01.05 [INFO ] GrobidWorker - ..\PDF_FILES\10073413.pdf processed.
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-6 End. :45669 ms
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10208228.pdf
INFO [2020-05-21 11:01:04,992] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
WARN [2020-05-21 11:01:05,010] org.grobid.core.utilities.matching.ReferenceMarkerMatcher: Cannot parse citation reference range: [(, Figs, ., , 4, -, 6, )]
127.0.0.1 - - [21/May/2020:11:01:05 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 53628 "-" "Java/1.8.0_241" 45335
21 May 2020 06:01.05 [INFO ] GrobidWorker - ..\PDF_FILES\10024427.pdf processed.
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-2 End. :45721 ms
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10229718.pdf
INFO [2020-05-21 11:01:05,082] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
127.0.0.1 - - [21/May/2020:11:01:05 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 42901 "-" "Java/1.8.0_241" 45389
21 May 2020 06:01.05 [INFO ] GrobidWorker - ..\PDF_FILES\10073719.pdf processed.
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-7 End. :45849 ms
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10233719.pdf
INFO [2020-05-21 11:01:05,168] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
127.0.0.1 - - [21/May/2020:11:01:05 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 61840 "-" "Java/1.8.0_241" 45832
INFO [2020-05-21 11:01:05,255] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
21 May 2020 06:01.05 [INFO ] GrobidWorker - ..\PDF_FILES\10028170.pdf processed.
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-3 End. :46466 ms
21 May 2020 06:01.05 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\103154.pdf
127.0.0.1 - - [21/May/2020:11:01:05 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 77987 "-" "Java/1.8.0_241" 46128
INFO [2020-05-21 11:01:05,881] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
21 May 2020 06:01.06 [INFO ] GrobidWorker - ..\PDF_FILES\103154.pdf processed.
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-3 End. :261 ms
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10329144.pdf
WARN [2020-05-21 11:01:06,082] org.grobid.core.lang.impl.CybozuLanguageDetector: Cannot detect language because of: com.cybozu.labs.langdetect.LangDetectException: no features in text
WARN [2020-05-21 11:01:06,120] org.grobid.core.lang.impl.CybozuLanguageDetector: Cannot detect language because of: com.cybozu.labs.langdetect.LangDetectException: no features in text
127.0.0.1 - - [21/May/2020:11:01:06 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 2035 "-" "Java/1.8.0_241" 259
INFO [2020-05-21 11:01:06,144] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10
21 May 2020 06:01.06 [INFO ] GrobidWorker - ..\PDF_FILES\10037086.pdf processed.
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-4 End. :46983 ms
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10329145.pdf
21 May 2020 06:01.06 [INFO ] GrobidWorker - ..\PDF_FILES\10074948.pdf processed.
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-9 End. :47017 ms
21 May 2020 06:01.06 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10348097.pdf

127.0.0.1 - - [21/May/2020:11:01:06 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 92395 "-" "Java/1.8.0_241" 46649 INFO [2020-05-21 11:01:06,395] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 127.0.0.1 - - [21/May/2020:11:01:06 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 93905 "-" "Java/1.8.0_241" 46681 INFO [2020-05-21 11:01:06,421] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\affiliation-address\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\affiliation-address\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\header\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\header\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\citation\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\name\citation\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\header\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\header\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\date\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\date\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\citation\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\citation\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\fulltext\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\fulltext\model.wapiti [Wapiti] Loading model: 
"D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\segmentation\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\segmentation\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\reference-segmenter\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\reference-segmenter\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\figure\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\figure\model.wapiti [Wapiti] Loading model: "D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\table\model.wapiti" Model path: D:\Projects\pdf_to_boac_schema\lib\grobid\grobid-home\models\table\model.wapiti

***ERROR PART** error: Cannot open input data buffer: Smith smith S Sm Smi Smit h th ith mith LINESTART INITCAP NODIGIT 0 1 1 1 1 1 0 0 0 0 0 0 0 0 NOPUNCT 0 0 AC ac A AC AC AC C AC AC AC LINEIN ALLCAP NODIGIT 0 0 0 0 0 0 0 0 0 0 0 0 0 0 NOPUNCT 0 0 , , , , , , , , , , LINEIN ALLCAP NODIGIT 1 0 0 0 0 0 0 0 0 0 0 0 0 0 COMMA 0 0 Boyd boyd B Bo Boy Boyd d yd oyd Boyd LINEIN INITCAP NODIGIT 0 1 0 1 1 1 0 0 0 0 0 0 0 0 NOPUNCT 0 0 MR mr M MR MR MR R MR MR MR LINEIN ALLCAP NODIGIT 0 0 0 0 0 0 0 0 0 0 0 0 0 0 NOPUNCT 0 0 . . . . . . . . . . LINEIN ALLCAP NODIGIT 1 0 0 0 0 0 0 0 0 0 0 0 0 0 DOT 1 0 Preferential preferential P Pr Pre Pref l al ial tial LINEIN INITCAP NODIGIT 0 0 1 0 0 0 0 0 0 0 0 0 0 0 NOPUNCT 1 0 effects effects e ef eff effe s ts cts ects LINEIN NOCAPS NODIGIT 0 0 1 0 0 0 0 0 0 0 0 0 0 0 NOPUNCT 1 0 of of o of of of f of of of LINEIN NOCAPS NODIGIT 0 0 1 0 0 1 0 0 0 0 0 0 0 0 NOPUNCT 1 0 1 1 1 1 1 1 1 1 1 1 LINEIN NOCAPS ALLDIGIT 1 0 0 0 0 1 0 0 0 0 0 0 0 0 NOPUNCT 2 0 , , , , , , , , , , LINEIN ALLCAP NODIGIT 1 0 0 0 0 0 0 0 0 0 0 0 0 0 COMMA 2 0 3 3 3 3 3 3 3 3 3 3 LINEIN NOCAPS ALLDIGIT 1 0 0 0 0 0 0 0 0 0 0 0 0 0 NOPUNCT 2 0

FAILURE: Build failed with an exception.

lfoppiano commented 4 years ago

OK, so you're using Windows. As you can see from all the unresolved issues about Windows, our recommendation would be to use a virtual machine with Linux or to run GROBID on Docker (https://grobid.readthedocs.io/en/latest/Grobid-docker/).
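
Before sending a batch to a GROBID service running in a container, a client can first check that the service is reachable. The following is a minimal Java sketch, not part of GROBID itself. It assumes the service is exposed at `http://localhost:8070` (adjust this to whatever host port your container maps) and relies on the `GET /api/isalive` route of the GROBID REST service. The class name `GrobidAliveCheck` and its helper methods are hypothetical names chosen for illustration.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;

// Hypothetical helper, not a GROBID class.
final class GrobidAliveCheck {

    /** Joins a base URL and an endpoint path with exactly one slash between them. */
    static String endpoint(String baseUrl, String path) {
        String base = baseUrl.endsWith("/") ? baseUrl.substring(0, baseUrl.length() - 1) : baseUrl;
        String rel = path.startsWith("/") ? path : "/" + path;
        return base + rel;
    }

    /** Returns true only when GET {baseUrl}/api/isalive answers HTTP 200 within the timeout. */
    static boolean isAlive(String baseUrl, int timeoutMillis) {
        HttpURLConnection conn = null;
        try {
            conn = (HttpURLConnection) URI.create(endpoint(baseUrl, "/api/isalive"))
                    .toURL().openConnection();
            conn.setRequestMethod("GET");
            conn.setConnectTimeout(timeoutMillis);
            conn.setReadTimeout(timeoutMillis);
            return conn.getResponseCode() == HttpURLConnection.HTTP_OK;
        } catch (IOException | IllegalArgumentException e) {
            // Connection refused, timeout, or malformed URL: treat as "not alive".
            return false;
        } finally {
            if (conn != null) {
                conn.disconnect();
            }
        }
    }

    public static void main(String[] args) {
        // Assumed default; pass another base URL as the first argument if needed.
        String baseUrl = args.length > 0 ? args[0] : "http://localhost:8070";
        System.out.println(baseUrl + " alive: " + isAlive(baseUrl, 5000));
    }
}
```

Running the check once before starting worker threads separates "the service is not reachable at this port" from errors that only appear under load.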

lucaspada894 commented 4 years ago

macOS should not be a problem, right?

lfoppiano commented 4 years ago

macOS should not be a problem, right?

No, it should not; if you are developing, it's fine. It's what I'm using for development anyway.

However, certain components behave slightly differently (though not to the same degree as on Windows), so for batch processing and production environments, the main architecture is Linux.

lucaspada894 commented 4 years ago

I will run it through Docker and let you know if the problem persists. Thank you very much! And sorry for the inconvenience!

lucaspada894 commented 4 years ago

Hello again. I have run it through Docker. I am getting a new error log which is attached to this email. I have set grobid-client.properties to execute on port 8080 since I have changed to the Docker build. In particular, I seem to get a 'connection refused' midway through processing. Initially, the PDFs are processed just fine.
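
In the log that follows, the service reports `Number of Engines in pool active/max: 10/10` and then answers several `POST /api/processFulltextDocument` requests with HTTP 503 (`Could not get an engine from the pool within configured time`). One way a client can cope with this is to treat 503 as "busy, try again later" and resend the request after a growing delay. The Java sketch below illustrates that idea under assumptions: `GrobidRetry`, `backoffMillis` and `callWithRetry` are hypothetical names, the request is abstracted as an `IntSupplier` that returns the HTTP status code, and the delays are arbitrary example values rather than GROBID recommendations. Whether this resolves the 'connection refused' error is not established here.

```java
import java.util.function.IntSupplier;

// Hypothetical helper, not a GROBID class.
final class GrobidRetry {

    /** Delay before the next attempt: baseMillis * 2^attempt, capped at maxMillis. */
    static long backoffMillis(int attempt, long baseMillis, long maxMillis) {
        long delay = baseMillis << Math.min(attempt, 20); // keep the shift small
        return Math.min(delay, maxMillis);
    }

    /**
     * Invokes the request until it returns a status other than 503 or
     * maxAttempts calls have been made. Returns the last status seen.
     */
    static int callWithRetry(IntSupplier request, int maxAttempts, long baseMillis, long maxMillis) {
        int status = 503;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            status = request.getAsInt();
            if (status != 503) {
                return status; // success or a non-retryable error
            }
            if (attempt == maxAttempts - 1) {
                break; // no point sleeping after the final attempt
            }
            try {
                Thread.sleep(backoffMillis(attempt, baseMillis, maxMillis));
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt and stop retrying
                break;
            }
        }
        return status;
    }

    public static void main(String[] args) {
        // Simulated service: busy twice, then OK. No network access is involved.
        int[] statuses = {503, 503, 200};
        int[] calls = {0};
        int result = callWithRetry(() -> statuses[calls[0]++], 5, 10, 100);
        System.out.println("final status " + result + " after " + calls[0] + " calls");
    }
}
```

In a real client, the `IntSupplier` would wrap the actual HTTP call and return its response code. Keeping the number of concurrent client threads at or below the service's engine pool size (10 in this log) is another option to consider alongside retrying.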

"C:\Program Files\Java\jdk-13.0.1\bin\java.exe" -Dvisualvm.id=77656153892300 "-javaagent:C:\Program Files\JetBrains\IntelliJ IDEA 2019.2.4\lib\idea_rt.jar=63506:C:\Program Files\JetBrains\IntelliJ IDEA 2019.2.4\bin" -Dfile.encoding=UTF-8 -classpath D:\Projects\pdf_to_boac_schema\target\classes;C:\Users\Gian.m2\repository\org\apache\maven\plugins\maven-compiler-plugin\3.8.1\maven-compiler-plugin-3.8.1.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-plugin-api\3.0\maven-plugin-api-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-model\3.0\maven-model-3.0.jar;C:\Users\Gian.m2\repository\org\sonatype\sisu\sisu-inject-plexus\1.4.2\sisu-inject-plexus-1.4.2.jar;C:\Users\Gian.m2\repository\org\sonatype\sisu\sisu-inject-bean\1.4.2\sisu-inject-bean-1.4.2.jar;C:\Users\Gian.m2\repository\org\sonatype\sisu\sisu-guice\2.1.7\sisu-guice-2.1.7-noaop.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-artifact\3.0\maven-artifact-3.0.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-utils\2.0.4\plexus-utils-2.0.4.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-core\3.0\maven-core-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-settings\3.0\maven-settings-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-settings-builder\3.0\maven-settings-builder-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-repository-metadata\3.0\maven-repository-metadata-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-model-builder\3.0\maven-model-builder-3.0.jar;C:\Users\Gian.m2\repository\org\apache\maven\maven-aether-provider\3.0\maven-aether-provider-3.0.jar;C:\Users\Gian.m2\repository\org\sonatype\aether\aether-impl\1.7\aether-impl-1.7.jar;C:\Users\Gian.m2\repository\org\sonatype\aether\aether-spi\1.7\aether-spi-1.7.jar;C:\Users\Gian.m2\repository\org\sonatype\aether\aether-api\1.7\aether-api-1.7.jar;C:\Users\Gian.m2\repository\org\sonatype\aether\aether-util\1.7\aether-util-1.7.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\p
lexus-interpolation\1.14\plexus-interpolation-1.14.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-classworlds\2.2.3\plexus-classworlds-2.2.3.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-component-annotations\1.5.5\plexus-component-annotations-1.5.5.jar;C:\Users\Gian.m2\repository\org\sonatype\plexus\plexus-sec-dispatcher\1.3\plexus-sec-dispatcher-1.3.jar;C:\Users\Gian.m2\repository\org\sonatype\plexus\plexus-cipher\1.4\plexus-cipher-1.4.jar;C:\Users\Gian.m2\repository\org\apache\maven\shared\maven-shared-utils\3.2.1\maven-shared-utils-3.2.1.jar;C:\Users\Gian.m2\repository\org\apache\maven\shared\maven-shared-incremental\1.1\maven-shared-incremental-1.1.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-java\0.9.10\plexus-java-0.9.10.jar;C:\Users\Gian.m2\repository\org\ow2\asm\asm\6.2\asm-6.2.jar;C:\Users\Gian.m2\repository\com\thoughtworks\qdox\qdox\2.0-M8\qdox-2.0-M8.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-compiler-api\2.8.4\plexus-compiler-api-2.8.4.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-compiler-manager\2.8.4\plexus-compiler-manager-2.8.4.jar;C:\Users\Gian.m2\repository\org\codehaus\plexus\plexus-compiler-javac\2.8.4\plexus-compiler-javac-2.8.4.jar;C:\Users\Gian.m2\repository\org\json\json\20190722\json-20190722.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\core\jackson-databind\2.9.6\jackson-databind-2.9.6.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\dataformat\jackson-dataformat-xml\2.9.7\jackson-dataformat-xml-2.9.7.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\module\jackson-module-jaxb-annotations\2.9.7\jackson-module-jaxb-annotations-2.9.7.jar;C:\Users\Gian.m2\repository\org\codehaus\woodstox\stax2-api\3.1.4\stax2-api-3.1.4.jar;C:\Users\Gian.m2\repository\com\fasterxml\woodstox\woodstox-core\5.0.3\woodstox-core-5.0.3.jar;C:\Users\Gian.m2\repository\xom\xom\1.2.5\xom-1.2.5.jar;C:\Users\Gian.m2\repository\xml-apis\xml-apis\1.3.03\xml-apis-1.3.03.jar;C:\Users\Gian
.m2\repository\xerces\xercesImpl\2.8.0\xercesImpl-2.8.0.jar;C:\Users\Gian.m2\repository\xalan\xalan\2.7.0\xalan-2.7.0.jar;C:\Users\Gian.m2\repository\javax\xml\jaxp-api\1.4.2\jaxp-api-1.4.2.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\core\jackson-core\2.9.6\jackson-core-2.9.6.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\core\jackson-annotations\2.9.6\jackson-annotations-2.9.6.jar;C:\Users\Gian.m2\repository\org\grobid\grobid-core\0.6.0\grobid-core-0.6.0.jar;C:\Users\Gian.m2\repository\org\apache\commons\commons-text\1.8\commons-text-1.8.jar;C:\Users\Gian.m2\repository\org\slf4j\slf4j-api\1.7.25\slf4j-api-1.7.25.jar;C:\Users\Gian.m2\repository\com\cybozu\labs\langdetect\1.1-20120112\langdetect-1.1-20120112.jar;C:\Users\Gian.m2\repository\com\rockymadden\stringmetric\stringmetric-core_2.10\0.27.3\stringmetric-core_2.10-0.27.3.jar;C:\Users\Gian.m2\repository\org\scala-lang\scala-library\2.10.3\scala-library-2.10.3.jar;C:\Users\Gian.m2\repository\commons-pool\commons-pool\1.6\commons-pool-1.6.jar;C:\Users\Gian.m2\repository\commons-io\commons-io\2.5\commons-io-2.5.jar;C:\Users\Gian.m2\repository\org\apache\commons\commons-lang3\3.6\commons-lang3-3.6.jar;C:\Users\Gian.m2\repository\org\apache\commons\commons-collections4\4.1\commons-collections4-4.1.jar;C:\Users\Gian.m2\repository\commons-dbutils\commons-dbutils\1.7\commons-dbutils-1.7.jar;C:\Users\Gian.m2\repository\org\apache\httpcomponents\httpclient\4.5.3\httpclient-4.5.3.jar;C:\Users\Gian.m2\repository\org\apache\httpcomponents\httpcore\4.4.6\httpcore-4.4.6.jar;C:\Users\Gian.m2\repository\commons-logging\commons-logging\1.2\commons-logging-1.2.jar;C:\Users\Gian.m2\repository\commons-codec\commons-codec\1.9\commons-codec-1.9.jar;C:\Users\Gian.m2\repository\com\google\guava\guava\28.2-jre\guava-28.2-jre.jar;C:\Users\Gian.m2\repository\com\google\guava\failureaccess\1.0.1\failureaccess-1.0.1.jar;C:\Users\Gian.m2\repository\com\google\guava\listenablefuture\9999.0-empty-to-avoid-conflict-with-guava\li
stenablefuture-9999.0-empty-to-avoid-conflict-with-guava.jar;C:\Users\Gian.m2\repository\com\google\code\findbugs\jsr305\3.0.2\jsr305-3.0.2.jar;C:\Users\Gian.m2\repository\org\checkerframework\checker-qual\2.10.0\checker-qual-2.10.0.jar;C:\Users\Gian.m2\repository\com\google\errorprone\error_prone_annotations\2.3.4\error_prone_annotations-2.3.4.jar;C:\Users\Gian.m2\repository\com\google\j2objc\j2objc-annotations\1.3\j2objc-annotations-1.3.jar;C:\Users\Gian.m2\repository\net\arnx\jsonic\1.3.10\jsonic-1.3.10.jar;C:\Users\Gian.m2\repository\org\apache\pdfbox\pdfbox\2.0.18\pdfbox-2.0.18.jar;C:\Users\Gian.m2\repository\org\apache\pdfbox\fontbox\2.0.18\fontbox-2.0.18.jar;C:\Users\Gian.m2\repository\net\sf\saxon\Saxon-HE\9.6.0-9\Saxon-HE-9.6.0-9.jar;C:\Users\Gian.m2\repository\com\fasterxml\jackson\module\jackson-module-afterburner\2.10.1\jackson-module-afterburner-2.10.1.jar;C:\Users\Gian.m2\repository\joda-time\joda-time\2.9.9\joda-time-2.9.9.jar;C:\Users\Gian.m2\repository\org\apache\lucene\lucene-analyzers-common\4.5.1\lucene-analyzers-common-4.5.1.jar;C:\Users\Gian.m2\repository\org\apache\lucene\lucene-core\4.5.1\lucene-core-4.5.1.jar;C:\Users\Gian.m2\repository\javax\xml\bind\jaxb-api\2.3.0\jaxb-api-2.3.0.jar;C:\Users\Gian.m2\repository\black\ninia\jep\3.8.2\jep-3.8.2.jar;C:\Users\Gian.m2\repository\org\slf4j\slf4j-log4j12\1.7.25\slf4j-log4j12-1.7.25.jar;C:\Users\Gian.m2\repository\log4j\log4j\1.2.17\log4j-1.2.17.jar;C:\Users\Gian.m2\repository\org\slf4j\slf4j-jdk14\1.7.25\slf4j-jdk14-1.7.25.jar Main.main 22:50:46.161 [main] INFO org.grobid.service.main.GrobidServiceApplication - Configuration file: /opt/grobid/grobid-service/config/config.yaml INFO [2020-05-22 22:50:50,586] io.dropwizard.server.DefaultServerFactory: Registering jersey handler with root path prefix: / INFO [2020-05-22 22:50:50,597] io.dropwizard.server.DefaultServerFactory: Registering admin handler with root path prefix: / INFO [2020-05-22 22:50:50,612] io.dropwizard.assets.AssetsBundle: 
Registering AssetBundle with name: grobidAssets for path /* INFO [2020-05-22 22:50:51,818] com.hubspot.dropwizard.guicier.DropwizardModule: Added guice injected health check: org.grobid.service.resources.HealthResource INFO [2020-05-22 22:50:51,848] org.grobid.service.GrobidRestService: Initiating Servlet GrobidRestService INFO [2020-05-22 22:50:51,850] org.grobid.core.main.LibraryLoader: Loading external native sequence labelling library INFO [2020-05-22 22:50:51,871] org.grobid.core.main.LibraryLoader: Loading Wapiti native library... INFO [2020-05-22 22:50:51,884] org.grobid.core.main.LibraryLoader: Native library for sequence labelling loaded INFO [2020-05-22 22:50:51,902] org.grobid.core.lexicon.Lexicon: Initiating dictionary INFO [2020-05-22 22:50:51,904] org.grobid.core.lexicon.Lexicon: End of Initialization of dictionary INFO [2020-05-22 22:50:51,912] org.grobid.core.lexicon.Lexicon: Initiating names INFO [2020-05-22 22:50:51,916] org.grobid.core.lexicon.Lexicon: End of initialization of names INFO [2020-05-22 22:50:53,514] org.grobid.core.lexicon.Lexicon: Initiating country codes INFO [2020-05-22 22:50:53,516] org.grobid.core.lexicon.Lexicon: End of initialization of country codes INFO [2020-05-22 22:50:54,094] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10 INFO [2020-05-22 22:50:54,095] org.grobid.service.GrobidRestService: Initiating of Servlet GrobidRestService finished. 
INFO [2020-05-22 22:50:54,126] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.resources.HealthResource as a root resource class INFO [2020-05-22 22:50:54,128] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.GrobidRestService as a root resource class INFO [2020-05-22 22:50:54,137] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidServiceExceptionMapper as a provider class INFO [2020-05-22 22:50:54,140] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility as a provider class INFO [2020-05-22 22:50:54,141] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionMapper as a provider class INFO [2020-05-22 22:50:54,142] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.WebApplicationExceptionMapper as a provider class INFO [2020-05-22 22:50:54,152] org.grobid.service.main.GrobidServiceApplication: Service config=GrobidServiceConfiguration{server=DefaultServerFactory{applicationConnectors=[io.dropwizard.jetty.HttpConnectorFactory@497570fb], adminConnectors=[io.dropwizard.jetty.HttpConnectorFactory@d71adc2], adminMaxThreads=64, adminMinThreads=1, applicationContextPath=/, adminContextPath=/}, logging=DefaultLoggingFactory{level=INFO, loggers={org.apache.pdfbox.pdmodel.font.PDSimpleFont="OFF"}, appenders=[io.dropwizard.logging.ConsoleAppenderFactory@6e57b5e9, io.dropwizard.logging.FileAppenderFactory@367b22e5]}, metrics=MetricsFactory{frequency=1 minute, reporters=[]}} INFO [2020-05-22 22:50:54,212] io.dropwizard.server.ServerFactory: Starting grobid-service


[ASCII-art startup banner of grobid-service]

INFO [2020-05-22 22:50:54,633] org.eclipse.jetty.setuid.SetUIDListener: Opened application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070} INFO [2020-05-22 22:50:54,636] org.eclipse.jetty.setuid.SetUIDListener: Opened admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071} INFO [2020-05-22 22:50:54,648] org.eclipse.jetty.server.Server: jetty-9.4.11.v20180605; built: 2018-06-05T18:24:03.829Z; git: d5fc0523cfa96bfebfbda19606cad384d772f04c; jvm 1.8.0_212-b04 WARN [2020-05-22 22:50:55,448] org.glassfish.jersey.internal.inject.Providers: A provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility registered in SERVER runtime does not implement any provider interfaces applicable in the SERVER runtime. Due to constraint configuration problems the provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility will be ignored. INFO [2020-05-22 22:50:55,740] io.dropwizard.jersey.DropwizardResourceConfig: The following paths were found for the configured resources:

POST    /api/annotatePDF (org.grobid.service.GrobidRestService)
POST    /api/citationPatentAnnotations (org.grobid.service.GrobidRestService)
GET     /api/grobid (org.grobid.service.GrobidRestService)
GET     /api/health (org.grobid.service.resources.HealthResource)
GET     /api/isalive (org.grobid.service.GrobidRestService)
POST    /api/processAffiliations (org.grobid.service.GrobidRestService)
PUT     /api/processAffiliations (org.grobid.service.GrobidRestService)
POST    /api/processCitation (org.grobid.service.GrobidRestService)
PUT     /api/processCitation (org.grobid.service.GrobidRestService)
POST    /api/processCitationNames (org.grobid.service.GrobidRestService)
PUT     /api/processCitationNames (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentPDF (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentST36 (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentTXT (org.grobid.service.GrobidRestService)
POST    /api/processDate (org.grobid.service.GrobidRestService)
PUT     /api/processDate (org.grobid.service.GrobidRestService)
POST    /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
POST    /api/processFulltextDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderDocument (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderNames (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderNames (org.grobid.service.GrobidRestService)
POST    /api/processReferences (org.grobid.service.GrobidRestService)
PUT     /api/processReferences (org.grobid.service.GrobidRestService)
POST    /api/referenceAnnotations (org.grobid.service.GrobidRestService)
GET     /api/version (org.grobid.service.GrobidRestService)

INFO [2020-05-22 22:50:55,745] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@5403907{/,null,AVAILABLE} INFO [2020-05-22 22:50:55,749] io.dropwizard.setup.AdminEnvironment: tasks =

POST    /tasks/log-level (io.dropwizard.servlets.tasks.LogConfigurationTask)
POST    /tasks/gc (io.dropwizard.servlets.tasks.GarbageCollectionTask)

INFO [2020-05-22 22:50:55,752] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@621f23ac{/,null,AVAILABLE} INFO [2020-05-22 22:50:55,760] org.eclipse.jetty.server.AbstractConnector: Started application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070} INFO [2020-05-22 22:50:55,762] org.eclipse.jetty.server.AbstractConnector: Started admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071} INFO [2020-05-22 22:50:55,762] org.eclipse.jetty.server.Server: Started @11025ms 172.17.0.1 - - [22/May/2020:22:50:56 +0000] "GET /api/isalive HTTP/1.1" 200 4 "-" "Java/13.0.1" 200 INFO [2020-05-22 22:50:56,235] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti (size: 2700194) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/affiliation-address/model.wapiti" Model path: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/header/model.wapiti" INFO [2020-05-22 22:50:56,422] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/header/model.wapiti (size: 2225578) 22 May 2020 17:50.56 [INFO ] GrobidService - Cheking Grobid service... 172.17.0.1 - - [22/May/2020:22:50:56 +0000] "GET /api/isalive HTTP/1.1" 200 4 "-" "Java/1.8.0_241" 1 22 May 2020 17:50.56 [INFO ] GrobidService - Grobid service is ok and can be used. 27993 PDF files 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10023767.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10063534.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10073413.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10024427.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-7 Start. 
Processing = ..\PDF_FILES\10073719.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10028170.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10074934.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10037086.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10076517.pdf 22 May 2020 17:50.56 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10074948.pdf wait for thread completion Model path: /opt/grobid/grobid-home/models/name/header/model.wapiti INFO [2020-05-22 22:50:56,722] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/citation/model.wapiti (size: 436519) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/citation/model.wapiti" Model path: /opt/grobid/grobid-home/models/name/citation/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/header/model.wapiti" INFO [2020-05-22 22:50:56,754] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/header/model.wapiti (size: 36094028) Model path: /opt/grobid/grobid-home/models/header/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/date/model.wapiti" INFO [2020-05-22 22:51:00,583] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/date/model.wapiti (size: 102435) Model path: /opt/grobid/grobid-home/models/date/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/citation/model.wapiti" INFO [2020-05-22 22:51:00,590] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/citation/model.wapiti (size: 16329542) Model path: /opt/grobid/grobid-home/models/citation/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/fulltext/model.wapiti" INFO [2020-05-22 22:51:02,350] org.grobid.core.jni.WapitiModel: Loading model: 
/opt/grobid/grobid-home/models/fulltext/model.wapiti (size: 21112902) Model path: /opt/grobid/grobid-home/models/fulltext/model.wapiti INFO [2020-05-22 22:51:04,933] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/segmentation/model.wapiti (size: 16788068) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/segmentation/model.wapiti" Model path: /opt/grobid/grobid-home/models/segmentation/model.wapiti INFO [2020-05-22 22:51:06,959] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti (size: 4921245) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/reference-segmenter/model.wapiti" Model path: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti INFO [2020-05-22 22:51:07,580] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/figure/model.wapiti (size: 695735) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/figure/model.wapiti" Model path: /opt/grobid/grobid-home/models/figure/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/table/model.wapiti" INFO [2020-05-22 22:51:07,719] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/table/model.wapiti (size: 1364953) Model path: /opt/grobid/grobid-home/models/table/model.wapiti INFO [2020-05-22 22:51:07,825] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10 INFO [2020-05-22 22:51:07,833] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 2/10 INFO [2020-05-22 22:51:07,836] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 3/10 INFO [2020-05-22 22:51:07,838] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 4/10 INFO [2020-05-22 22:51:07,839] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 5/10 INFO [2020-05-22 22:51:07,841] 
org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 6/10 INFO [2020-05-22 22:51:07,842] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 7/10 INFO [2020-05-22 22:51:07,844] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 8/10 INFO [2020-05-22 22:51:07,873] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 9/10 INFO [2020-05-22 22:51:07,894] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 ERROR [2020-05-22 22:51:08,999] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-22 22:51:10,023] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 22 May 2020 17:51.10 [ERROR] GrobidService - Failed : HTTP error code : 503 172.17.0.1 - - [22/May/2020:22:51:10 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 14253 172.17.0.1 - - [22/May/2020:22:51:10 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 14925 ERROR [2020-05-22 22:51:11,053] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:11 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 15215 ERROR [2020-05-22 22:51:13,303] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:13 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 17455 ERROR [2020-05-22 22:51:14,306] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. 
Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:14 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 18452 ERROR [2020-05-22 22:51:15,313] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:15 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 19460 ERROR [2020-05-22 22:51:21,039] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:21 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 25192 ERROR [2020-05-22 22:51:22,048] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 26233 WARN [2020-05-22 22:51:22,282] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,284] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,288] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,291] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for 
cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,292] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,294] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-22 22:51:22,295] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles ERROR [2020-05-22 22:51:23,197] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:23 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 27355 ERROR [2020-05-22 22:51:24,287] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [22/May/2020:22:51:24 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 28436 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10063534.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10028170.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-5 End. :29068 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-3 End. 
:29068 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10192046.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10084516.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10074948.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10023767.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-1 End. :29091 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-9 End. :29090 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10073719.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10076517.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-7 End. :29091 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10208228.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10024427.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10204695.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10074934.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-8 End. :29091 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-2 End. :29091 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10229718.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-10 End. 
:29091 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\103154.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10233719.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10329144.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Software caused connection abort: recv failed 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10037086.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-4 End. :29257 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10329145.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Software caused connection abort: recv failed 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10073413.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-6 End. :29258 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10348097.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10208228.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-9 End. :366 ms 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10356134.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10204695.pdf processed. 
22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10348097.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-6 End. :199 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10357212.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10192046.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10329144.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-10 End. :366 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10361725.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\103154.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-1 End. :366 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10362838.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10233719.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10329145.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-4 End. :201 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10365166.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-3 End. :389 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-8 End. :367 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10373368.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10368945.pdf 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-2 End. :369 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10378626.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10084516.pdf processed. 
22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-5 End. :415 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10384886.pdf 22 May 2020 17:51.25 [ERROR] GrobidService - Error writing to server 22 May 2020 17:51.25 [INFO ] GrobidWorker - ..\PDF_FILES\10229718.pdf processed. 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-7 End. :398 ms 22 May 2020 17:51.25 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10388554.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10378626.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 End. :63 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10384886.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10390629.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10373368.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 End. :41 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 End. :65 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10392722.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10392726.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10368945.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 End. :66 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 Start. 
Processing = ..\PDF_FILES\10397660.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10365166.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 End. :70 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10403354.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10356134.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 End. :73 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10403671.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10361725.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 End. :72 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10405897.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10362838.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 End. :73 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10416375.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10357212.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 End. :75 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10416381.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10388554.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 End. 
:49 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10418876.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10403671.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 End. :79 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10418901.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10405897.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 End. :80 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10425240.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10403354.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 End. :82 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10426207.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10416381.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 End. :83 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10426212.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10392722.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 End. :92 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10390629.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 End. 
:92 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10432280.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10430622.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10416375.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 End. :85 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10418876.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10438601.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 End. :78 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10392726.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 End. :93 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10443789.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10397660.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10443529.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 End. :92 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 Start. 
Processing = ..\PDF_FILES\10458992.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10432280.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10418901.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 End. :64 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 End. :72 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10460831.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10460833.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10426207.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 End. :69 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10475084.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10458992.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 End. :63 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10481737.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10426212.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 End. :66 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 Start. 
Processing = ..\PDF_FILES\10488727.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10443529.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 End. :66 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10498029.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10438601.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 End. :66 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10425240.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10500282.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 End. :73 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10503775.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10443789.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 End. :70 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10430622.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10507310.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 End. :72 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10507327.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10475084.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 End. 
:59 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10507362.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10481737.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 End. :68 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10515281.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10507327.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 End. :61 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10515282.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10488727.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 End. :68 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-6 Start. Processing = ..\PDF_FILES\10515283.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10507310.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 End. :62 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-8 Start. Processing = ..\PDF_FILES\10515284.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10460831.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 End. :70 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 Start. Processing = ..\PDF_FILES\10517924.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10460833.pdf processed. 
22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 End. :71 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-9 Start. Processing = ..\PDF_FILES\10518694.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10500282.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 End. :71 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 Start. Processing = ..\PDF_FILES\10518710.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [ERROR] GrobidService - Unexpected end of file from server 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10498029.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 End. :72 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10503775.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10522785.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 End. :72 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10541018.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Connection reset 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10507362.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 End. :57 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-4 Start. Processing = ..\PDF_FILES\10544100.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Connection reset 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10518710.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 End. :43 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-1 Start. 
Processing = ..\PDF_FILES\10544119.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Connection reset 22 May 2020 17:51.26 [ERROR] GrobidService - Connection reset 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10541018.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 End. :44 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10522785.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 End. :62 ms 22 May 2020 17:51.26 [ERROR] GrobidService - Unrecognized Windows Sockets error: 0: recv failed 22 May 2020 17:51.26 [ERROR] GrobidService - Unrecognized Windows Sockets error: 0: recv failed 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-10 Start. Processing = ..\PDF_FILES\10547254.pdf 22 May 2020 17:51.26 [ERROR] GrobidService - Connection reset 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-7 Start. Processing = ..\PDF_FILES\10550676.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10515282.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 End. :68 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-5 Start. Processing = ..\PDF_FILES\10562501.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10515281.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 End. :68 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-3 Start. Processing = ..\PDF_FILES\10573552.pdf 22 May 2020 17:51.26 [INFO ] GrobidWorker - ..\PDF_FILES\10517924.pdf processed. 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 End. :67 ms 22 May 2020 17:51.26 [INFO ] GrobidWorker - pool-1-thread-2 Start. 
Processing = ..\PDF_FILES\10573809.pdf 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect 22 May 2020 17:51.28 [ERROR] GrobidService - Connection refused: connect

Process finished with exit code -1

lfoppiano commented 4 years ago

@lucaspada894 I'm trying to reproduce your problem. Did you use grobid-client-python to process your files?

lucaspada894 commented 4 years ago

No, I used the grobid-client-java to process the files.

lfoppiano commented 4 years ago

OK, a "connection refused" error means the Docker container has been killed. This can happen when the allocated memory is insufficient. However, I had it set to 4.5 GB and still hit the problem. Could you try increasing the memory allocated to your Docker server to, let's say, 6 or 8 GB?

Grobid should work fine with 4 GB but, well, let's see if we can make it run first ;-)
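
Since "connection refused" means the container is gone, a client could check that the service is still reachable before submitting the next PDF. The following is a minimal sketch, not part of grobid-client-java: the class name `GrobidProbe` and the method `isAlive` are hypothetical, and the URL in the usage note assumes the service is published on host port 8080, as in the `docker run` command quoted in this thread. The `/api/isalive` path is taken from the service's own startup log.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;

// Hypothetical helper (not part of grobid-client-java).
final class GrobidProbe {

    // Returns true only if a GET on the given URL answers HTTP 200 within the timeout.
    // Any I/O failure (connection refused, reset, timeout, malformed URL) yields false.
    public static boolean isAlive(String url, int timeoutMillis) {
        HttpURLConnection conn = null;
        try {
            conn = (HttpURLConnection) new URL(url).openConnection();
            conn.setRequestMethod("GET");
            conn.setConnectTimeout(timeoutMillis);
            conn.setReadTimeout(timeoutMillis);
            return conn.getResponseCode() == 200;
        } catch (IOException e) {
            return false;
        } finally {
            if (conn != null) {
                conn.disconnect();
            }
        }
    }
}
```

A worker could call `GrobidProbe.isAlive("http://localhost:8080/api/isalive", 2000)` before each file and pause the batch when it returns false, rather than moving on to the next PDF while the container is down.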

lucaspada894 commented 4 years ago

Okay. Also, here is a different log from running the command "docker run -t --rm --init -p 8080:8070 -p 8081:8071 lfoppiano/grobid:0.5.6". The error in particular seems to be "Could not get an engine from the pool within configured time. Sending service unavailable."
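
One possible reading of that message, given that the server log shows the engine pool at 10/10 while the client runs pool-1-thread-1 through pool-1-thread-10, is that the client submits requests faster than engines are released. A common client-side mitigation is to cap the number of in-flight requests below the pool size and to retry after a pause when the service answers 503. The sketch below illustrates that idea only; `ThrottledRetry` is a hypothetical class, not part of grobid-client-java, and the request itself is passed in as a function returning the HTTP status code.

```java
import java.util.concurrent.Semaphore;
import java.util.function.IntSupplier;

// Hypothetical helper (not part of grobid-client-java).
final class ThrottledRetry {
    private final Semaphore permits;      // caps concurrent requests
    private final int maxAttempts;        // total tries per request
    private final long baseDelayMillis;   // first pause after a 503

    public ThrottledRetry(int maxConcurrent, int maxAttempts, long baseDelayMillis) {
        this.permits = new Semaphore(maxConcurrent);
        this.maxAttempts = maxAttempts;
        this.baseDelayMillis = baseDelayMillis;
    }

    // Exponential backoff: base, 2*base, 4*base, ... (shift capped at 10).
    public static long backoffMillis(long base, int attempt) {
        return base << Math.min(attempt, 10);
    }

    // Runs the request until it returns something other than 503 or the
    // attempts are used up; returns the last status code seen.
    public int call(IntSupplier request) {
        permits.acquireUninterruptibly();
        try {
            int status = 503;
            for (int attempt = 0; attempt < maxAttempts; attempt++) {
                status = request.getAsInt();
                if (status != 503) {
                    return status;
                }
                if (attempt < maxAttempts - 1) {
                    try {
                        Thread.sleep(backoffMillis(baseDelayMillis, attempt));
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt(); // preserve the interrupt and stop retrying
                        return status;
                    }
                }
            }
            return status;
        } finally {
            permits.release();
        }
    }
}
```

With a pool of 10 engines, something like `new ThrottledRetry(8, 5, 1000)` shared by all worker threads would keep at most 8 requests in flight and wait 1 s, 2 s, 4 s and 8 s between retries. These numbers are illustrative, not tuned values.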

lucaspada894 commented 4 years ago

Oops, forgot to attach it.

Microsoft Windows [Version 10.0.18362.836] (c) 2019 Microsoft Corporation. All rights reserved.

C:\Users\Gian>docker run -t --rm --init -p 8080:8070 -p 8081:8071 lfoppiano/grobid:0.5.6
06:53:51.515 [main] INFO org.grobid.service.main.GrobidServiceApplication - Configuration file: /opt/grobid/grobid-service/config/config.yaml
INFO [2020-05-23 06:53:53,792] io.dropwizard.server.DefaultServerFactory: Registering jersey handler with root path prefix: /
INFO [2020-05-23 06:53:53,795] io.dropwizard.server.DefaultServerFactory: Registering admin handler with root path prefix: /
INFO [2020-05-23 06:53:53,797] io.dropwizard.assets.AssetsBundle: Registering AssetBundle with name: grobidAssets for path /*
INFO [2020-05-23 06:53:54,475] com.hubspot.dropwizard.guicier.DropwizardModule: Added guice injected health check: org.grobid.service.resources.HealthResource
INFO [2020-05-23 06:53:54,492] org.grobid.service.GrobidRestService: Initiating Servlet GrobidRestService
INFO [2020-05-23 06:53:54,498] org.grobid.core.main.LibraryLoader: Loading external native sequence labelling library
INFO [2020-05-23 06:53:54,502] org.grobid.core.main.LibraryLoader: Loading Wapiti native library...
INFO [2020-05-23 06:53:54,525] org.grobid.core.main.LibraryLoader: Native library for sequence labelling loaded
INFO [2020-05-23 06:53:54,532] org.grobid.core.lexicon.Lexicon: Initiating dictionary
INFO [2020-05-23 06:53:54,533] org.grobid.core.lexicon.Lexicon: End of Initialization of dictionary
INFO [2020-05-23 06:53:54,534] org.grobid.core.lexicon.Lexicon: Initiating names
INFO [2020-05-23 06:53:54,535] org.grobid.core.lexicon.Lexicon: End of initialization of names
INFO [2020-05-23 06:53:55,029] org.grobid.core.lexicon.Lexicon: Initiating country codes
INFO [2020-05-23 06:53:55,029] org.grobid.core.lexicon.Lexicon: End of initialization of country codes
INFO [2020-05-23 06:53:55,186] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10
INFO [2020-05-23 06:53:55,186] org.grobid.service.GrobidRestService: Initiating of Servlet GrobidRestService finished.
INFO [2020-05-23 06:53:55,202] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.resources.HealthResource as a root resource class
INFO [2020-05-23 06:53:55,202] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.GrobidRestService as a root resource class
INFO [2020-05-23 06:53:55,206] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidServiceExceptionMapper as a provider class
INFO [2020-05-23 06:53:55,207] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility as a provider class
INFO [2020-05-23 06:53:55,207] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionMapper as a provider class
INFO [2020-05-23 06:53:55,207] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.WebApplicationExceptionMapper as a provider class
INFO [2020-05-23 06:53:55,211] org.grobid.service.main.GrobidServiceApplication: Service config=GrobidServiceConfiguration{server=DefaultServerFactory{applicationConnectors=[io.dropwizard.jetty.HttpConnectorFactory@497570fb], adminConnectors=[io.dropwizard.jetty.HttpConnectorFactory@d71adc2], adminMaxThreads=64, adminMinThreads=1, applicationContextPath=/, adminContextPath=/}, logging=DefaultLoggingFactory{level=INFO, loggers={org.apache.pdfbox.pdmodel.font.PDSimpleFont="OFF"}, appenders=[io.dropwizard.logging.ConsoleAppenderFactory@6e57b5e9, io.dropwizard.logging.FileAppenderFactory@367b22e5]}, metrics=MetricsFactory{frequency=1 minute, reporters=[]}}
INFO [2020-05-23 06:53:55,232] io.dropwizard.server.ServerFactory: Starting grobid-service


[GROBID ASCII-art startup banner]

INFO [2020-05-23 06:53:55,423] org.eclipse.jetty.setuid.SetUIDListener: Opened application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070}
INFO [2020-05-23 06:53:55,423] org.eclipse.jetty.setuid.SetUIDListener: Opened admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071}
INFO [2020-05-23 06:53:55,428] org.eclipse.jetty.server.Server: jetty-9.4.11.v20180605; built: 2018-06-05T18:24:03.829Z; git: d5fc0523cfa96bfebfbda19606cad384d772f04c; jvm 1.8.0_212-b04
WARN [2020-05-23 06:53:56,179] org.glassfish.jersey.internal.inject.Providers: A provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility registered in SERVER runtime does not implement any provider interfaces applicable in the SERVER runtime. Due to constraint configuration problems the provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility will be ignored.
INFO [2020-05-23 06:53:56,554] io.dropwizard.jersey.DropwizardResourceConfig: The following paths were found for the configured resources:

POST    /api/annotatePDF (org.grobid.service.GrobidRestService)
POST    /api/citationPatentAnnotations (org.grobid.service.GrobidRestService)
GET     /api/grobid (org.grobid.service.GrobidRestService)
GET     /api/health (org.grobid.service.resources.HealthResource)
GET     /api/isalive (org.grobid.service.GrobidRestService)
POST    /api/processAffiliations (org.grobid.service.GrobidRestService)
PUT     /api/processAffiliations (org.grobid.service.GrobidRestService)
POST    /api/processCitation (org.grobid.service.GrobidRestService)
PUT     /api/processCitation (org.grobid.service.GrobidRestService)
POST    /api/processCitationNames (org.grobid.service.GrobidRestService)
PUT     /api/processCitationNames (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentPDF (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentST36 (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentTXT (org.grobid.service.GrobidRestService)
POST    /api/processDate (org.grobid.service.GrobidRestService)
PUT     /api/processDate (org.grobid.service.GrobidRestService)
POST    /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
POST    /api/processFulltextDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderDocument (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderNames (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderNames (org.grobid.service.GrobidRestService)
POST    /api/processReferences (org.grobid.service.GrobidRestService)
PUT     /api/processReferences (org.grobid.service.GrobidRestService)
POST    /api/referenceAnnotations (org.grobid.service.GrobidRestService)
GET     /api/version (org.grobid.service.GrobidRestService)

INFO [2020-05-23 06:53:56,568] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@5403907{/,null,AVAILABLE}
INFO [2020-05-23 06:53:56,573] io.dropwizard.setup.AdminEnvironment: tasks =

POST    /tasks/log-level (io.dropwizard.servlets.tasks.LogConfigurationTask)
POST    /tasks/gc (io.dropwizard.servlets.tasks.GarbageCollectionTask)

INFO [2020-05-23 06:53:56,581] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@621f23ac{/,null,AVAILABLE} INFO [2020-05-23 06:53:56,608] org.eclipse.jetty.server.AbstractConnector: Started application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070} INFO [2020-05-23 06:53:56,609] org.eclipse.jetty.server.AbstractConnector: Started admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071} INFO [2020-05-23 06:53:56,609] org.eclipse.jetty.server.Server: Started @6148ms INFO [2020-05-23 06:54:01,338] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti (size: 2700194) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/affiliation-address/model.wapiti" Model path: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/header/model.wapiti" INFO [2020-05-23 06:54:01,589] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/header/model.wapiti (size: 2225578) Model path: /opt/grobid/grobid-home/models/name/header/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/citation/model.wapiti" INFO [2020-05-23 06:54:01,792] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/citation/model.wapiti (size: 436519) Model path: /opt/grobid/grobid-home/models/name/citation/model.wapiti INFO [2020-05-23 06:54:01,833] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/header/model.wapiti (size: 36094028) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/header/model.wapiti" Model path: /opt/grobid/grobid-home/models/header/model.wapiti INFO [2020-05-23 06:54:07,181] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/date/model.wapiti (size: 102435) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/date/model.wapiti" Model path: /opt/grobid/grobid-home/models/date/model.wapiti 
[Wapiti] Loading model: "/opt/grobid/grobid-home/models/citation/model.wapiti" INFO [2020-05-23 06:54:07,197] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/citation/model.wapiti (size: 16329542) Model path: /opt/grobid/grobid-home/models/citation/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/fulltext/model.wapiti" INFO [2020-05-23 06:54:09,300] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/fulltext/model.wapiti (size: 21112902) Model path: /opt/grobid/grobid-home/models/fulltext/model.wapiti INFO [2020-05-23 06:54:12,183] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/segmentation/model.wapiti (size: 16788068) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/segmentation/model.wapiti" Model path: /opt/grobid/grobid-home/models/segmentation/model.wapiti INFO [2020-05-23 06:54:15,049] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti (size: 4921245) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/reference-segmenter/model.wapiti" Model path: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti INFO [2020-05-23 06:54:15,633] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/figure/model.wapiti (size: 695735) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/figure/model.wapiti" Model path: /opt/grobid/grobid-home/models/figure/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/table/model.wapiti" INFO [2020-05-23 06:54:15,698] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/table/model.wapiti (size: 1364953) Model path: /opt/grobid/grobid-home/models/table/model.wapiti INFO [2020-05-23 06:54:15,822] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10INFO [2020-05-23 06:54:15,924] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines 
in pool active/max: 2/10INFO [2020-05-23 06:54:15,934] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 3/10INFO [2020-05-23 06:54:15,947] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 4/10INFO [2020-05-23 06:54:15,949] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 5/10INFO [2020-05-23 06:54:15,967] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 6/10INFO [2020-05-23 06:54:15,974] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 7/10INFO [2020-05-23 06:54:15,977] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 8/10INFO [2020-05-23 06:54:15,997] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 9/10INFO [2020-05-23 06:54:16,020] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 ERROR [2020-05-23 06:54:17,075] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:54:18,099] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:54:19,141] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:54:21,358] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:54:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 21122 172.17.0.1 - - [23/May/2020:06:54:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 21755 172.17.0.1 - - [23/May/2020:06:54:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 21756 172.17.0.1 - - [23/May/2020:06:54:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 21618 ERROR [2020-05-23 06:54:22,392] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:22 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 22113 ERROR [2020-05-23 06:54:23,408] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:23 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 23123 ERROR [2020-05-23 06:54:24,415] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:24 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 24084 ERROR [2020-05-23 06:54:30,698] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:54:30 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 29692 WARN [2020-05-23 06:54:31,736] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-23 06:54:31,740] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-23 06:54:31,740] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-23 06:54:31,758] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles ERROR [2020-05-23 06:54:31,768] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:54:31 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 31491 WARN [2020-05-23 06:54:31,798] org.grobid.core.lang.impl.CybozuLanguageDetector: Cannot detect language because of: com.cybozu.labs.langdetect.LangDetectException: no features in text WARN [2020-05-23 06:54:31,827] org.grobid.core.lang.impl.CybozuLanguageDetector: Cannot detect language because of: com.cybozu.labs.langdetect.LangDetectException: no features in text INFO [2020-05-23 06:54:31,923] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 172.17.0.1 - - [23/May/2020:06:54:32 +0000] "POST /api/processFulltextDocument HTTP/1.1" 200 1187 "-" "Java/1.8.0_241" 31656 ERROR [2020-05-23 06:54:33,119] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:33 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 2329 ERROR [2020-05-23 06:54:34,119] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:34 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 3309 ERROR [2020-05-23 06:54:35,120] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:54:35 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 4336

C:\Users\Gian>docker run -t --rm --init -p 8080:8070 -p 8081:8071 lfoppiano/grobid:0.5.6 06:55:59.669 [main] INFO org.grobid.service.main.GrobidServiceApplication - Configuration file: /opt/grobid/grobid-service/config/config.yaml INFO [2020-05-23 06:56:02,760] io.dropwizard.server.DefaultServerFactory: Registering jersey handler with root path prefix: / INFO [2020-05-23 06:56:02,792] io.dropwizard.server.DefaultServerFactory: Registering admin handler with root path prefix: / INFO [2020-05-23 06:56:02,793] io.dropwizard.assets.AssetsBundle: Registering AssetBundle with name: grobidAssets for path /* INFO [2020-05-23 06:56:03,476] com.hubspot.dropwizard.guicier.DropwizardModule: Added guice injected health check: org.grobid.service.resources.HealthResource INFO [2020-05-23 06:56:03,488] org.grobid.service.GrobidRestService: Initiating Servlet GrobidRestService INFO [2020-05-23 06:56:03,489] org.grobid.core.main.LibraryLoader: Loading external native sequence labelling library INFO [2020-05-23 06:56:03,491] org.grobid.core.main.LibraryLoader: Loading Wapiti native library... INFO [2020-05-23 06:56:03,501] org.grobid.core.main.LibraryLoader: Native library for sequence labelling loaded INFO [2020-05-23 06:56:03,504] org.grobid.core.lexicon.Lexicon: Initiating dictionary INFO [2020-05-23 06:56:03,504] org.grobid.core.lexicon.Lexicon: End of Initialization of dictionary INFO [2020-05-23 06:56:03,505] org.grobid.core.lexicon.Lexicon: Initiating names INFO [2020-05-23 06:56:03,505] org.grobid.core.lexicon.Lexicon: End of initialization of names INFO [2020-05-23 06:56:04,068] org.grobid.core.lexicon.Lexicon: Initiating country codes INFO [2020-05-23 06:56:04,068] org.grobid.core.lexicon.Lexicon: End of initialization of country codes INFO [2020-05-23 06:56:04,458] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10INFO [2020-05-23 06:56:04,458] org.grobid.service.GrobidRestService: Initiating of Servlet GrobidRestService finished. 
INFO [2020-05-23 06:56:04,464] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.resources.HealthResource as a root resource class INFO [2020-05-23 06:56:04,464] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.GrobidRestService as a root resource class INFO [2020-05-23 06:56:04,465] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidServiceExceptionMapper as a provider class INFO [2020-05-23 06:56:04,465] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility as a provider class INFO [2020-05-23 06:56:04,466] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.GrobidExceptionMapper as a provider class INFO [2020-05-23 06:56:04,466] com.hubspot.dropwizard.guicier.DropwizardModule: Registering org.grobid.service.exceptions.mapper.WebApplicationExceptionMapper as a provider class INFO [2020-05-23 06:56:04,469] org.grobid.service.main.GrobidServiceApplication: Service config=GrobidServiceConfiguration{server=DefaultServerFactory{applicationConnectors=[io.dropwizard.jetty.HttpConnectorFactory@497570fb], adminConnectors=[io.dropwizard.jetty.HttpConnectorFactory@d71adc2], adminMaxThreads=64, adminMinThreads=1, applicationContextPath=/, adminContextPath=/}, logging=DefaultLoggingFactory{level=INFO, loggers={org.apache.pdfbox.pdmodel.font.PDSimpleFont="OFF"}, appenders=[io.dropwizard.logging.ConsoleAppenderFactory@6e57b5e9, io.dropwizard.logging.FileAppenderFactory@367b22e5]}, metrics=MetricsFactory{frequency=1 minute, reporters=[]}} INFO [2020-05-23 06:56:04,524] io.dropwizard.server.ServerFactory: Starting grobid-service


[GROBID ASCII-art startup banner]

INFO [2020-05-23 06:56:04,827] org.eclipse.jetty.setuid.SetUIDListener: Opened application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070}
INFO [2020-05-23 06:56:04,828] org.eclipse.jetty.setuid.SetUIDListener: Opened admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071}
INFO [2020-05-23 06:56:04,831] org.eclipse.jetty.server.Server: jetty-9.4.11.v20180605; built: 2018-06-05T18:24:03.829Z; git: d5fc0523cfa96bfebfbda19606cad384d772f04c; jvm 1.8.0_212-b04
WARN [2020-05-23 06:56:05,230] org.glassfish.jersey.internal.inject.Providers: A provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility registered in SERVER runtime does not implement any provider interfaces applicable in the SERVER runtime. Due to constraint configuration problems the provider org.grobid.service.exceptions.mapper.GrobidExceptionsTranslationUtility will be ignored.
INFO [2020-05-23 06:56:05,671] io.dropwizard.jersey.DropwizardResourceConfig: The following paths were found for the configured resources:

POST    /api/annotatePDF (org.grobid.service.GrobidRestService)
POST    /api/citationPatentAnnotations (org.grobid.service.GrobidRestService)
GET     /api/grobid (org.grobid.service.GrobidRestService)
GET     /api/health (org.grobid.service.resources.HealthResource)
GET     /api/isalive (org.grobid.service.GrobidRestService)
POST    /api/processAffiliations (org.grobid.service.GrobidRestService)
PUT     /api/processAffiliations (org.grobid.service.GrobidRestService)
POST    /api/processCitation (org.grobid.service.GrobidRestService)
PUT     /api/processCitation (org.grobid.service.GrobidRestService)
POST    /api/processCitationNames (org.grobid.service.GrobidRestService)
PUT     /api/processCitationNames (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentPDF (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentST36 (org.grobid.service.GrobidRestService)
POST    /api/processCitationPatentTXT (org.grobid.service.GrobidRestService)
POST    /api/processDate (org.grobid.service.GrobidRestService)
PUT     /api/processDate (org.grobid.service.GrobidRestService)
POST    /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextAssetDocument (org.grobid.service.GrobidRestService)
POST    /api/processFulltextDocument (org.grobid.service.GrobidRestService)
PUT     /api/processFulltextDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderDocument (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderDocument (org.grobid.service.GrobidRestService)
POST    /api/processHeaderNames (org.grobid.service.GrobidRestService)
PUT     /api/processHeaderNames (org.grobid.service.GrobidRestService)
POST    /api/processReferences (org.grobid.service.GrobidRestService)
PUT     /api/processReferences (org.grobid.service.GrobidRestService)
POST    /api/referenceAnnotations (org.grobid.service.GrobidRestService)
GET     /api/version (org.grobid.service.GrobidRestService)

INFO [2020-05-23 06:56:05,682] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@5403907{/,null,AVAILABLE}
INFO [2020-05-23 06:56:05,688] io.dropwizard.setup.AdminEnvironment: tasks =

POST    /tasks/log-level (io.dropwizard.servlets.tasks.LogConfigurationTask)
POST    /tasks/gc (io.dropwizard.servlets.tasks.GarbageCollectionTask)

INFO [2020-05-23 06:56:05,692] org.eclipse.jetty.server.handler.ContextHandler: Started i.d.j.MutableServletContextHandler@621f23ac{/,null,AVAILABLE} INFO [2020-05-23 06:56:05,707] org.eclipse.jetty.server.AbstractConnector: Started application@6963b88c{HTTP/1.1,[http/1.1]}{0.0.0.0:8070} INFO [2020-05-23 06:56:05,708] org.eclipse.jetty.server.AbstractConnector: Started admin@ab24484{HTTP/1.1,[http/1.1]}{0.0.0.0:8071} INFO [2020-05-23 06:56:05,708] org.eclipse.jetty.server.Server: Started @7659ms INFO [2020-05-23 06:56:06,929] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti (size: 2700194) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/affiliation-address/model.wapiti" Model path: /opt/grobid/grobid-home/models/affiliation-address/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/header/model.wapiti" INFO [2020-05-23 06:56:07,116] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/header/model.wapiti (size: 2225578) Model path: /opt/grobid/grobid-home/models/name/header/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/name/citation/model.wapiti" INFO [2020-05-23 06:56:07,333] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/name/citation/model.wapiti (size: 436519) Model path: /opt/grobid/grobid-home/models/name/citation/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/header/model.wapiti" INFO [2020-05-23 06:56:07,398] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/header/model.wapiti (size: 36094028) Model path: /opt/grobid/grobid-home/models/header/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/date/model.wapiti" INFO [2020-05-23 06:56:11,254] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/date/model.wapiti (size: 102435) Model path: /opt/grobid/grobid-home/models/date/model.wapiti 
INFO [2020-05-23 06:56:11,261] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/citation/model.wapiti (size: 16329542) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/citation/model.wapiti" Model path: /opt/grobid/grobid-home/models/citation/model.wapiti INFO [2020-05-23 06:56:12,784] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/fulltext/model.wapiti (size: 21112902) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/fulltext/model.wapiti" Model path: /opt/grobid/grobid-home/models/fulltext/model.wapiti INFO [2020-05-23 06:56:14,818] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/segmentation/model.wapiti (size: 16788068) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/segmentation/model.wapiti" Model path: /opt/grobid/grobid-home/models/segmentation/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/reference-segmenter/model.wapiti" INFO [2020-05-23 06:56:17,815] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti (size: 4921245) Model path: /opt/grobid/grobid-home/models/reference-segmenter/model.wapiti INFO [2020-05-23 06:56:18,369] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/figure/model.wapiti (size: 695735) [Wapiti] Loading model: "/opt/grobid/grobid-home/models/figure/model.wapiti" Model path: /opt/grobid/grobid-home/models/figure/model.wapiti [Wapiti] Loading model: "/opt/grobid/grobid-home/models/table/model.wapiti" INFO [2020-05-23 06:56:18,494] org.grobid.core.jni.WapitiModel: Loading model: /opt/grobid/grobid-home/models/table/model.wapiti (size: 1364953) Model path: /opt/grobid/grobid-home/models/table/model.wapiti INFO [2020-05-23 06:56:18,571] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 1/10INFO [2020-05-23 06:56:18,685] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines 
in pool active/max: 2/10INFO [2020-05-23 06:56:18,741] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 3/10INFO [2020-05-23 06:56:18,743] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 4/10INFO [2020-05-23 06:56:18,744] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 5/10INFO [2020-05-23 06:56:18,745] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 6/10INFO [2020-05-23 06:56:18,747] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 7/10INFO [2020-05-23 06:56:18,747] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 8/10INFO [2020-05-23 06:56:18,748] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 9/10INFO [2020-05-23 06:56:18,780] org.grobid.core.factory.GrobidPoolingFactory: Number of Engines in pool active/max: 10/10 ERROR [2020-05-23 06:56:19,795] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:56:20,800] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:56:21,819] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. ERROR [2020-05-23 06:56:24,494] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:56:25 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 18620 172.17.0.1 - - [23/May/2020:06:56:25 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 18619 172.17.0.1 - - [23/May/2020:06:56:25 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 18621 172.17.0.1 - - [23/May/2020:06:56:25 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 18619 ERROR [2020-05-23 06:56:25,564] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:25 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 19144 ERROR [2020-05-23 06:56:26,570] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:26 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 20140 ERROR [2020-05-23 06:56:27,574] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:27 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 21148 ERROR [2020-05-23 06:56:28,583] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:28 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 22162 ERROR [2020-05-23 06:56:29,644] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:56:29 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 23222 WARN [2020-05-23 06:56:30,587] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-23 06:56:30,601] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles WARN [2020-05-23 06:56:30,602] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles ERROR [2020-05-23 06:56:30,661] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:30 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 24251 ERROR [2020-05-23 06:56:37,858] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:37 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 7296 ERROR [2020-05-23 06:56:38,858] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:38 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 8894 ERROR [2020-05-23 06:56:39,859] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 
172.17.0.1 - - [23/May/2020:06:56:39 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 9834 ERROR [2020-05-23 06:56:41,364] org.grobid.service.process.GrobidRestProcessFiles: Could not get an engine from the pool within configured time. Sending service unavailable. 172.17.0.1 - - [23/May/2020:06:56:41 +0000] "POST /api/processFulltextDocument HTTP/1.1" 503 279 "-" "Java/1.8.0_241" 11341

C:\Users\Gian>

lfoppiano commented 4 years ago

Two things.

First, you should use the latest version, 0.6.0 (yes, I know the documentation said 0.5.6; I corrected it this morning). 😅 💦

Secondly, that error is fine: when the pool is empty, the service returns 503 (Service Unavailable), which makes the client wait and try again. What is not good is that the service stops after a while.
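For a custom Java client, the wait-and-retry behaviour on 503 can be sketched roughly as below. This is only an illustration, not GROBID code: `RetryOn503`, `Response` and `callWithRetry` are hypothetical names, the real POST to `/api/processFulltextDocument` is represented by a `Supplier`, and the attempt count and delays are arbitrary assumptions.

```java
import java.util.function.Supplier;

// Hypothetical helper, not part of GROBID: retries a request while the
// server answers 503, waiting a little longer before each new attempt.
class RetryOn503 {

    /** Minimal view of an HTTP response: status code and body. */
    public static final class Response {
        public final int status;
        public final String body;

        public Response(int status, String body) {
            this.status = status;
            this.body = body;
        }
    }

    /**
     * Calls the request until it returns something other than 503 or until
     * maxAttempts is reached. The delay doubles after every 503.
     */
    public static Response callWithRetry(Supplier<Response> request,
                                         int maxAttempts,
                                         long initialDelayMs) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delay = initialDelayMs;
        Response last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            last = request.get();
            if (last.status != 503) {
                return last; // success or a non-retryable error
            }
            if (attempt < maxAttempts) {
                try {
                    Thread.sleep(delay);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return last; // give up early if interrupted
                }
                delay *= 2;
            }
        }
        return last; // still 503 after all attempts
    }

    public static void main(String[] args) {
        // Stand-in for the real HTTP call: 503 twice, then 200.
        final int[] calls = {0};
        Response r = callWithRetry(() -> {
            calls[0]++;
            return calls[0] < 3 ? new Response(503, "") : new Response(200, "<TEI/>");
        }, 5, 10L);
        System.out.println(r.status + " after " + calls[0] + " calls"); // prints "200 after 3 calls"
    }
}
```

In a real client the `Supplier` would wrap the actual HTTP request, and the delays would probably need to be closer to seconds than milliseconds.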

Try to increase the memory:

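[image: image] https://user-images.githubusercontent.com/15426/82724162-a6d8c000-9cc3-11ea-9be6-cab7aebd207f.png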

I will also investigate

WARN  [2020-05-23 06:56:30,587] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles
WARN  [2020-05-23 06:56:30,601] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles
WARN  [2020-05-23 06:56:30,602] org.grobid.core.utilities.LanguageUtilities: Cannot detect language because of: java.lang.IllegalStateException: Cannot read profiles for cybozu language detection from: /opt/grobid/grobid-home/language-detection/cybozu/profiles

but not today...

lucaspada894 commented 4 years ago

Hello, the process seems to be running fine for now with 6 GB of memory.

lucaspada894 commented 4 years ago

I will use 0.6.0

lucaspada894 commented 4 years ago

Hello, I have run into an error again. This time it seems to be a stack overflow error.

    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidWorker.processCommand(GrobidWorker.java:56)
    at org.grobid.client.GrobidWorker.run(GrobidWorker.java:48)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.lang.Thread.run(Unknown Source)
23 May 2020 10:38.57 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:38.59 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.18 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.19 [ERROR] GrobidService - Connection refused: connect
23 May 2020 10:39.21 [ERROR] GrobidService - Connection refused: connect
Exception in thread "pool-1-thread-5" java.lang.StackOverflowError
23 May 2020 10:39.40 [INFO ] GrobidWorker - pool-1-thread-12 Start. Processing = ..\PDF_FILES\10507327.pdf
    at java.net.DualStackPlainSocketImpl.connect0(Native Method)
    at java.net.DualStackPlainSocketImpl.socketConnect(Unknown Source)
    at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
    at java.net.PlainSocketImpl.connect(Unknown Source)
    at java.net.SocksSocketImpl.connect(Unknown Source)
    at java.net.Socket.connect(Unknown Source)
    at java.net.Socket.connect(Unknown Source)
    at sun.net.NetworkClient.doConnect(Unknown Source)
    at sun.net.www.http.HttpClient.openServer(Unknown Source)
    at sun.net.www.http.HttpClient.openServer(Unknown Source)
    at sun.net.www.http.HttpClient.<init>(Unknown Source)
    at sun.net.www.http.HttpClient.New(Unknown Source)
    at sun.net.www.http.HttpClient.New(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.connect(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.getOutputStream0(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.getOutputStream(Unknown Source)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:82)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)
    at org.grobid.client.GrobidService.runGrobid(GrobidService.java:110)

lfoppiano commented 4 years ago

It seems that the Docker container has been terminated. For large processing jobs, 8 GB is probably enough. Otherwise, reduce the number of parallel threads in the client.
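One way to cap the number of parallel threads on the client side is a fixed-size thread pool. The sketch below is only an illustration under stated assumptions: `BoundedClient` and `processAll` are hypothetical names, the per-file work is passed in as a `Consumer`, and nothing here is taken from the GROBID Java client itself. It also records the peak number of tasks running at once, so the cap can be checked.

```java
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Consumer;

public class BoundedClient {

    /**
     * Runs task on every item using at most `threads` worker threads and
     * returns the highest number of tasks that were running at the same time.
     */
    public static <T> int processAll(List<T> items, int threads, Consumer<? super T> task) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        for (T item : items) {
            pool.execute(() -> {
                int now = running.incrementAndGet();
                peak.accumulateAndGet(now, Math::max); // remember the highest concurrency seen
                try {
                    task.accept(item);
                } finally {
                    running.decrementAndGet();
                }
            });
        }
        pool.shutdown(); // no new tasks; queued ones still run
        try {
            pool.awaitTermination(1, TimeUnit.HOURS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return peak.get();
    }

    public static void main(String[] args) {
        // Hypothetical file names; the task only prints instead of calling a server.
        List<String> pdfs = Arrays.asList("a.pdf", "b.pdf", "c.pdf", "d.pdf");
        int peak = processAll(pdfs, 2, name -> System.out.println("processing " + name));
        System.out.println("peak parallel tasks: " + peak);
    }
}
```

With a pool of 2, at most two requests are in flight at any time, which keeps the load on the server's engine pool and memory lower than with a larger pool.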

lfoppiano commented 4 years ago

@lucaspada894 did adding more memory fix the issue?

lucaspada894 commented 4 years ago

Yes, it did. Thank you very much!

lfoppiano commented 4 years ago

If you are satisfied, please close this issue. 😉

FYI, I'm working on adding more documentation (mostly things already discussed or written somewhere in here); you can see the preview here: https://grobid.readthedocs.io/en/add-developers-guide/

lucaspada894 commented 4 years ago

Oh, sorry about that. I will close the issue in the morning.

lfoppiano commented 4 years ago

@lucaspada894 I'm closing this issue. If more work is needed, feel free to reopen it.