Running patent metadata retriever and cached webcrawler jobs with empty cache resulted in many faults with org.apache.http.conn.ConnectionPoolTimeoutException. This exception is thrown when the waiting time for a connection from connection pool exceeds the timeout. In HttpClientUtils we create CloseableHttpClient and use the same value for connect timeout and connection request timeout. We could check if we can increase or maybe disable the timeout for connection from the pool and bring the number of faults down.
We should check separately for patent metadata retrieval and http content retrieval.
Running patent metadata retriever and cached webcrawler jobs with empty cache resulted in many faults with
org.apache.http.conn.ConnectionPoolTimeoutException
. This exception is thrown when the waiting time for a connection from connection pool exceeds the timeout. InHttpClientUtils
we createCloseableHttpClient
and use the same value for connect timeout and connection request timeout. We could check if we can increase or maybe disable the timeout for connection from the pool and bring the number of faults down.We should check separately for patent metadata retrieval and http content retrieval.