Closed bobertb closed 8 years ago
Hi, I am attempting to use this committer, and am getting the following error.
INFO [HttpCrawler] 1 start URLs identified. INFO [CrawlerEventManager] CRAWLER_STARTED INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Crawling references... INFO [CrawlerEventManager] DOCUMENT_FETCHED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/index.cfm?action=per INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/shim.gif INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/cityseal.gif INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/apllogo.jpg INFO [CrawlerEventManager] URLS_EXTRACTED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] DOCUMENT_IMPORTED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] DOCUMENT_COMMITTED_ADD: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [AbstractCrawler] Libro.coacd.org Test Page 1: 100% completed (1 processed/1 total) INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Deleting orphan references (if any)... INFO [CrawlerEventManager] DOCUMENT_COMMITTED_REMOVE: http://libro.coacd.org/index.cfm INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Deleted 1 orphan references... INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Crawler finishing: committing documents. INFO [AbstractFileQueueCommitter] Committing 2 files INFO [SolrCommitter] Sending 2 documents to Solr for update/deletion. ERROR [AbstractBatchCommitter] Could not commit batched operations. com.norconex.committer.core.CommitterException: Cannot index document batch to Solr.
Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/#/libro: Expected mime type application/octet-stream but got text/html.
So the post fails because of mime type issue ?
Thanks for any insight, Bob
Nevermind, it was the # in the solr path. I'm good thanks..
Hi, I am attempting to use this committer, and am getting the following error.
INFO [HttpCrawler] 1 start URLs identified. INFO [CrawlerEventManager] CRAWLER_STARTED INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Crawling references... INFO [CrawlerEventManager] DOCUMENT_FETCHED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/index.cfm?action=per INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/shim.gif INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/cityseal.gif INFO [CrawlerEventManager] REJECTED_TOO_DEEP: http://libro.coacd.org/divisions/acao/policies/images/apllogo.jpg INFO [CrawlerEventManager] URLS_EXTRACTED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] DOCUMENT_IMPORTED: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [CrawlerEventManager] DOCUMENT_COMMITTED_ADD: http://libro.coacd.org/divisions/acao/policies/apl_telework.htm INFO [AbstractCrawler] Libro.coacd.org Test Page 1: 100% completed (1 processed/1 total) INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Deleting orphan references (if any)... INFO [CrawlerEventManager] DOCUMENT_COMMITTED_REMOVE: http://libro.coacd.org/index.cfm INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Deleted 1 orphan references... INFO [AbstractCrawler] Libro.coacd.org Test Page 1: Crawler finishing: committing documents. INFO [AbstractFileQueueCommitter] Committing 2 files INFO [SolrCommitter] Sending 2 documents to Solr for update/deletion. ERROR [AbstractBatchCommitter] Could not commit batched operations. com.norconex.committer.core.CommitterException: Cannot index document batch to Solr.
Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/#/libro: Expected mime type application/octet-stream but got text/html.
So the post fails because of mime type issue ?
Thanks for any insight, Bob