Closed jetnet closed 8 years ago
Thank your for your interest with our crawler. It's been used in different Autonomy projects with good success already (DIH and CFS).
Your problem does not appear to be a proxy problem, but a coding issue probably introduced in 2.3.0-SNAPSHOT. Stay tuned for a fix.
I could not replicate in your environment, but I think I managed to fix the issue nonetheless. Please give this new snapshot a try.
hi Pascal,
I can confirm - the issue is gone (tested with norconex-collector-http-2.3.0-20151017.032546-23.zip). Thank you very much for a such quick response / fix! Some time ago, we had an issue with the Autonomy fetch (it could not work with proxy too), and it took 3 or 4 months to get a proper fix from Autonomy (with escalations and working on-site and so on)! I feel the difference already! :)
hi Pascal, first of all, I'd like to thank you and your team for the developing a new free crawler! Since many years we've been trying to find an alternative solution for the Autonomy http connector/fetch. And I must say, the norconex http-collector is very promising crawler! We'll keep an eye on it :)
So, the very first issue I found: the downloading via proxy does not work: Version: norconex-collector-http-2.3.0-SNAPSHOT
and the error:
The robots.txt cannot be downloaded/checked with the same error.
Do you have an idea, what can be wrong here? Is it issue with our proxy server? Autonomy http connector does work with the same proxy :) Thank you!