ScaleUnlimited / flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons
Apache License 2.0
51 stars, 18 forks

Can't resolve dependency com.github.crawler-commons:http-fetcher:0.1-SNAPSHOT #147

Closed: IvanBiv closed this issue 6 years ago

IvanBiv commented 6 years ago

I tried to build the project in IntelliJ IDEA, but the dependency com.github.crawler-commons:http-fetcher:0.1-SNAPSHOT can't be resolved, although the other dependencies resolve fine. I'm on the master branch at commit "Refactored CrawlTool to pull out private methods to a new class Crawl… (#144)". What is the reason?

vmagotra commented 6 years ago

Hi Ivan, did you build and install crawler-commons http-fetcher locally in your Maven repo, as per the instructions at https://github.com/ScaleUnlimited/flink-crawler/wiki/Build-Instructions ?

IvanBiv commented 6 years ago

@vmagotra thanks, I hadn't seen that. I have now gone through the steps one by one and the crawler built. My recommendation:

git clone https://github.com/crawler-commons/http-fetcher.git
cd http-fetcher
mvn install 
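
To confirm that the install step above actually put the artifact where the flink-crawler build will look for it, one can check the local Maven repository. This is a minimal sketch, not part of the project's instructions: `m2_path` is a hypothetical helper name, and it assumes the default local repository location (`~/.m2/repository`), which a custom `settings.xml` can change.

```shell
#!/bin/sh
# Hypothetical helper: map Maven coordinates (groupId:artifactId:version)
# to the directory Maven uses for them inside a local repository.
# The second argument is the repository root; the default assumes that
# Maven's standard location has not been overridden in settings.xml.
m2_path() {
    coords=$1
    repo=${2:-"$HOME/.m2/repository"}
    group=${coords%%:*}        # text before the first colon
    rest=${coords#*:}          # artifactId:version
    artifact=${rest%%:*}
    version=${rest#*:}
    # Dots in the groupId become directory separators.
    group_dir=$(printf '%s' "$group" | tr '.' '/')
    printf '%s/%s/%s/%s\n' "$repo" "$group_dir" "$artifact" "$version"
}

dir=$(m2_path "com.github.crawler-commons:http-fetcher:0.1-SNAPSHOT")
if [ -d "$dir" ]; then
    echo "installed: $dir"
else
    echo "missing: $dir (run 'mvn install' in the http-fetcher checkout)"
fi
```

If the directory is reported as missing after `mvn install`, the local repository is probably configured somewhere else; passing that path as the second argument to `m2_path` checks it instead.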

@vmagotra, we are building a service that includes a web crawler. I have tried several tools and am not very satisfied with them, and I see your project is growing. Where can I ask you some questions? Should I create a new issue?

Schmed commented 6 years ago

I added the cd http-fetcher command you suggested to the instructions. OK to close this issue now?

kkrugler commented 6 years ago

Hi @IvanBiv - if you ask the question on Stack Overflow, we can provide help via that site. I don't know if you have enough reputation points there to create a new "flink-crawler" tag; if not, just add the "flink" tag and I'll see it.

IvanBiv commented 6 years ago

@kkrugler ok, thanks.