platonai / exotic-amazon

A complete solution to crawl amazon at scale completely and accurately.

The crawler only crawled 2k products, not all of Amazon's products. #10

Open wfh1300 opened 1 year ago

wfh1300 commented 1 year ago

The download count stops changing at about 2k products, but Amazon has hundreds of millions of products.

platonai commented 1 year ago

Of course, our program does not crawl the entire amazon.com by default; that would be a very large project. To start a real-world data project, you need to make some simple configuration changes. Each user usually has their own business logic, which requires small code modifications to meet their needs.

wfh1300 commented 1 year ago

Could you please tell me which file I should change?

platonai commented 1 year ago

Update your code, repackage and run:

ENV=prod java -jar target/exotic-amazon-0.0.2-SNAPSHOT.jar

platonai commented 1 year ago

Essentially, an environment variable named ENV must be set to prod for the program to run in production mode.
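
For readers unfamiliar with the `ENV=prod java -jar ...` form: the prefix is an ordinary shell environment variable assignment (name `ENV`, value `prod`) that applies to the process started on the same line. The sketch below illustrates this with a throwaway `sh -c` child process standing in for the jar; that stand-in is an assumption made for illustration only, and how exotic-amazon itself reads the variable is not shown in this thread.

```shell
# Inline assignment: ENV is passed only to the single command that follows it.
# A child shell stands in for "java -jar target/exotic-amazon-0.0.2-SNAPSHOT.jar".
ENV=prod sh -c 'echo "ENV=${ENV:-unset}"'

# Alternative: export the variable once, so that every later command started
# from this shell session inherits it.
export ENV=prod
sh -c 'echo "ENV=${ENV:-unset}"'
```

Both commands print `ENV=prod`. The inline form is usually the safer choice, because it does not leave production mode switched on for unrelated commands run later in the same session.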