headrun / SWIFT


Ajio.com crawling #51

Open SreenivasDega opened 4 years ago

SreenivasDega commented 4 years ago

To crawl fashion items in Ajio

nigitha1995 commented 4 years ago

Screenshot from 2020-05-11 06-22-03

Found the API that returns the product details. So far I have only looked at men's T-shirts. The API URL is: https://www.ajio.com/api/category/830216014?fields=SITE&currentPage=1&pageSize=45&format=json&query=%3Arelevance&sortBy=relevance&gridColumns=3&advfilter=true
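A minimal sketch of how the category URL above could be built for any page, in case it helps. The function name `build_category_url` and the constant `BASE` are hypothetical. The query parameters are copied from the URL above, and whether they are all required is an assumption that has not been checked.

```python
from urllib.parse import urlencode

# Base path taken from the URL in the comment above.
BASE = "https://www.ajio.com/api/category/"


def build_category_url(category_id, page=1, page_size=45):
    """Build a category listing URL (hypothetical helper).

    The parameter names and values mirror the observed URL; nothing
    here is confirmed beyond that single example.
    """
    params = {
        "fields": "SITE",
        "currentPage": page,
        "pageSize": page_size,
        "format": "json",
        "query": ":relevance",  # urlencode turns ':' into %3A
        "sortBy": "relevance",
        "gridColumns": 3,
        "advfilter": "true",
    }
    return f"{BASE}{category_id}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_category_url("830216014", page=1))
```

Calling `build_category_url("830216014", page=1)` reproduces the men's T-shirts URL, and changing `page` gives the following pages of the same category.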

nigitha1995 commented 4 years ago

Writing the crawler to collect the product URLs. The API URL changes for every category, so I am working on handling that. Examples: https://www.ajio.com/api/category/8302?fields=SITE&currentPage=19&pageSize=45&format=json&query=%3Arelevance&sortBy=relevance&gridColumns=3&advfilter=true https://www.ajio.com/api/category/830207008?fields=SITE&currentPage=4&pageSize=45&format=json&query=%3Arelevance&sortBy=relevance&gridColumns=3&advfilter=true
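In the two example URLs only the category ID in the path and the `currentPage` value differ. A rough sketch of handling that with the standard library is below. The helper names `parse_category_url` and `with_category` are hypothetical, and the assumption that every category uses the same query string comes only from these examples.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit


def parse_category_url(url):
    """Return (category_id, current_page) from a category API URL."""
    parts = urlsplit(url)
    # The category ID is the last path segment, e.g. ".../category/8302".
    category_id = parts.path.rstrip("/").rsplit("/", 1)[-1]
    page = int(parse_qs(parts.query)["currentPage"][0])
    return category_id, page


def with_category(url, category_id, page):
    """Reuse an observed URL as a template for another category and page."""
    parts = urlsplit(url)
    path = parts.path.rstrip("/").rsplit("/", 1)[0] + "/" + str(category_id)
    query = parse_qs(parts.query)
    query["currentPage"] = [str(page)]
    return urlunsplit(
        (parts.scheme, parts.netloc, path, urlencode(query, doseq=True), parts.fragment)
    )
```

For example, `parse_category_url` applied to the first URL gives the ID `8302` and page 19, and `with_category(first_url, "830207008", 4)` produces the second URL.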

Preethi266 commented 4 years ago

There are two different APIs for getting the data. The API returns data for pages that carry more product details. Neither gives a rating for the products. I am checking HTML crawling for this. I have observed that the API data is also available in the page source when we request the product URL, so if the API data and the page-source data are the same, I will proceed that way.
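A rough sketch of how the API data and the page-source data could be compared. It assumes the page embeds its data as a JSON object assigned to a JavaScript variable inside a script tag. The variable name `window.__PRELOADED_STATE__` and the field names used in the example are assumptions for illustration and would need to be checked against the real page source.

```python
import json
import re


def extract_embedded_json(html, var_name="window.__PRELOADED_STATE__"):
    """Return the JSON object assigned to var_name in the HTML, or None.

    var_name is an assumed marker, not a confirmed one.
    """
    match = re.search(re.escape(var_name) + r"\s*=\s*", html)
    if match is None:
        return None
    # raw_decode parses one JSON value starting at the given index and
    # ignores whatever follows it (for example a trailing ';').
    obj, _end = json.JSONDecoder().raw_decode(html, match.end())
    return obj


def differing_fields(api_item, page_item, fields):
    """List the fields whose values differ between the two sources."""
    return [f for f in fields if api_item.get(f) != page_item.get(f)]
```

If `differing_fields` returns an empty list for the fields we care about on a sample of products, the page source could be used in place of the API.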

Preethi266 commented 4 years ago

I have collected the URLs from Ajio. I still need to collect the data and test whether any blocking happens when we send a larger number of requests.
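One way the blocking test could be approached is to watch the response status codes and slow down when they suggest throttling. The sketch below is only an illustration: the status codes 403, 429 and 503 are commonly used for blocking or rate limiting in general, but how Ajio actually responds is not known here, and the helper names are hypothetical.

```python
# Status codes that often indicate blocking or rate limiting (assumption).
BLOCK_STATUSES = {403, 429, 503}


def looks_blocked(status_code):
    """Return True when the status code suggests the request was blocked."""
    return status_code in BLOCK_STATUSES


def backoff_delay(attempt, base=1.0, cap=60.0):
    """Seconds to wait before retry number `attempt` (0-based).

    Exponential backoff: base, 2*base, 4*base, ... limited to `cap`.
    """
    return min(cap, base * (2 ** attempt))
```

During a test run, counting how often `looks_blocked` is true at different request rates would show whether and when blocking starts.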

Preethi266 commented 4 years ago

Collected the data and wrote the spider in two parts, browse and terminal. The browse part collects the product URLs and passes them to the terminal part, which fetches the data.
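The browse and terminal split can be sketched roughly as below. This is not the actual spider: it is plain Python with the HTTP call passed in as `fetch_json`, and the JSON keys `products` and `url`, as well as starting at page 1, are assumptions about the listing response.

```python
def browse(fetch_json, url_for_page, max_pages):
    """Browse part: walk the listing pages and yield product URLs."""
    for page in range(1, max_pages + 1):
        listing = fetch_json(url_for_page(page))
        products = listing.get("products", [])  # assumed key
        if not products:
            break  # stop at the first empty page
        for product in products:
            yield product["url"]  # assumed key


def terminal(fetch_json, product_url):
    """Terminal part: fetch the details for a single product URL."""
    return fetch_json(product_url)


def crawl(fetch_json, url_for_page, max_pages):
    """Run browse, then pass every collected URL to terminal."""
    return [
        terminal(fetch_json, url)
        for url in browse(fetch_json, url_for_page, max_pages)
    ]
```

Keeping `fetch_json` as a parameter means the same two parts can be exercised with canned responses before any real requests are sent.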

Preethi266 commented 4 years ago

The API returns more product details. I had previously proceeded with HTML crawling, but some product details were missing there, so I am now writing the crawler against the API data to get the additional product details.

Preethi266 commented 4 years ago

Screenshot from 2020-05-26 15-26-35