Veerse / go_colly_and_kafka_POC

An example of the Go Colly scrapper library in action
1 stars 0 forks source link
scrapping

reezocar_test

Purpose

This program goes through the pages of a www.autoreflex.com announce listing and extracts for each announce :

Listing used for this POC : http://www.autoreflex.com/137.0.-1.-1.-1.0.999999.1900.999999.-1.99.0.1?fulltext=&geoban=M137R99

Prerequisite

Initialisation

A) Create an Announces Kafka topic :

B) Use following Docker commands to run a MongoDB container :

C) Download Go Colly and MongoDB libraries :

D) Run consumer and producer

Screenshots

Producer :

alt text

Consumer (log message hidden by default) :

alt text

MongoDB announces collection :

alt text

Areas for improvment