When we run the scraper command, it automatically creates new jobs from the jobs returned by the scrapers. Sometimes a job already exists in the DB. How do we prevent it from being inserted again?
One idea: store the job links in the DB as well, then query by link before inserting a job into the DB.
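A minimal sketch of the link-based check, assuming a SQLite database and a hypothetical `jobs` table with `title` and `link` columns (the real schema, database, and language of the scraper may differ). The function name `insert_new_jobs` and the dict shape of a scraped job are also assumptions for illustration. The `UNIQUE` constraint on `link` is an extra safety net on top of the query, so a duplicate cannot slip in even if two scraper runs overlap.

```python
import sqlite3


def create_schema(conn):
    # Hypothetical table; UNIQUE on link makes the DB itself reject duplicates.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS jobs (
            id INTEGER PRIMARY KEY,
            title TEXT NOT NULL,
            link TEXT NOT NULL UNIQUE
        )
        """
    )


def insert_new_jobs(conn, scraped_jobs):
    """Insert only jobs whose link is not stored yet; return how many were inserted."""
    inserted = 0
    for job in scraped_jobs:
        # Query by link before inserting, as proposed above.
        exists = conn.execute(
            "SELECT 1 FROM jobs WHERE link = ?", (job["link"],)
        ).fetchone()
        if exists:
            continue
        conn.execute(
            "INSERT INTO jobs (title, link) VALUES (?, ?)",
            (job["title"], job["link"]),
        )
        inserted += 1
    conn.commit()
    return inserted
```

Running the scraper twice with the same results would then insert each job only once: the second call to `insert_new_jobs` finds every link already stored and skips it.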