B-Open / jobbuzz

Brunei job search database and alert notification
https://jobbuzz.org
MIT License
1 stars 1 forks source link

Removal of inactive job listings #45

Open dsychin opened 2 years ago

dsychin commented 2 years ago

Currently implementation adds new job listings but does not remove old ones.

2 ways to do this.

  1. If the job page is no longer accessible when it is no longer valid, then just checking the page regularly and marking it appropriately should be fine.
  2. During the scraper job, scrape all job listings and compare with all entries in the database and mark missing ones as inactive.
syahnur197 commented 2 years ago

to implement number 2, maybe I can do it like this

  1. when running cmd/jobbuzz-scraper.go, mark all jobs as inactive
  2. Fetch all jobs
  3. Loop all jobs, for each job, if exist, mark back as active, else insert
syahnur197 commented 2 years ago

@dsychin what do you think of this approach? Is there any better approach?

dsychin commented 2 years ago

@syahnur197 I am leaning towards solution 1 at the moment. It has a few advantages:

dsychin commented 2 years ago

Implement expiry date for job listings.

Use "last date to apply" for jobcentre listings, and an arbitrary time period for bruneida.