July 2024

drodol  •  1 Jul 2024   •    
Screenshot

Since things have been going well on the code front, I will set the very ambitious goal of having jobsinenglish.dk fully ported into a Django app, and fully deployed.

I made great progress today regarding the automation of a lot of manual tasks.

Here is a short summary:

  1. The scraper now only saves jobs whose job description is in English. Jobs that are in Danish are not saved, but all their meta data is saved to a table that will allow me to pull interesting insights/stats from all employers and the vacancies they post. Eventually this could produce interesting reports.
  2. Apart from only saving jobs in English to the db, I reduced almost all the manual work I was doing with the scraper in the past. I had to manually discard certain job_links, after discarding some, I had to sometimes clean up the URLs for some jobs, after that, I had to open each job_link and do a CTRL+F and search for "danish". If no hits, then I would change the job_visible to true. Now, all of this is fully automated.
  3. I made a ton of nice improvements to the default Django admin interface.

Still do be done:

  1. Port the OpenAI code I wrote for categorising jobs based on my predefined set of categories.
  2. Port the frontend.

I have run out of things to do (to avoid dealing with the frontend) so I guess I will tackle the relatively simple OpenAI port and then work on the frontend.

Comments

Oh wow, awesome! You’re now using AI to scrape jobs 🥹

Ever thought of making that a standalone product?

jasonleow  •  2 Jul 2024, 1:13 am

I am not really “using” AI to scrape. I am using AI to enhance the scraped data.

I am now scraping from one source, but in the future I will build individual scrapers per ATS, and I could maybe sell API access to those.

drodol  •  2 Jul 2024, 6:26 am

That’s actually pretty useful service

jasonleow  •  3 Jul 2024, 2:30 am

Discover more

Sourced from other writers across Lifelog

Ooops we couldn't find any related post...