July 2024

drodol • 1 Jul 2024 •
Since things have been going well on the code front, I will set the very ambitious goal of having jobsinenglish.dk fully ported into a Django app, and fully deployed.
I made great progress today regarding the automation of a lot of manual tasks.
Here is a short summary:
- The scraper now only saves jobs whose job description is in English. Jobs that are in Danish are not saved, but all their metadata is saved to a table that will allow me to pull interesting insights/stats about employers and the vacancies they post. Eventually this could produce interesting reports.
- Apart from only saving jobs in English to the db, I removed almost all the manual work I used to do with the scraper. Before, I had to manually discard certain job_links, then sometimes clean up the URLs of the remaining jobs, and then open each job_link and do a CTRL+F search for "danish". If there were no hits, I would change job_visible to true. Now all of this is fully automated.
- I made a ton of nice improvements to the default Django admin interface.
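To give an idea of what the automated steps above look like, here is a minimal sketch. It is not the actual scraper code: the `Job` class, the stopword-based `looks_english` check and the helper names are assumptions made up for illustration, and only `job_link` and `job_visible` come from the real project. Saving the metadata of Danish jobs to the stats table is left out.

```python
from dataclasses import dataclass
from urllib.parse import urlsplit, urlunsplit

# Hypothetical stopword lists, used only for a crude language guess.
DANISH_HINTS = {"og", "er", "vi", "til", "med", "som", "ikke", "det", "af", "på"}
ENGLISH_HINTS = {"and", "the", "we", "to", "with", "as", "not", "is", "of", "you"}


@dataclass
class Job:
    job_link: str
    description: str
    job_visible: bool = False


def clean_url(url: str) -> str:
    """Drop the query string and fragment (tracking parameters and the like)."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def looks_english(text: str) -> bool:
    """Very rough guess: more English stopwords than Danish ones."""
    words = text.lower().split()
    danish = sum(w in DANISH_HINTS for w in words)
    english = sum(w in ENGLISH_HINTS for w in words)
    return english > danish


def process(job_link: str, description: str):
    """Return a Job to save, or None when the posting is not in English."""
    if not looks_english(description):
        return None
    job = Job(clean_url(job_link), description)
    # Automated version of the old CTRL+F check: hide jobs that mention "danish".
    job.job_visible = "danish" not in description.lower()
    return job
```

In a real setup the language check would probably be handled by a proper language-detection library rather than a stopword count; the point here is only the order of the steps.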
Still to be done:
- Port the OpenAI code I wrote for categorising jobs based on my predefined set of categories.
- Port the frontend.
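For the categorisation item, the rough shape is something like the sketch below. This is not the existing OpenAI code: the category names are placeholders, and the model call is passed in as a plain function (`ask_model`, a hypothetical name) so that the example does not assume any particular client library or API signature.

```python
from typing import Callable

# Hypothetical categories; the real predefined set is not shown in this post.
CATEGORIES = ["Engineering", "Marketing", "Sales", "Other"]


def build_prompt(title: str, description: str) -> str:
    """Build a prompt that asks for exactly one of the predefined categories."""
    options = ", ".join(CATEGORIES)
    return (
        f"Classify this job into exactly one of: {options}.\n"
        f"Title: {title}\n"
        f"Description: {description}\n"
        "Reply with the category name only."
    )


def categorise(title: str, description: str, ask_model: Callable[[str], str]) -> str:
    """Ask the model for a category and fall back to "Other" on any unexpected reply."""
    reply = ask_model(build_prompt(title, description)).strip()
    for category in CATEGORIES:
        if reply.lower() == category.lower():
            return category
    return "Other"
```

Checking the reply against the predefined list means a made-up category from the model never ends up in the db.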
I have run out of things to do (to avoid dealing with the frontend) so I guess I will tackle the relatively simple OpenAI port and then work on the frontend.
Comments
I am not really “using” AI to scrape. I am using AI to enhance the scraped data.
I am now scraping from one source, but in the future I will build individual scrapers per ATS, and I could maybe sell API access to those.

That’s actually a pretty useful service

Oh wow, awesome! You’re now using AI to scrape jobs 🥹
Ever thought of making that a standalone product?