Dump job listings, to a CSV, scraped from searX instances.
Go to file
mharb da835d4fb0 Check-in Progress on V2
Saving my progress on job_scraper_v2. This overhaul will separate cURL requests and HTML parsing into distinct stages. Buyer beware: it is far from complete.

The flow is: requesting HTML documents first, closing all network connections, parsing HTML, and finally saving to disk.
2023-10-27 14:16:49 +00:00
.gitignore Initial commit 2023-07-31 13:55:55 +00:00
job_scraper.py Check-in Progress on V2 2023-10-27 14:16:49 +00:00
LICENSE Add a project license. 2023-07-31 14:57:35 +00:00
README.md Touch-up Documentation and Script User Agent 2023-10-02 20:32:19 -04:00

searX-py-job-scraper (WIP NOT COMPLETE)

Dump job listings, to a CSV, scraped from searX instances. A convenient list of all active public searX hosts.