This dataset and notebook can be also found on Kaggle dataset exploratory data anaylsis
Since we are here, we could agree that data science is a cool profession.
When "Data Science is the sexist job of the 21 century" by Harvard Business Review took a storm on the internet on 2012, Hong Kong was still starting out on the big data trend. So, 10 years has past, what is the status quo of the data industry? To gain insights to answer this question, I have scrapped job postings from one of popular job posting platform in Hong Kong and perform some data analysis.
Download and install ChromDriver ChromeDriver
python libraries:
A python script was written here to scrape job postings under "Information Technology\Data Scientist" category. The script was implemented in a way that for each job posting was read, an observation ( row of data) is added to the csv file. Preliminary data cleaning was incoporated to the script to make the dataset easier for downstream processing.
column | null placeholder | non-null example |
---|---|---|
title | (not applicable) | Data Analyst - Top ranked Virtual Bank |
salary | "salary" | HK$35,000 - HK$55,000 /month |
company | "company" | CGP |
posted | (not applicable) | 2022-11-18 |
District | "district" | Shatin district |
job description | (not applicable) | Job Description: Research, collate, obtain and analyze data ... |
Career level | empty | Entry Level |
Years of Experience | empty | N/A |
Company Website | empty | www.companyname.com |
Qualification | empty | Degree |
Job Type | empty | Full Time, Permanent |
Job Functions | empty | Banking / Finance, Others, Information Technology (IT), Others, Data Scientist |
url | empty | https://hk.jobsdb.com/hk/en/job/data-analyst-data-governance-... |
can be found inside the \analysis\data-scientist-job-postings-in-hong-kong-eda.ipynb