WebScraping Project

Description

Employ the technique of web scraping to gain comprehensive insights into the diverse Data Analyst job listings available on JobStreet, enabling you to explore a wide range of promising career prospects.

Skills Applied: Python

Libraries Used: csv, requests, BeautifulSoup

Implementation

To begin, I examine the source code of the webpage I intend to scrape for relevant information.
Next, I aim to fetch webpage content from a URL using the requests library. It then employs BeautifulSoup to parse the HTML and create a structured representation of the webpage. Specifically, it extracts all elements with the specified class names and saves them in the job_listings variable for later use.
Subsequently, I would print the extracted information from each job listing as per my requirements.
Lastly, I will store all the collected data in a CSV file.

Breakdown

PYTHON

1. Importing Libraries

I start the project by importing the necessary libraries (csv, requests, BeautifulSoup)

2. Extract Raw HTML

Next, I fetch the HTML content from a Jobstreet webpage for Data Analyst jobs in Singapore.
It then extracts the job listings by finding specific elements with the provided class names.
The length of the job_listings variable is obtained to determine the number of job postings on the page.

3. Scrape Information

Subsequently, I used a For Loop to iterate through each job listing in the job_listings variable. For each listing, it extracts the following using specific HTML elements and class names:
- Title
- Company Name
- Salary
- Description
- Date Posted
- How long ago it was posted

4. Print Scraped Information

I print out all the scraped information to make sure everything is scraped correctly.

5. Save data to CSV

Finally, I store all the extracted data into a CSV file for convenient reference and potential future analysis.

Important Note:

This scraping is only for one page. For multiple pages, please refer to this code.