Web Scraping with Puppeteer and Cheerio

Project-WebMobi

This Node.js script scrapes event details from a website using Cheerio and Puppeteer. It extracts event information such as name, date, location, description, and organizer information from a predefined HTML content. Additionally, it enriches the data by performing Google searches based on event names and retrieves additional information from the search results.

Getting Started Follow these instructions to set up and run the script: Prerequisites Node.js installed on your machine npm or yarn package manager

Running the project

Clone the repository or download the script files.
Navigate to the project directory.
Install dependencies using npm or yarn:

Navigate to the project directory.

Install dependencies using npm or yarn: npm install or yarn install

Usage Ensure that you have the HTML content of the events ready. You can either hardcode it into the script or retrieve it dynamically from a source.

Update the htmlContent variable in the script with your HTML content containing event details.

Run the script using Node.js: node eventbrite.js

The script will extract event details, perform Google searches for each event, and enrich the data with additional information.

The final enriched data will be saved in a structured format such as JSON or CSV in the event.csv file.

Built With

This website was built using the following technologies:

Cheerio,
JSON2CSV,
Puppeteer,
Request,
Request-promise

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Web Scraping with Puppeteer and Cheerio

Files

README.md

Latest commit

History

README.md

File metadata and controls

Web Scraping with Puppeteer and Cheerio