Skip to content

Latest commit

 

History

History
43 lines (29 loc) · 1.51 KB

README.md

File metadata and controls

43 lines (29 loc) · 1.51 KB

Web Scraping with Puppeteer and Cheerio

Project-WebMobi

This Node.js script scrapes event details from a website using Cheerio and Puppeteer. It extracts event information such as name, date, location, description, and organizer information from a predefined HTML content. Additionally, it enriches the data by performing Google searches based on event names and retrieves additional information from the search results.

Getting Started Follow these instructions to set up and run the script: Prerequisites Node.js installed on your machine npm or yarn package manager

Running the project

  1. Clone the repository or download the script files.
  2. Navigate to the project directory.
  3. Install dependencies using npm or yarn:

Navigate to the project directory.

Install dependencies using npm or yarn: npm install or yarn install

  1. Usage Ensure that you have the HTML content of the events ready. You can either hardcode it into the script or retrieve it dynamically from a source.

Update the htmlContent variable in the script with your HTML content containing event details.

Run the script using Node.js: node eventbrite.js

The script will extract event details, perform Google searches for each event, and enrich the data with additional information.

The final enriched data will be saved in a structured format such as JSON or CSV in the event.csv file.

Built With

This website was built using the following technologies:

  1. Cheerio,
  2. JSON2CSV,
  3. Puppeteer,
  4. Request,
  5. Request-promise