IPL-20-21-web-scraping

using nodejs

About

This is a web scraping project which scraps the cricinfo website to get information regarding Indian Premier League 2020/21. The following activities are carried out when we run this project-

The ipl directory is created inside the current directory.
Separate directories are created for each team inside the ipl directory
Inside each team's directory, separate excel sheet is created for each player.
Each row in the excel sheet represents different match played by the player and details like team name, opponent team name, player name, runs, balls, fours, sixes, sr, date of match, venue of match and the result of the match is shown.

How to run this project

Clone this repository in your local environment.
Run command npm install to install all the required packages.
Run command node main.js to get the required information.

Insights-

Cheerio module used here for web scraping.
Disadvantage of cheerio module: it only parses and extracts initial loaded html.
HTML seggregation is done using another file (table.html) to make information extraction easier.
Multiple page scraping is done here.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
main.js		main.js
matches.js		matches.js
package-lock.json		package-lock.json
package.json		package.json
scorecard.js		scorecard.js
table.html		table.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IPL-20-21-web-scraping

using nodejs

About

How to run this project

Insights-

About

Packages

Languages

swatijha-2906/IPL-20-21-web-scraping

Folders and files

Latest commit

History

Repository files navigation

IPL-20-21-web-scraping

using nodejs

About

How to run this project

Insights-

About

Topics

Resources

Stars

Watchers

Forks

Packages 0

Languages

Packages