A Node.js web scraper that extracts HTML, CSS, JavaScript, text, images, and video content from public web pages. It provides a clean, interactive UI for viewing the scraped content in the browser and downloading it directly.
- Extract Content by Type: Choose between HTML, JavaScript, text, images, and video content types.
- Preview and Download: View scraped content in the browser and download it as files.
- Retry Mechanism: Automatically retries requests for improved stability.
- Clean Interface: Simple, user-friendly UI.
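The Retry Mechanism feature can be illustrated with a minimal sketch. The helper names `withRetry` and `fetchPage`, the retry count, and the exponential backoff delays are assumptions made for illustration; the actual logic in scraper.js may differ. The sketch relies on the global `fetch` available in Node.js 18 and later.

```javascript
// Hypothetical helper (not taken from scraper.js): runs an async operation
// and retries it with exponential backoff when it throws.
async function withRetry(operation, { retries = 3, baseDelayMs = 500 } = {}) {
  let lastError;
  for (let attempt = 0; attempt <= retries; attempt++) {
    try {
      return await operation(attempt);
    } catch (err) {
      lastError = err;
      if (attempt === retries) break; // no attempts left
      // Wait longer after each failure: baseDelayMs, then 2x, then 4x, ...
      const delay = baseDelayMs * 2 ** attempt;
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
  throw lastError;
}

// Hypothetical usage: fetch a page as text, treating non-2xx responses
// as failures so that they are retried as well.
async function fetchPage(url) {
  return withRetry(async () => {
    const response = await fetch(url);
    if (!response.ok) {
      throw new Error(`Request failed with status ${response.status}`);
    }
    return response.text();
  });
}
```

With the defaults assumed here, a request is attempted up to four times (one initial try plus three retries) before the last error is surfaced to the caller.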
- Open a Terminal: Change into the project directory: cd web-scraper
- Install Dependencies: Install the necessary packages by running: npm install
- Run the Server: Start the application server: npm start
- Open the Application: Open your browser and go to: http://localhost:3000
Usage
- Enter a URL: Input the URL of the webpage you want to scrape.
- Choose Content Type: Select the type of content you wish to extract (HTML, JavaScript, text, images, or video).
- View and Download: Scraped content will display in the interface with a download option for each type.
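To make the content-type choice concrete, here is a rough sketch of how one type might be pulled out of a fetched page. The function name `extractContent` and the type keys are hypothetical, and the regular expressions are a deliberately simple stand-in for a real HTML parser; the project's scraper.js may work quite differently.

```javascript
// Hypothetical extractor (not taken from scraper.js): returns the requested
// content type from an HTML string using simple regular expressions.
function extractContent(html, type) {
  // Collect the first capture group of every match of a global pattern.
  const collect = (pattern) => Array.from(html.matchAll(pattern), (m) => m[1]);

  switch (type) {
    case 'html':
      return html;
    case 'javascript':
      // Inline script bodies only; external scripts need a separate request.
      return collect(/<script\b[^>]*>([\s\S]*?)<\/script>/gi)
        .filter((body) => body.trim() !== '');
    case 'images':
      return collect(/<img\b[^>]*\bsrc=["']([^"']+)["']/gi);
    case 'video':
      // <source> also appears inside <audio>, so results may need filtering.
      return collect(/<(?:video|source)\b[^>]*\bsrc=["']([^"']+)["']/gi);
    case 'text':
      return html
        .replace(/<(script|style)\b[^>]*>[\s\S]*?<\/\1>/gi, ' ') // drop code and styles
        .replace(/<[^>]+>/g, ' ') // drop remaining tags
        .replace(/\s+/g, ' ') // collapse whitespace
        .trim();
    default:
      throw new Error(`Unsupported content type: ${type}`);
  }
}
```

Regex-based extraction is brittle on malformed or unusual markup, so a dedicated HTML parser is usually the safer choice outside of a quick sketch like this one.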
# File Structure
web-scraper/
├── public/
│   └── index.html
├── scraper.js
├── package.json
└── README.md