wikinews-scraper

Wikinews article-content scraper written in Python.

The code provides to developer easy way for scraping the content of Wikinews.

Components that can be easly access are:

headline,
body,
date,
photos,
entities,
related-articles,
source-articles and etc.

Dependencies

The folowing packages are required:

If you do not have them, for more info how to install them according your OS please find more information on the internet.

Starting the demo

The demo.py is the first point that need to be addressed.

Script first is scraping (from the main page) all of the article's links. Second scraping-stage is getting the elements of the whole article (given throught article's link).

Linux:

python3 demo.py

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
article		article
common		common
console		console
img		img
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
main_page_news.py		main_page_news.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wikinews-scraper

Dependencies

Starting the demo

About

Releases

Packages

Languages

License

nix-/wikinews-scraper

Folders and files

Latest commit

History

Repository files navigation

wikinews-scraper

Dependencies

Starting the demo

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages