Python wrapper for Wikipedia
-
Updated
Dec 2, 2024 - Python
Python wrapper for Wikipedia
Web scraping, data parsing and automation tutorials. Suited for both beginners and intermediate/advanced programmers.
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
Java tool to get wikipedia data
A 🤖 which provides features from Wikipedia like summary, title searches, location API etc.
Graphically display the connections between different Wikipedia articles
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
SpaceX Launches 🚀 and Starlink Satellites 🛰
Collects a multimodal dataset of Wikipedia articles and their images
Music tagger with GUI that parses wikipedia for information. Can also download album art and lyrics.
Just Refs - extract just the references and related topics from any page on the English Wikipedia
This project collects Wikipedia articles from a search term entered by the user and formats the data into a .docx (Word Document) document with images related to each section of the collected article.
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
A tutorial and code samples of web scraping with PHP
Wikipedia Article Summarizer a simple Python project based on NLP techniques
Taxonomic trees (cladograms) from Wikipedia-scraped data.
Wikipedia Entities Lexicon Extractor
Extracts geodata from a wikipedia dump
Linked Data Knowledge Base Population (KBP) framework built on top of Snorkel. The default configuration uses Wikipedia as text corpus and DBpedia as target.
Add a description, image, and links to the wikipedia-scraper topic page so that developers can more easily learn about it.
To associate your repository with the wikipedia-scraper topic, visit your repo's landing page and select "manage topics."