Amazon Order History Web Scraper

Uses Selenium to simulate login and going through all the users orders. Saves the received data in a json file for later evaluation.

Currently only works for the german version of Amazon (amazon.de). For amazon.com users there is already a built in feature to export your data to a csv file.

Install

clone the repo https://github.com/MaX-Lo/Amazon-Order-History.git
install requirements pip install -r requirements.txt
Make sure your Geckodriver is installed and on your PATH variable. For convenience there is a bash script in the project root dir for that. This script downloads the latest version of geckodriver, makes it executable and puts in the /usr/share/bin which is already in the PATH by default. It need sudo permission to do so though. For the Skript run:

chmod +x geckodriver_installer.sh

./geckodriver.sh.

Usage

If you are using a device where you've never logged in before, Amazon might require a confirmation code from an email it has send to you. Therefore it can be necessary to log into Amazon with your browser before using that script on a new device. After logging in the first time there shouldn't be anymore email confirmations necessary. The same applies if you have two-factor authentication activated.

Scraping

python -m scraping scrape --email [email protected] --password 123

If you don't want your password appearing in the bash history or on the terminal output, you can create a pw.txt in the project root directory (Amazon-Order-History), which contains your password and don't use the password parameter.

In case of import errors pay attention to start the script from the main folder (Scraping) and not from inside Scraping/scraping

Evaluation

python -m scraping dash starts a flask server (should be under http://127.0.0.1:8050/)

Help

There are some optional parameters available, python -m scraping --help shows a description for each of them.

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
scraping		scraping
.gitignore		.gitignore
.pylintrc		.pylintrc
README.md		README.md
geckodriver_installer.sh		geckodriver_installer.sh
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon Order History Web Scraper

Install

Usage

Scraping

Evaluation

Help

About

Releases

Packages

Contributors 2

Languages

MaX-Lo/Amazon-Order-History

Folders and files

Latest commit

History

Repository files navigation

Amazon Order History Web Scraper

Install

Usage

Scraping

Evaluation

Help

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages