diff --git a/README.md b/README.md
index 6ebd3c0..db3227f 100644
--- a/README.md
+++ b/README.md
@@ -65,7 +65,7 @@ Includes:
 - ISBN numbers obtained via Google APIs also included in [data_raw/complete_data.csv](data_raw/complete_data.csv)
 -
 ### Cleaning and Processing
-A description of data cleaning, joining and preprocessing can be found [Here](bookworm/data/Data_Processing_Slides.pdf) and [Here](scripts/README.md). A descripton of the final datasets used in production and testing can be found [Here](bookworm/data/README.md).
+A description of data cleaning, joining and preprocessing can be found [Here](bookworm/data/Data_Slides.pdf) and [Here](scripts/README.md). A descripton of the final datasets used in production and testing can be found [Here](bookworm/data/README.md).
 
 ### Data Limitations
 This project is a proof of concept, executed on a small dataset (~13K books total after data cleaning), with some data sparsity even within those books.
diff --git a/bookworm/data/Data_Processing_Slides.pdf b/bookworm/data/Data_Processing_Slides.pdf
deleted file mode 100644
index c325404..0000000
Binary files a/bookworm/data/Data_Processing_Slides.pdf and /dev/null differ
diff --git a/bookworm/data/Data_Slides.pdf b/bookworm/data/Data_Slides.pdf
new file mode 100644
index 0000000..ac9a5ca
Binary files /dev/null and b/bookworm/data/Data_Slides.pdf differ