- Write access to the `Migrants and the State` Shared Drive
- Familiarity with the Project glossary
- Order group has been processed & files have been received
- Find the spreadsheet `ORDER GROUPS – INGEST TRACKING` within `DATA` in the shared Drive. On the first tab, `ORDER GROUPS`, create a new row for the order group (e.g., `OG-YEAR-EXAMPLE`) and enter data for the first 3 columns.
- Copy the second tab, `OG-TEMPLATE`, as a new tab named for the new order group (e.g., `OG-YEAR-EXAMPLE`). Then add each a-file as a row and fill out all the columns except `page_count`.
- Back in the `DATA` folder of the `Migrants and the State` Shared Drive, create a new folder named for the order group (e.g., `OG-YEAR-EXAMPLE`).
- Within the new folder, create a README document for notes about the order and a subfolder called `Data`. Then upload all of the PDFs for the order into `Data`, ensuring that each PDF has the a-number in the file name.
- Go back to the `ORDER GROUPS` tab in `ORDER GROUPS – INGEST TRACKING` and add the `GDRIVE FILE COUNT` and `GDRIVE LINK` to the `Data` folder of PDFs for the order group.
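
Before uploading, it can help to confirm that every PDF really has an a-number in its file name. The following is a minimal sketch, not part of the project tooling: the helper name `pdfs_missing_anum` is hypothetical, and the pattern assumes an a-number is the letter `A` followed by 7 to 9 digits, so adjust it if the order group uses a different convention.

```ruby
# Hypothetical helper: not part of the og-template repository.
# ASSUMPTION: an a-number is "A" followed by 7 to 9 digits (case-insensitive).
ANUM_PATTERN = /A\d{7,9}/i

# Returns the sorted names of the PDFs directly inside `dir`
# whose file name does not contain an a-number.
def pdfs_missing_anum(dir)
  Dir.children(dir)
     .select { |name| File.extname(name).casecmp?(".pdf") }
     .reject { |name| name.match?(ANUM_PATTERN) }
     .sort
end

# Example call (the path is a placeholder for a local copy of the PDFs):
# puts pdfs_missing_anum("path/to/local/pdfs")
```

An empty result means every PDF in the folder matched the assumed pattern.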
- Connect to the Research Workspace (RW) mount (See: instructions)
- Create a folder named after the order group (e.g., `OG-YEAR-EXAMPLE`) in the RW space and create a subfolder in it named `pdfs`.
- Download the order group PDFs from Google Drive and put them in `pdfs`. Stay connected to the RW mount for the next steps.
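
Once the PDFs are on RW, a quick count against the `GDRIVE FILE COUNT` value recorded in the tracking sheet can catch an incomplete download. This is a rough sketch under stated assumptions: the helper names are hypothetical, and the example path simply combines the RW example path used in this document with the `pdfs` subfolder.

```ruby
# Hypothetical helpers: not part of the og-template repository.

# Counts the PDFs directly inside `dir` (extension match is case-insensitive).
def pdf_count(dir)
  Dir.children(dir).count { |name| File.extname(name).casecmp?(".pdf") }
end

# True when `dir` holds exactly the number of PDFs recorded in the sheet.
def pdf_count_matches?(dir, expected_count)
  pdf_count(dir) == expected_count
end

# Example call (the count is a placeholder; use the GDRIVE FILE COUNT value):
# pdf_count_matches?("/Volumes/migrants_state/OG-YEAR-EXAMPLE/pdfs", 42)
```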
- Go to migrants-and-the-state/og-template and click "Use this template" to create a new repository named for your order group (all lowercase) within the `migrants-and-the-state` GitHub org.
- Clone the new repo to your local machine, `cd` into it, and open it in your editor (e.g., VS Code).
- Change the name & description in the README.
- Update the `config.yml` file with the order name as the `label` and the `source_dir` as the path to a `jpgs` folder (which will get created later) within the order group on RW (e.g., `/Volumes/migrants_state/OG-YEAR-EXAMPLE/jpgs`).
- Create a `.env` file in the project root with AWS secrets/credentials (ask Marii).
- Download the catalog sheet you made within `ORDER GROUPS – INGEST TRACKING` for the a-files as a CSV. Rename it to `records.csv` and put it within the `src` folder in the project root.
- Install/update ruby & gems: `asdf install ruby && bundle install`
- Run the task that writes the list of a-numbers to a txt file on RW: `bundle exec rake pdfs:anum_txt`
- Run the task that adds the page count to the CSV: `bundle exec rake pdfs:page_count_csv`
- Replace the Google Sheet in Drive with the CSV you just created so that it includes the page count info.
- Split the PDFs to JPGs on RW (creates the aforementioned `/Volumes/migrants_state/OG-YEAR-EXAMPLE/jpgs`): `bundle exec rake pdfs:split_jpgs`
- Use aperitiiif to build pyramidal tifs, IIIF json, and site HTML: `bundle exec aperitiiif batch build`
- Upload tifs to AWS S3: `bundle exec rake s3:push:tifs`
- Upload json to AWS S3: `bundle exec rake s3:push:json`
- Commit and push the repo changes (including the built HTML) to trigger a deploy to the GitHub Pages site.
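
The rake and aperitiiif commands above are meant to run in the listed order, so a small wrapper that stops at the first failure can save some babysitting. The sketch below is an illustration only: `BUILD_STEPS` and `run_steps` are hypothetical names, not part of the template repo, and the manual step of replacing the Google Sheet with the updated CSV is not covered and still has to be done by hand.

```ruby
# Hypothetical wrapper: not part of the og-template repository.
# The commands are copied from the steps above, in the order they are listed.
BUILD_STEPS = [
  "bundle exec rake pdfs:anum_txt",
  "bundle exec rake pdfs:page_count_csv",
  "bundle exec rake pdfs:split_jpgs",
  "bundle exec aperitiiif batch build",
  "bundle exec rake s3:push:tifs",
  "bundle exec rake s3:push:json"
].freeze

# Runs each command in order and raises at the first one that fails.
# `runner` is injectable so the sequence can be dry-run; by default it
# shells out with Kernel#system, which returns true only on exit status 0.
# Returns the list of commands that completed.
def run_steps(steps, runner: ->(cmd) { system(cmd) })
  steps.each_with_object([]) do |cmd, done|
    puts "==> #{cmd}"
    raise "Step failed: #{cmd}" unless runner.call(cmd)
    done << cmd
  end
end

# Dry run that only prints the commands:
# run_steps(BUILD_STEPS, runner: ->(_cmd) { true })
# Real run, from the project root while connected to the RW mount:
# run_steps(BUILD_STEPS)
```

Stopping at the first failure avoids, for example, pushing to S3 when the build step did not finish.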