The Archival IIIF server indexes and provides IIIF services for digital collections. The server can be configured with a number of different services to index the digital collections and to create derivatives.
The IISH (International Institute of Social History) has created a number of services to index the DIPs created by Archivematica and to give access through IIIF.
The Archival IIIF server is composed of a web application and various service workers. The service workers index collections to ElasticSearch, while the web environment gives access to the index through various IIIF API's. Redis is used to manage the communication between various service workers and provides additional caching capabilities.
The Archival IIIF server comes with several services that can be turned on or off using the env variable
IIIF_SERVER_SERVICES
.
The web
service runs the IIIF web environment.
The worker services wait for new jobs to appear in a queue in Redis. A distinction is made between index workers that indexes data in ElasticSearch and derivative workers that create specific derivatives of collection items. At the moment, the Archival IIIF server identifies five different types of worker services:
- Index worker: Gets a job with the path of a collection to be indexed in
ElasticSearch. Current implementations:
iish-archivematica-index
: A specific IISH implementation of the index worker. Indexes DIPs created by the Archivematica instance of the IISH.
- Text index worker: Gets a job with a collection id and a list of all transcriptions/transliterations to be indexed
in ElasticSearch. Current implementations:
text-index
: Indexes plain text files and ALTO files.
- Metadata index worker: Gets a job with a collection id and/or a OAI identifier and obtains the metadata from an
OAI endpoint to be indexed in ElasticSearch. Current
implementations:
iish-metadata
: Looks for and indexes metadata from the OAI service of the IISH.niod-metadata
: Looks for and indexes metadata from NIOD.
- Waveform derivative worker: Gets a job with a collection id and then builds waveform representations of all audio
files of the collection with the given collection id. Current implementations:
waveform
: Default implementation.
- PDF image derivative worker: Gets a job with a collection id and then builds JPG representations of the first page
of all PDF files of the collection with the given collection id. Current implementations:
pdf-image
: Default implementation.
- Video image derivative worker: Gets a job with a collection id and then extracts a still as JPG and creates a
mosaic of stills with a WebVTT file from all video files of the collection with the given collection id. Current
implementations:
video-image
: Default implementation.
The cron job services run periodically. At the moment, the Archival IIIF server identifies one cron job:
- Metadata update: Checks periodically whether some metadata has to be updated. Current implementations:
iish-metadata-update
: Runs daily to query the OAI service of the IISH for updates and sends those to the metadata indexer.
The standalone services do not wait on a trigger like the workers or cron jobs. At the moment, the Archival IIIF server identifies one standalone service:
- Directory watcher: Watches a directory for any changes (new collections) and sends those to the index worker to be
indexed. Current implementations:
directory-watcher-changes
: When a directory has had no changes for a certain amount of time, it is assumed that it is safe to send the directory to the index worker to be indexed.directory-watcher-file-trigger
: When a directory is updated with a specific file, it triggers the index worker to be indexed.
The libraries are lightweight services with specific implementation details that can run together with the
web
service on the same running instance. At the moment, the Archival IIIF server identifies three different
libraries:
- Access: Determines whether a user has (limited) access to a specific item. Current implementations:
default-access
: All granted; default implementationiish-access
: IISH specific implementation.niod-access
: NIOD specific implementation.
- IIIF metadata: Provides implementation specific IIIF metadata. Current implementations:
default-iiif-metadata
: Default (no IIIF metadata) implementation.iish-iiif-metadata
: IISH specific implementation.
- Authentication texts: Provides implementation specific texts to help the user with authenticating. Current
implementations:
default-auth-texts
: Default implementation.iish-auth-texts
: IISH specific implementation.
See also the IIIF Image API 2.1 and the IIIF Image API 3.0
URL: /iiif/image/[id]
/ /iiif/image/[id]/info.json
Method: GET
IIIF Image API. Returns the JSON-LD description for the image with the given id.
URL: /iiif/image/[id]/[region]/[size]/[rotation]/[quality].[format]
Method: GET
IIIF Image API. Returns the image with the given id for the specified options.
See also the IIIF Presentation API 2.1 and the IIIF Presentation API 3.0
URL: /iiif/presentation/collection/[id]
Method: GET
IIIF Presentation API. Returns the JSON-LD description for the collection with the given id.
URL: /iiif/presentation/[id]/manifest
Method: GET
IIIF Presentation API. Returns the JSON-LD description for the manifest with the given id.
See also the IIIF Authentication API
URL: /iiif/auth/login
Method: GET
IIIF login service. Shows a login screen based on an internal token store.
URL: /iiif/auth/login
Method: POST
Parameters: token
Checks the provided token with the internal token store.
URL: /iiif/auth/cookie
Method: GET
Parameters: token
, redirect
IIIF cookie service. Sets an authorization cookie based on the internal token store and then redirects the user.
URL: /iiif/auth/token
Method: GET
Parameters: token
IIIF token service.
URL: /iiif/auth/logout
Method: GET
IIIF logout service.
URL: /file/[id]
/ /file/[id]/original
/ /file/[id]/access
Method: GET
Provides access to the file with the given id. Explicit access to the original copy or the access copy can be provided.
URL: /file/[id]/<derivative type>
Method: GET
Provides access to the derivative of the given type for the file with the given id.
URL: /pdf/[id]
Method: GET
Parameters: pages
Generates a PDF version of a collection with the given id.
URL: /admin/index
Method: POST
Parameters: path
Creates a job for the index worker to index the collection on the given path. Can only be used by an administrator with a valid access token.
URL: /admin/update_metadata
Method: POST
Parameters: oai_identifier
, root_id
, collection_id
Creates a job for the metadata worker to force-update the metadata for the given OAI identifier and/or root/collection id. Can only be used by an administrator with a valid access token.
URL: /admin/all_metadata_update
Method: POST
Create jobs for the metadata worker to force-update all the metadata. Can only be used by an administrator with a valid access token.
URL: /admin/index_api
Method: POST
Indexes the request body right away. Can only be used by an administrator with a valid access token.
URL: /admin/register_token
Method: POST
Parameters: token
, collection
, from
, to
Registers a token which may give access to a specific collection for a specific period of time. Can only be used by an administrator with a valid access token.
Use the provided Docker Compose or install manually.
- Set up any IIIF image compliant server:
- The Docker Compose comes with support for Loris.
- Or use our image server.
- See for example the provided
docker-compose.yml.example
:- Note: Clone the
web
service definition to create multiple services and use the env variableIIIF_SERVER_SERVICES
to define which services that container should run.
- Note: Clone the
- Set up the configuration (See .env.example for the example configuration)
- Set up the environment variables in the Docker Compose file
- Set up volumes for the following:
universal-viewer-conf
: The Universal Viewer configuration fileloris-conf
: The Loris configuration file (if Loris is used)data
: The volume which contains the collections to be indexed or files to be read, but also allows write access for derivative creationindexes
: The volume for ElasticSearch indexes to be storedredis-persistance
: The volume for Redis storage
- Set up any IIIF image compliant server:
- Use our image server.
- Or set up any IIIF image compliant server.
- Install
- Node.js 14.x LTS
- yarn or npm
- ElasticSearch 7.x.x
- IIIF image server (e.g. Loris)
- (Optional) Redis 5.x (Required for caching, workers and/or IIIF authentication)
- (Optional) pm2 (Required for managing the processes)
- Install optional dependencies for derivative creation
- audiowaveform (Required by the
waveform
service)
- audiowaveform (Required by the
- Set up the configuration (See .env.example for the example configuration)
- Copy .env.example to .env and set up the parameters for development
- Set up the environment variables for production
- With PM2, set up a config.yml file with the environment variables
- Run
yarn install
ornpm install
- Run
tsc
to transpile the application - Start the application:
- Run
node src/app.js
- With PM2:
pm2 start config.yml
- Run
The environment variables used to configure the application:
NODE_ENV
: Should beproduction
in a production environmentIIIF_SERVER_SERVICES
: Comma separated list of services to run on this instance:- General services:
web
: Sets up a web server and the web environmentdirectory-watcher-changes
: Runs a standalone script that watches a directory for new collections to index: when a collection has had no changes for a certain amount of time, the index is triggereddirectory-watcher-file-trigger
: Runs a standalone script that watches a directory for new collections to index: when a collection includes a trigger file, the index is triggeredtext-index
: Runs a worker that indexes texts (transcriptions, translations, etc.)
- Derivative services:
waveform
: Runs a worker that creates waveforms from audio files
- IISH specific services:
iish-archivematica-index
: Runs a worker that indexes IISH DIPs from Archivematicaiish-metadata
: Runs a worker that indexes IISH metadata (MARCXML / EAD)iish-metadata-update
: Runs a cron job that processes changes in the IISH metadataiish-access
: Loads a library that determines access to items for IISH collectionsiish-auth-texts
: Loads a library that provides authentication assistance texts of items from IISH collectionsiish-iiif-metadata
: Loads a library that provides IIIF metadata of items from IISH collections
- General services:
IIIF_SERVER_SECRET
: Signed cookie keyIIIF_SERVER_ACCESS_TOKEN
: Access token for administrator accessIIIF_SERVER_IMAGE_SERVER_URL
: URL of the external IIIF image server (such as Loris)IIIF_SERVER_IMAGE_SERVER_NAME
: Name of the image server (either 'loris' or 'sharp')IIIF_SERVER_METADATA_OAI_URL
: URL of the OAI metadata providerIIIF_SERVER_METADATA_SRW_URL
: URL of the SRW metadata providerIIIF_SERVER_IMAGE_TIER_SEPARATOR
: Separator character to separate between the image identifier and the image tierIIIF_SERVER_MAX_TASKS_PER_WORKER
: The maximum number of tasks a single type of worker can load at the same timeIIIF_SERVER_PORT
: Port to run the web serverIIIF_SERVER_ATTRIBUTION
: Attribution to add to the IIIF manifestsIIIF_SERVER_BASE_URL
: The public base URL of the applicationIIIF_SERVER_HOT_FOLDER_PATH
: The path to the hot folder where new collections to be indexed are placedIIIF_SERVER_HOT_FOLDER_PATTERN
: The pattern of a file in the root of a new collection to trigger indexingIIIF_SERVER_DATA_ROOT_PATH
: The root path of the data storageIIIF_SERVER_COLLECTIONS_REL_PATH
: The relative path of the (read-only) collections under the data storage root pathIIIF_SERVER_DERIVATIVE_REL_PATH
: The relative path of the (read-write) derivatives under the data storage root pathIIIF_SERVER_LOGO_REL_PATH
: The relative path to the image with the logo to add to the IIIF manifestsIIIF_SERVER_AUDIO_REL_PATH
: The relative path to the image with the audio icon to add to the IIIF manifestsIIIF_SERVER_LOGO_DIM
: The dimensions of the logo, separated by a ':'IIIF_SERVER_AUDIO_DIM
: The dimensions of the audio icon, separated by a ':'IIIF_SERVER_PDF_PAGES_THRESHOLD
: If defined, limit dynamic PDF creation per IP address when over this configured threshold (Requires Redis volatile server)IIIF_SERVER_PDF_SESSION_SECONDS
: If defined, the number of seconds to limit dynamic PDF creation per IP address ( Requires Redis volatile server)IIIF_SERVER_PDF_IMAGE_SIZE
: The (IIIF) size of images for the dynamic PDF creation (defaults to 'max')IIIF_SERVER_VIDEO_MOSAIC_WIDTH
: The width of the thumbnails in the video mosaic derivative (defaults to 500)IIIF_SERVER_VIDEO_TILES_ROWS
: The number of rows in the video mosaic derivative (defaults to 6)IIIF_SERVER_VIDEO_TILES_COLUMNS
: The number of columns in the video mosaic derivative (defaults to 5)IIIF_SERVER_LOG_LEVEL
: The logging levelIIIF_SERVER_IP_ADDRESS_HEADER
: The header to read the IP address from, instead ofHTTP_X_FORWARDED_FOR
IIIF_SERVER_INTERNAL_IP_ADDRESSES
: If access may be granted based on IP address, provide a comma separated white list of ip addresses (Requires Redis persistent server)IIIF_SERVER_LOGIN_DISABLED
: Turn login based authentication on/off (Requires Redis persistent server)IIIF_SERVER_EXTERNAL_DISABLED
: Turn external based authentication on/off (Requires Redis persistent server)IIIF_SERVER_ELASTICSEARCH_URL
: URL of the ElasticSearch indexerIIIF_SERVER_ELASTICSEARCH_USER
: Username of the ElasticSearch indexer if authentication is enabledIIIF_SERVER_ELASTICSEARCH_PASSWORD
: Password of the ElasticSearch indexer if authentication is enabledIIIF_SERVER_REDIS_VOLATILE_DISABLED
: Turn Redis volatile server on/off (Sets up caching)IIIF_SERVER_REDIS_VOLATILE_HOST
: Host of the Redis caching serverIIIF_SERVER_REDIS_VOLATILE_PORT
: Port of the Redis caching serverIIIF_SERVER_REDIS_PERSIST_DISABLED
: Turn Redis persistent server on/off (Sets up jobs and auth tokens)IIIF_SERVER_REDIS_PERSIST_HOST
: Host of the Redis persistent serverIIIF_SERVER_REDIS_PERSIST_PORT
: Port of the Redis persistent server