23 Nov 12:14

stchris

8fcdc9f

v1.23.2 Latest

Latest

What's Changed

Check redis set membership properly by @stchris in #211
This fixes a performance regression especially noticeable when there are >10000 jobs queued.
Use trusted publishing for PyPI releases

Full Changelog: v1.23.0...v1.23.2

Contributors

stchris

Assets 2

07 Nov 09:55

stchris

v1.23.1

050fc2e

v1.23.1

What's changed

Bugfixes

Proper clean-up of tasks which have exhausted the maximum number of retries by @catileptic and @stchris in #210

Full Changelog: v1.23.0...v1.23.1

Contributors

stchris and catileptic

Assets 2

09 Oct 12:17

catileptic

v1.23.0

131171c

v1.23.0

⚠️ This release contains breaking changes

The custom messaging queue used by Aleph has been replaced with RabbitMQ. As of this version of servicelayer, Aleph will use a persistent messaging queue. We have seen an increase in stability, predictability and also in the clarity of debugging since making these changes.

The implementation uses a Default, direct Exchange. RabbitMQ allows users to monitor the activity of the messaging queues using a management interface that one can access from the browser, if the proper port is exposed.

In order to populate the System Status view in Aleph, Redis is used to independently track the state of tasks. ⚠️ A breaking change was introduced in terms of the structure of the status API response - we no longer track job_ids, instead tracking tasks (task_ids). The structure of Redis keys has also changed as follows:

Redis keys used by the Dataset object:

tq:qdatasets: set of all collection_ids of active datasets (a dataset is considered active when it has either running or pending tasks)
tq:qdj:<dataset>:taskretry:<task_id>: the number of times task_id was retried

All of the following keys refer to task_ids or statistics about tasks per a certain dataset (collection_id):

tq:qdj:<dataset>:finished: number of tasks that have been marked as "Done" and for which an acknowledgement is also sent by the Worker over RabbitMQ.
tq:qdj:<dataset>:running: set of all task_ids of tasks currently running. A "Running" task is a task which has been checked out, and is being processed by a worker.
tq:qdj:<dataset>:pending: set of all task_ids of tasks currently pending. A "Pending" task has been added to a RabbitMQ queue (via a basic_publish call) by a producer (an API call, a UI action etc.).
tq:qdj:<dataset>:start: the UTC timestamp when either the first task_id has been added to a RabbitMQ queue (so, we have our first Pending task) or the timestamp when the first task_id has been checked out (so, we have our first Running task). The start key is updated when the first task is handed to a Worker.
tq:qdj:<dataset>:last_update: the UTC timestamp from the latest change to the state of tasks running for a certain collection_id. This is set when: a new task is Pending, a new task is Running, a new task is Done, a new task is canceled.
tq:qds:<dataset>:<stage>: a set of all task_ids that are either running or pending, for a certain stage.
tq:qds:<dataset>:<stage>:finished: number of tasks that have been marked as "Done" for a certain stage.
tq:qds:<dataset>:<stage>:running: set of all task_ids of tasks currently running for a certain stage.
tq:qds:<dataset>:<stage>:pending: set of all task_ids of tasks currently pending for a certain stage.

Tasks are assigned a random priority before being added to the appropriate queues to ensure a fair distribution of execution. The current implementation also allows admin users of Aleph to chose to assign a task either a global minimum priority or a global maximum priority.

What's Changed

Adds a last_updated timestamp to the dataset status by @stchris in #136
Pin moto because of breaking changes in version 5.0 + by @stchris in #155
Remove unused GitHub Actions workflow by @tillprochaska in #154
Standardize development dependencies / refactor GHA workflow by @tillprochaska in #153

Dependency upgrades

Bump black from 23.9.1 to 23.11.0 by @dependabot in #135
Bump wheel from 0.41.2 to 0.42.0 by @dependabot in #134
Bump prometheus-client from 0.17.1 to 0.19.0 by @dependabot in #133
Bump ruff from 0.0.292 to 0.1.8 by @dependabot in #138
Bump pytest from 7.4.2 to 7.4.3 by @dependabot in #121
Bump pytest-env from 1.0.1 to 1.1.3 by @dependabot in #132
Bump pytest-mock from 3.11.1 to 3.12.0 by @dependabot in #126
Update development dependencies in groups by @stchris in #139
Bump the dev-dependencies group with 1 update by @dependabot in #140
Bump fakeredis from 2.19.0 to 2.20.1 by @dependabot in #141
Release 1.22.2 by @tillprochaska in #167
Bump the dev-dependencies group with 6 updates by @dependabot in #170
Bump fakeredis from 2.20.1 to 2.22.0 by @dependabot in #168
Bump prometheus-client from 0.19.0 to 0.20.0 by @dependabot in #159
Bump structlog from 23.2.0 to 24.1.0 by @dependabot in #151
Release/1.23.0 by @stchris in #143

Full Changelog: v1.22.1...v1.23.0

Contributors

stchris, tillprochaska, and dependabot

Assets 2

22 Apr 12:43

tillprochaska

v1.22.2

252178b

v1.22.2

⚠️ This release fixes a potential security vulnerability. We strongly encourage you to use this release and disregard previous ones. ⚠️

This release includes a fix for the archive functionality in servicelayer. Previously, the generate_url methods of the Google Cloud Storage archive adapter and the AWS S3 archive adapter were generating URLs instructing AWS S3 and Google Cloud Storage to send a Content-Disposition: inline header in the response.

When sending this header, most browsers will automatically open the file if the file’s MIME type is supported by the browser. This may not be desired in some cases, for example when downloading files from untrustworthy sources.

Starting with this version of servicelayer, the generated URLs will instead instruct AWS S3 and Google Cloud Storage to send a Content-Disposition: attachment header. Browsers won’t open files without user interaction if this header is set.

Assets 2

21 Nov 13:37

tillprochaska

v1.22.1

bd68c29

v1.22.1

What's Changed

Change default port for Prometheus metrics endpoint to 9100 by @tillprochaska in #129
Misc Promethus changes by @tillprochaska in #130

Full Changelog: v1.22.0...v1.22.1

Contributors

tillprochaska

Assets 2

13 Oct 11:24

stchris

v1.22.0

fddf4d7

v1.22.0

What's Changed

Add basic Prometheus instrumentation for workers by @tillprochaska in #111
Log worker retry count and retry count exhaustion by @stchris in #113

Dependency upgrades

Bump pytest-env from 0.8.1 to 1.0.1 by @dependabot in #110
Bump wheel from 0.40.0 to 0.41.2 by @dependabot in #108
Bump ruff from 0.0.270 to 0.0.292 by @dependabot in #119
Bump fakeredis from 2.13.0 to 2.19.0 by @dependabot in #118
Bump black from 23.3.0 to 23.9.1 by @dependabot in #117
Bump structlog from 23.1.0 to 23.2.0 by @dependabot in #116
Bump pytest from 7.3.1 to 7.4.2 by @dependabot in #115
Bump normality from 2.4.0 to 2.5.0 by @dependabot in #114
Bump pytest-mock from 3.10.0 to 3.11.1 by @dependabot in #98

New Contributors

@tillprochaska made their first contribution in #111

Full Changelog: v1.21.2...v1.22.0

Contributors

stchris, tillprochaska, and dependabot

Assets 2

14 Jun 17:28

stchris

v1.21.0

eb343c8

v1.21.0

What's Changed

Add Sentry support to servicelayer workers by @stchris in #88

This release adds support for sending error tracebacks to sentry.io (or a self-hosted instance). This is controlled by two environment variables: SENTRY_DSN and SENTRY_ENVIRONMENT. Note that you also have to take care of installing the sentry_sdk package.
Add and enforce linter (ruff) and code formatter (black) by @stchris in #89

This updates the development environment and CI configuration to be closer to what we have in Aleph.

Full Changelog: v1.20.7...v1.21.0

Contributors

stchris

Assets 2

02 May 12:43

stchris

v1.20.7

7ce4108

v1.20.7

What's Changed

Bump fakeredis to 2.11.2
Add release steps to README

New Contributors

@stchris made their first contribution in #87

Full Changelog: v1.20.6...v1.20.7

Contributors

stchris

Assets 2

25 Apr 14:20

catileptic

v1.20.6

ec9a07c

v1.20.6

What's Changed

Bump pika from 1.3.0 to 1.3.1 by @dependabot in #76
Bump fakeredis from 1.9.1 to 1.10.0 by @dependabot in #77
Bump fakeredis from 1.10.0 to 1.10.1 by @dependabot in #79
refactor to suport SQLAlchemy 2.0 migration by @catileptic in #82

New Contributors

@catileptic made their first contribution in #82

Full Changelog: v1.20.5...v1.20.6

Contributors

catileptic and dependabot

Assets 2

29 Mar 13:11

stchris

v1.20.5

ebbd6ed

v1.20.5

What's Changed

Bump fakeredis from 1.8.1 to 1.9.0 by @dependabot in #71
Bump fakeredis from 1.9.0 to 1.9.1 by @dependabot in #73
Bump pika from 1.2.0 to 1.3.0 by @dependabot in #68
Update structlog requirement from <22.0.0,>=20.2.0 to >=20.2.0,<23.0.0 by @dependabot in #70

Full Changelog: v1.20.4...v1.20.5

Contributors

dependabot

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

Contributors

What's changed

Bugfixes

Contributors

What's Changed

Dependency upgrades

Contributors

What's Changed

Contributors

What's Changed

Dependency upgrades

New Contributors

Contributors

What's Changed

Contributors

What's Changed

New Contributors

Contributors

What's Changed

New Contributors

Contributors

What's Changed

Contributors

Releases: alephdata/servicelayer

v1.23.2

What's Changed

Contributors

v1.23.1

What's changed

Bugfixes

Contributors

v1.23.0

What's Changed

Dependency upgrades

Contributors

v1.22.2

v1.22.1

What's Changed

Contributors

v1.22.0

What's Changed

Dependency upgrades

New Contributors

Contributors

v1.21.0

What's Changed

Contributors

v1.20.7

What's Changed

New Contributors

Contributors

v1.20.6

What's Changed

New Contributors

Contributors

v1.20.5

What's Changed

Contributors