Skip to content

Releases: criteo-forks/marathon

v1.9.109-criteo25

04 Feb 14:01
Compare
Choose a tag to compare
Allow multiple health check shields per task id

* Allow multiple health check shields per task id
  Shield is now identified by /v2/shield/<task_id>/<shield_name>

* Renames after live review

* Fixed tests

* Better error handling in the actor

v1.9.109-criteo24

04 Feb 13:41
Compare
Choose a tag to compare
Fix a log uppon offer receival

It's better to have hostname on same line, otherwise multiline logs are
a real pain to parse...

v1.9.109-bes-ivnestigation-criteo1

03 Feb 16:59
Compare
Choose a tag to compare
Fix log presentation

Putting back-off delay at the end makes it unlikely to see it because
when there are too many scheduledInstances, log get truncated by
rsyslog. Delay is more important and having an extract of few
scheduledInstances is enough.

v1.9.109-oneline-criteo1

02 Feb 16:18
Compare
Choose a tag to compare
Put logs for unsatisfied offers in one line

Before, log was spread accross multiple lines, and it was making
investigation very difficult for incidents related to offer matching

v1.9.109-criteo23

02 Feb 16:40
Compare
Choose a tag to compare
Put logs for unsatisfied offers in one line

Before, log was spread accross multiple lines, and it was making
investigation very difficult for incidents related to offer matching

v1.9.109-criteo20-cni1

01 Feb 12:02
Compare
Choose a tag to compare

v1.9.109-criteo22

25 Jan 09:34
Compare
Choose a tag to compare
Remove old comments

v1.9.109-criteo21

25 Jan 09:32
Compare
Choose a tag to compare
Put runspec id along with logs

v1.9.109-criteo20

15 Jan 11:22
Compare
Choose a tag to compare
Let HealthCheckActor forget outdated healthcheck result

We observed an issue with Mesos healthchecks.
When a task failed its Mesos HC, it was killed and replaced by a new
task - this was expected. What was unexpected however was that the new
task was also seen as unhealthy (even if it was running correctly), with
2 healthcheck results attached to it (an outdated one reporting a
failure, and a correct one reporting success).

This commit fixes the issue by letting the HealthCheckActor forget
outdated healthckeck result.

JIRA: MESOS-4437

v1.9.109-criteo19

12 Jan 13:52
Compare
Choose a tag to compare
Let TaskReplaceActor ignore outdated health checks

We observed issues when mixing Marathon and Mesos health checks.
Sometimes, the Mesos health checks get duplicated, and one 'replica'
keeps saying the instance is not alive, even if it is.
These broken/outdated health checks can be spotted easily, since their last
update timestamp is prior to the timestamp at which the task was
staged at.
Ignoring these outdated health checks allows the TaskReplaceActor to
correctly kill the healthy instances, and perform the deployment of the
new version of the application.

JIRA: MESOS-4437