From b0e9f7fd655ec626ad1a8f16ab6db53647bf8bd1 Mon Sep 17 00:00:00 2001 From: Joram Wilander Date: Mon, 16 Sep 2024 15:46:42 -0400 Subject: [PATCH 1/4] Add 'ensuring releases perform at scale' doc --- source/guides/scale-mattermost.rst | 1 + .../ensuring-releases-perform-at-scale.rst | 45 +++++++++++++++++++ 2 files changed, 46 insertions(+) create mode 100644 source/scale/ensuring-releases-perform-at-scale.rst diff --git a/source/guides/scale-mattermost.rst b/source/guides/scale-mattermost.rst index 9170a5ea663..c13a1561248 100644 --- a/source/guides/scale-mattermost.rst +++ b/source/guides/scale-mattermost.rst @@ -41,3 +41,4 @@ Scale and monitor your Mattermost deployment. * :doc:`Deploy Prometheus and Grafana for performance monitoring ` - Use Prometheus and Grafana to monitor the health and performance of your Mattermost cluster. * :doc:`Performance monitoring metrics ` - The custom and standard Go metrics available for monitoring system performance. * :doc:`Mattermost performance alerting guide ` - Learn strategies and best practices for monitoring your Mattermost cluster. +* :doc:`Ensuring releases perform at scale ` - Learn how Mattermost ensures new releases perform at scale. diff --git a/source/scale/ensuring-releases-perform-at-scale.rst b/source/scale/ensuring-releases-perform-at-scale.rst new file mode 100644 index 00000000000..5636854a234 --- /dev/null +++ b/source/scale/ensuring-releases-perform-at-scale.rst @@ -0,0 +1,45 @@ +Ensuring Releases Perform at Scale +====================== + +To ensure each release of Mattermost upholds our high standards for performance at scale the team performs thorough load testing, develops features with scale in mind, and follows strict guidelines for database schema migrations. + +Monthly release load tests +--------------------------------- + +Each month, before being approved for distribution, a release candidate of Mattermost is load tested via a comprehensive and mature set of `load test tooling `__ using simulated data that matches real-world, high-scale usage patterns. + +Multiple tests covering different configurations are run against highly-available deployments of Mattermost with thousands of users and millions of posts. Both PostgreSQL and MySQL are tested, although MySQL tests will be dropped when :doc:`MySQL goes out of support with Mattermost v11 `. + +The load tests generate a report detailing average API request times, database i/o, memory usage, concurrency, requests per second and more. This report on the performance of the release candidate is then compared to the previous month's report. Any non-normal deviations are investigated and remedied before the release candidate is promoted to the final release. + +Each report, along with analysis, is posted into the public `~Developers: Performance `__ channel. + + +Developing scalable features and systems +-------------------------------- + +Scale is a major consideration during the development of new features and systems. It is included in technical specifications from the beginning of the software design process. + +As a part of implementation, the load test agent is updated to include coverage for the feature or system. A load test is then run with results being compared to baselines. + +The code changes are then reviewed by at least two developers and a SDET/QA analyst before being merged. + +Once merged, a new build is created and deployed the next day to `community.mattermost.com `__ where any impacts on performance are monitored for 3-4 weeks before the changes are included in a release candidate. + + +Database schema changes +----------------------------- + +Database schema changes are kept to a minimum to reduce risk on upgrades and impact on performance. When a change in schema is required, Mattermost follows `strict migration guidelines `__ that minimize risk and prevents performance impact during and after migrations. + +If a more involved migration is required, detailed analysis is performed and published with guidance. An example analysis is the `Mattermost v6.0 schema migration `__. + +Additionally, all database schema changes are load tested as a part of our monthly release process. + + +Monitoring post-release +----------------------------- + +After a new version of Mattermost is released it is rolled out to Mattermost Cloud customers over a multi-week period. During rollout performance and error rate metrics are monitored for any unexpected changes. If any user-impacting changes are observed, the release is reverted and the deviations are investigated with any fixes being delivered as a part of patch release. + +Additionally Mattermost channels, user forums and support tickets are closely monitored for reports of any issues. Any reports are investigated and resolved appropriately. From 7e046f5d6a7152006673b93aa18d85910ac0f71e Mon Sep 17 00:00:00 2001 From: Joram Wilander Date: Tue, 17 Sep 2024 11:30:47 -0400 Subject: [PATCH 2/4] Apply suggestions from code review Co-authored-by: Carrie Warner (Mattermost) <74422101+cwarnermm@users.noreply.github.com> --- .../scale/ensuring-releases-perform-at-scale.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/source/scale/ensuring-releases-perform-at-scale.rst b/source/scale/ensuring-releases-perform-at-scale.rst index 5636854a234..570bc697e70 100644 --- a/source/scale/ensuring-releases-perform-at-scale.rst +++ b/source/scale/ensuring-releases-perform-at-scale.rst @@ -1,30 +1,30 @@ Ensuring Releases Perform at Scale ====================== -To ensure each release of Mattermost upholds our high standards for performance at scale the team performs thorough load testing, develops features with scale in mind, and follows strict guidelines for database schema migrations. +To ensure each release of Mattermost upholds our high standards for performance at scale, the Mattermost Engineering team performs thorough load testing, develops features with scale in mind, and follows strict guidelines for database schema migrations. Monthly release load tests --------------------------------- Each month, before being approved for distribution, a release candidate of Mattermost is load tested via a comprehensive and mature set of `load test tooling `__ using simulated data that matches real-world, high-scale usage patterns. -Multiple tests covering different configurations are run against highly-available deployments of Mattermost with thousands of users and millions of posts. Both PostgreSQL and MySQL are tested, although MySQL tests will be dropped when :doc:`MySQL goes out of support with Mattermost v11 `. +Multiple tests covering different configurations are run against highly-available deployments of Mattermost with thousands of users and millions of posts. Both PostgreSQL and MySQL are tested, although MySQL tests will be dropped when :ref:`MySQL goes out of support with Mattermost v11 `. -The load tests generate a report detailing average API request times, database i/o, memory usage, concurrency, requests per second and more. This report on the performance of the release candidate is then compared to the previous month's report. Any non-normal deviations are investigated and remedied before the release candidate is promoted to the final release. +The load tests generate a report detailing average API request times, database I/O, memory usage, concurrency, requests per second, and more. This performance report on the release candidate is then compared to the previous month's report. Any deviations are investigated and remedied before the release candidate is promoted to the final release. -Each report, along with analysis, is posted into the public `~Developers: Performance `__ channel. +Each report, along with analysis, is posted into the public `Developers: Performance `__ channel. Developing scalable features and systems -------------------------------- -Scale is a major consideration during the development of new features and systems. It is included in technical specifications from the beginning of the software design process. +Scale is a major consideration during the development of new features and systems, and it's included in technical specifications from the beginning of the software design process. As a part of implementation, the load test agent is updated to include coverage for the feature or system. A load test is then run with results being compared to baselines. The code changes are then reviewed by at least two developers and a SDET/QA analyst before being merged. -Once merged, a new build is created and deployed the next day to `community.mattermost.com `__ where any impacts on performance are monitored for 3-4 weeks before the changes are included in a release candidate. +Once merged, a new build is created and deployed the next day to the `Mattermost Community Server `__ where any impacts on performance are monitored for 3-4 weeks before the changes are included in a release candidate. Database schema changes @@ -40,6 +40,6 @@ Additionally, all database schema changes are load tested as a part of our month Monitoring post-release ----------------------------- -After a new version of Mattermost is released it is rolled out to Mattermost Cloud customers over a multi-week period. During rollout performance and error rate metrics are monitored for any unexpected changes. If any user-impacting changes are observed, the release is reverted and the deviations are investigated with any fixes being delivered as a part of patch release. +After a new version of Mattermost is released, it is rolled out to Mattermost Cloud customers over a multi-week period. During roll out, performance and error rate metrics are monitored for any unexpected changes. If any user-impacting changes are observed, the release is reverted and the deviations are investigated with any fixes being delivered as a part of patch release. -Additionally Mattermost channels, user forums and support tickets are closely monitored for reports of any issues. Any reports are investigated and resolved appropriately. +In addition, Mattermost channels, user forums, and support tickets are closely monitored for reports of any issues. Any reports are investigated and resolved appropriately. From 8d9c7cb096f849c701d230241d6986129da844ff Mon Sep 17 00:00:00 2001 From: Joram Wilander Date: Tue, 17 Sep 2024 11:33:06 -0400 Subject: [PATCH 3/4] Updates per review feedback --- source/guides/scale-mattermost.rst | 1 + source/scale/ensuring-releases-perform-at-scale.rst | 10 +++++----- 2 files changed, 6 insertions(+), 5 deletions(-) diff --git a/source/guides/scale-mattermost.rst b/source/guides/scale-mattermost.rst index c13a1561248..f4f89d3582b 100644 --- a/source/guides/scale-mattermost.rst +++ b/source/guides/scale-mattermost.rst @@ -21,6 +21,7 @@ Scale Mattermost Deploy Prometheus and Grafana for performance monitoring Performance monitoring metrics Mattermost performance alerting guide + Ensuring releases perform at scale Scale and monitor your Mattermost deployment. diff --git a/source/scale/ensuring-releases-perform-at-scale.rst b/source/scale/ensuring-releases-perform-at-scale.rst index 570bc697e70..8a52237618b 100644 --- a/source/scale/ensuring-releases-perform-at-scale.rst +++ b/source/scale/ensuring-releases-perform-at-scale.rst @@ -1,10 +1,10 @@ Ensuring Releases Perform at Scale -====================== +================================== To ensure each release of Mattermost upholds our high standards for performance at scale, the Mattermost Engineering team performs thorough load testing, develops features with scale in mind, and follows strict guidelines for database schema migrations. Monthly release load tests ---------------------------------- +-------------------------- Each month, before being approved for distribution, a release candidate of Mattermost is load tested via a comprehensive and mature set of `load test tooling `__ using simulated data that matches real-world, high-scale usage patterns. @@ -16,7 +16,7 @@ Each report, along with analysis, is posted into the public `Developers: Perform Developing scalable features and systems --------------------------------- +---------------------------------------- Scale is a major consideration during the development of new features and systems, and it's included in technical specifications from the beginning of the software design process. @@ -28,7 +28,7 @@ Once merged, a new build is created and deployed the next day to the `Mattermost Database schema changes ------------------------------ +----------------------- Database schema changes are kept to a minimum to reduce risk on upgrades and impact on performance. When a change in schema is required, Mattermost follows `strict migration guidelines `__ that minimize risk and prevents performance impact during and after migrations. @@ -38,7 +38,7 @@ Additionally, all database schema changes are load tested as a part of our month Monitoring post-release ------------------------------ +----------------------- After a new version of Mattermost is released, it is rolled out to Mattermost Cloud customers over a multi-week period. During roll out, performance and error rate metrics are monitored for any unexpected changes. If any user-impacting changes are observed, the release is reverted and the deviations are investigated with any fixes being delivered as a part of patch release. From 037224b003e20be37d5601246498f089087fba07 Mon Sep 17 00:00:00 2001 From: Joram Wilander Date: Tue, 17 Sep 2024 12:01:57 -0400 Subject: [PATCH 4/4] Clarification per feedback --- source/scale/ensuring-releases-perform-at-scale.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/source/scale/ensuring-releases-perform-at-scale.rst b/source/scale/ensuring-releases-perform-at-scale.rst index 8a52237618b..f41f6318320 100644 --- a/source/scale/ensuring-releases-perform-at-scale.rst +++ b/source/scale/ensuring-releases-perform-at-scale.rst @@ -10,7 +10,7 @@ Each month, before being approved for distribution, a release candidate of Matte Multiple tests covering different configurations are run against highly-available deployments of Mattermost with thousands of users and millions of posts. Both PostgreSQL and MySQL are tested, although MySQL tests will be dropped when :ref:`MySQL goes out of support with Mattermost v11 `. -The load tests generate a report detailing average API request times, database I/O, memory usage, concurrency, requests per second, and more. This performance report on the release candidate is then compared to the previous month's report. Any deviations are investigated and remedied before the release candidate is promoted to the final release. +The load tests generate a report detailing average API request times, database I/O, memory usage, concurrency, requests per second, and more. This performance report on the release candidate is then compared to the report of the latest previous stable version. Any deviations are investigated and remedied before the release candidate is promoted to the final release. Each report, along with analysis, is posted into the public `Developers: Performance `__ channel.