Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

STF 1.5.5 release ops #546

Merged
merged 54 commits into from
Sep 11, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
54 commits
Select commit Hold shift + click to select a range
87686b0
Fix xrefs for director Operator (#481)
leifmadsen Jun 19, 2023
be5cec0
Initial pass for external ES (#483)
csibbitt Aug 22, 2023
982cec2
Trivial leftover suggestions (#485)
csibbitt Aug 22, 2023
1f7aca7
Link ES section to KB article (#486)
csibbitt Sep 5, 2023
564130c
Initial changes to installation for STF 1.5.3 (#484)
leifmadsen Sep 5, 2023
4856eda
use_redhat and migration link (#462)
csibbitt Sep 7, 2023
b248a57
Override qdr::router_id defaults in stf-connectors (#487)
leifmadsen Sep 11, 2023
563a551
Don't enable event collection by default on OSP (#488)
leifmadsen Sep 12, 2023
72d91b6
No longer import the events dashboard (#490)
leifmadsen Sep 13, 2023
e2ba966
Installation of cluster monitoring is no longer necessary (#491)
leifmadsen Sep 14, 2023
ae4bbb9
Adjust the default polling interval for collectd (#489)
leifmadsen Sep 21, 2023
a17f01e
Remove logs configuration from sample CR (#493)
leifmadsen Sep 26, 2023
720ac91
mg_master_RHOSPDOC-1380_chunk-installation-procedure (#492)
mickogeary Sep 26, 2023
428b8ae
Reduce the number of Ceilometer pollsters (#497)
leifmadsen Sep 28, 2023
bd2ba0c
Deprecate the use of high availability mode in STF (#494)
leifmadsen Sep 28, 2023
56ba82e
Fix up the table syntax in Observability Strategy (#495)
leifmadsen Sep 28, 2023
24f9d92
Do not manage the event pipeline by default (#498)
leifmadsen Sep 28, 2023
e99b8a1
Minor clean up and user experience updates (#496)
leifmadsen Oct 2, 2023
ac64c72
Creating an alert does not use curl (#500)
leifmadsen Oct 3, 2023
a5a82aa
Eliminate duplicate line (#501)
csibbitt Oct 4, 2023
3e400fc
Adding details for QDR password auth (#502)
csibbitt Oct 12, 2023
13e7f21
Support OCP versions 4.12 through 4.14 (#503)
leifmadsen Oct 24, 2023
3802f4a
Summary: Replace incorrect stf-connectors.yaml filename with enable-s…
rheslop Oct 24, 2023
f94a996
Clean up the STF install (#505)
leifmadsen Oct 24, 2023
741aba4
Provide the preferred STF object for deployment (#507)
leifmadsen Oct 24, 2023
51e894c
Fix various RHOSP links and versions (#508)
leifmadsen Oct 25, 2023
0c88867
Update and adjust dashboard procedures (#509)
leifmadsen Oct 25, 2023
0c5a236
Add deprecation note for Grafana authentication (#510)
leifmadsen Oct 25, 2023
c8386ab
Update deprecated Grafana login warning (#511)
leifmadsen Oct 25, 2023
e2f1961
Add updated architecture diagrams (#499)
leifmadsen Nov 3, 2023
9c79b75
Update install guide for dependent operators (#513)
leifmadsen Nov 23, 2023
b7ef9b6
Clean up the prerequisites lists (#514)
leifmadsen Nov 23, 2023
0cad5de
Add removal instructions for COO (#516)
leifmadsen Nov 28, 2023
66bd308
Refer to cert-manager removal documentation (#515)
leifmadsen Nov 28, 2023
86aec74
Pre-STF 1.5.3 Documentation Walkthrough and Cleanup (#517)
leifmadsen Nov 30, 2023
bd3472c
Modularize STF architecture changes (#518)
leifmadsen Nov 30, 2023
d098587
Update diagrams for Cluster Observability Operator (#519)
leifmadsen Dec 1, 2023
c1d1be8
mg_master_517_minor-style-edits (#521)
mickogeary Dec 5, 2023
0e56add
Reference 17.1 in docinfo.xml (#522)
leifmadsen Dec 5, 2023
36d54d1
PrometheusRules must reference monitoring.rhobs (#523)
leifmadsen Dec 5, 2023
eea657b
Basic Auth in Grafana no longer supported (#525)
csibbitt Dec 7, 2023
3b88889
Adjust prometheus query to use token (#520)
csibbitt Dec 12, 2023
e534bf3
Update installation to target Grafana Operator v5 (#526)
leifmadsen Jan 30, 2024
0d5626e
Add enable dashboard procedure (#527)
leifmadsen Jan 31, 2024
8e1b8fa
Update OCP version support status (#529)
leifmadsen Feb 8, 2024
d8e2c12
Update required resource permission reference (#528)
leifmadsen Feb 8, 2024
8477306
Drop unused module found in other issue (#533)
leifmadsen Mar 1, 2024
fe90630
mg-master_RHOSPDOC-1200_STF-disconnected (#531)
mickogeary Mar 5, 2024
dbb15fb
Use OpenShift supported version vars vs static (#536)
leifmadsen Mar 6, 2024
2033bb6
minor doc updates from OSPRH-7722, OSP-32244, OSP-32175 (#541)
mickogeary Jun 20, 2024
0480450
add-callouts-for-stfconnectorsyaml-in-proc_configuring-the-stf-connec…
mickogeary Jun 21, 2024
3b72ed4
Use plural resource names in permissions (#544)
csibbitt Jul 17, 2024
bf28151
Update supported OCP versions (#545)
vkmc Sep 10, 2024
cba41a6
Merge branch 'master' into release-prep-1.5.5
vkmc Sep 10, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions common/global/stf-attributes.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -49,8 +49,8 @@ ifeval::["{build}" == "upstream"]
:Project: Service{nbsp}Telemetry{nbsp}Framework
:ProjectShort: STF
:MessageBus: Apache{nbsp}Qpid{nbsp}Dispatch{nbsp}Router
:SupportedOpenShiftVersion: 4.12
:NextSupportedOpenShiftVersion: 4.14
:SupportedOpenShiftVersion: 4.14
:NextSupportedOpenShiftVersion: 4.16
:CodeReadyContainersVersion: 2.19.0
endif::[]

Expand All @@ -67,6 +67,6 @@ ifeval::["{build}" == "downstream"]
:Project: Service{nbsp}Telemetry{nbsp}Framework
:ProjectShort: STF
:MessageBus: AMQ{nbsp}Interconnect
:SupportedOpenShiftVersion: 4.12
:NextSupportedOpenShiftVersion: 4.14
:SupportedOpenShiftVersion: 4.14
:NextSupportedOpenShiftVersion: 4.16
endif::[]
Original file line number Diff line number Diff line change
Expand Up @@ -10,9 +10,9 @@ You need the following permissions to access the corresponding component UI's:

[source,json,options="nowrap"]
----
{"namespace":"service-telemetry", "resource":"grafana", "group":"grafana.integreatly.org", "verb":"get"}
{"namespace":"service-telemetry", "resource":"prometheus", "group":"monitoring.rhobs", "verb":"get"}
{"namespace":"service-telemetry", "resource":"alertmanager", "group":"monitoring.rhobs", "verb":"get"}
{"namespace":"service-telemetry", "resource":"grafanas", "group":"grafana.integreatly.org", "verb":"get"}
{"namespace":"service-telemetry", "resource":"prometheuses", "group":"monitoring.rhobs", "verb":"get"}
{"namespace":"service-telemetry", "resource":"alertmanagers", "group":"monitoring.rhobs", "verb":"get"}
----

For more information about RBAC, see https://docs.openshift.com/container-platform/{NextSupportedOpenShiftVersion}/authentication/using-rbac.html[Using RBAC to define and apply permissions].
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@ data:
verifyHostname: false
sslProfile: sslProfile
saslUsername: guest@default-interconnect
saslPassword: pass:<password_from_stf>
saslPassword: <password_from_stf>

MetricsQdrSSLProfiles:
- name: sslProfile
Expand All @@ -58,6 +58,18 @@ data:
CollectdSensubilityResultsChannel: sensubility/cloud1-telemetry
----

* The `resource_registry` configuration directly loads the collectd service because you do not include the `collectd-write-qdr.yaml` environment file for multiple cloud deployments.
* Replace the `host` sub-parameter of `MetricsQdrConnectors` with the value that you retrieved in xref:retrieving-the-qdr-route-address_assembly-completing-the-stf-configuration[].
* Replace the `<password_from_stf>` portion of the `saslPassword` sub-parameter of `MetricsQdrConnectors` with the value you retrieved in xref:retrieving-the-qdr-password_assembly-completing-the-stf-configuration[].
ifdef::include_when_13,include_when_17[]
* Replace the `caCertFileContent` parameter with the contents retrieved in xref:getting-ca-certificate-from-stf-for-overcloud-configuration_assembly-completing-the-stf-configuration[].
endif::include_when_13,include_when_17[]
* Set `topic` value of `CeilometerQdrMetricsConfig.topic` to define the topic for Ceilometer metrics. The value is a unique topic identifier for the cloud such as `cloud1-metering`.
* Set `CollectdAmqpInstances` sub-parameter to define the topic for collectd metrics. The section name is a unique topic identifier for the cloud such as `cloud1-telemetry`.
ifndef::include_when_13[]
* Set `CollectdSensubilityResultsChannel` to define the topic for collectd-sensubility events. The value is a unique topic identifier for the cloud such as `sensubility/cloud1-telemetry`.
endif::[]

[role="_additional-resources"]
.Additional resources
* For more information about the `stf-connectors.yaml` environment file, see xref:configuring-the-stf-connection-for-the-overcloud_assembly-completing-the-stf-configuration[].
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ parameter_defaults:
verifyHostname: false
sslProfile: sslProfile
saslUsername: guest@default-interconnect
saslPassword: pass:<password_from_stf>
saslPassword: <password_from_stf>

MetricsQdrSSLProfiles:
- name: sslProfile
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -54,8 +54,9 @@ data:
CollectdConnectionType: amqp1
CollectdAmqpInterval: 30
CollectdDefaultPollingInterval: 30
CollectdExtraPlugins:
- vmem
# to collect information about the virtual memory subsystem of the kernel
# CollectdExtraPlugins:
# - vmem

# set standard prefixes for where metrics are published to QDR
MetricsQdrAddresses:
Expand All @@ -74,13 +75,16 @@ data:
# note: this may need an adjustment if there are many metrics to be sent.
collectd::plugin::amqp1::send_queue_limit: 5000

# receive extra information about virtual memory
collectd::plugin::vmem::verbose: true
# to receive extra information about virtual memory, you must enable vmem plugin in CollectdExtraPlugins
# collectd::plugin::vmem::verbose: true

# provide name and uuid in addition to hostname for better correlation
# to ceilometer data
collectd::plugin::virt::hostname_format: "name uuid hostname"

# to capture all extra_stats metrics, comment out below config
collectd::plugin::virt::extra_stats: cpu_util vcpu disk

# provide the human-friendly name of the virtual instance
collectd::plugin:ConfigMap :virt::plugin_instance_format: metadata

Expand All @@ -90,6 +94,9 @@ data:
local:
host: "%{hiera('fqdn_canonical')}"
port: 11211

# report root filesystem storage metrics
collectd::plugin::df::ignoreselected: false
----

[role="_additional-resources"]
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -35,9 +35,10 @@ parameter_defaults:
CollectdConnectionType: amqp1
CollectdAmqpInterval: 30
CollectdDefaultPollingInterval: 30
CollectdExtraPlugins:
- vmem

# to collect information about the virtual memory subsystem of the kernel
# CollectdExtraPlugins:
# - vmem

# set standard prefixes for where metrics are published to QDR
MetricsQdrAddresses:
- prefix: 'collectd'
Expand All @@ -55,8 +56,8 @@ parameter_defaults:
# note: Adjust the value of the `send_queue_limit` to handle your required volume of metrics.
collectd::plugin::amqp1::send_queue_limit: 5000

# receive extra information about virtual memory
collectd::plugin::vmem::verbose: true
# to receive extra information about virtual memory, you must enable vmem plugin in CollectdExtraPlugins
# collectd::plugin::vmem::verbose: true

# set memcached collectd plugin to report its metrics by hostname
# rather than host IP, ensuring metrics in the dashboard remain uniform
Expand All @@ -65,6 +66,9 @@ parameter_defaults:
host: "%{hiera('fqdn_canonical')}"
port: 11211

# report root filesystem storage metrics
collectd::plugin::df::ignoreselected: false

# align defaults across OSP versions
collectd::plugin::cpu::reportbycpu: true
collectd::plugin::cpu::reportbystate: true
Expand Down Expand Up @@ -105,8 +109,9 @@ parameter_defaults:
CollectdConnectionType: amqp1
CollectdAmqpInterval: 30
CollectdDefaultPollingInterval: 30
CollectdExtraPlugins:
- vmem
# to collect information about the virtual memory subsystem of the kernel
# CollectdExtraPlugins:
# - vmem

# set standard prefixes for where metrics are published to QDR
MetricsQdrAddresses:
Expand All @@ -125,13 +130,16 @@ parameter_defaults:
# note: this may need an adjustment if there are many metrics to be sent.
collectd::plugin::amqp1::send_queue_limit: 5000

# receive extra information about virtual memory
collectd::plugin::vmem::verbose: true
# to receive extra information about virtual memory, you must enable vmem plugin in CollectdExtraPlugins
# collectd::plugin::vmem::verbose: true

# provide name and uuid in addition to hostname for better correlation
# to ceilometer data
collectd::plugin::virt::hostname_format: "name uuid hostname"

# to capture all extra_stats metrics, comment out below config
collectd::plugin::virt::extra_stats: cpu_util vcpu disk

# provide the human-friendly name of the virtual instance
collectd::plugin::virt::plugin_instance_format: metadata

Expand All @@ -141,5 +149,9 @@ parameter_defaults:
local:
host: "%{hiera('fqdn_canonical')}"
port: 11211

# report root filesystem storage metrics
collectd::plugin::df::ignoreselected: false

----
endif::include_when_16[]
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ Since {Project} ({ProjectShort}) version 1.5.4, you can deploy {ProjectShort} in

.Prerequisites

* {OpenShift} Extended Update Support (EUS) version 4.12 or 4.14 deployed in a restricted network.
* {OpenShift} Extended Update Support (EUS) version {SupportedOpenShiftVersion} or {NextSupportedOpenShiftVersion} deployed in a restricted network.
* A mirror registry so that the {OpenShift} cluster can access the required images. For more information about mirror registries, see link:https://access.redhat.com/documentation/en-us/openshift_container_platform/{NextSupportedOpenShiftVersion}/html/installing/disconnected-installation-mirroring[Disconnected installation mirroring] in the {OpenShift} _Installing_ guide.
* All the {ProjectShort} dependencies are available in the {OpenShift} cluster mirror registry.

Expand Down
Loading