Create and expose metrics for 1-1 restore service #4204

Open
karol-kokoszka opened this issue Jan 13, 2025 · 0 comments

As 1-1 restore is going to be a new service in Scylla Manager, a new set of metrics needs to be created.

It must be possible to:

  • track the SSTables copy part
    First, SM must compute how much data has to be copied (the size is part of the manifest file). Then, SM must periodically query Rclone for the given job ID to see how many bytes have been copied so far.
    scylla_manager_<clone_restore_or_other_name>_remaining_bytes{source_cluster_id, cluster_id, snapshot, keyspace, column_family, node_id}
  • track the overall restore
    Gauge showing the current state:
    • 1 - copying data (transition from any other state when SM starts copying the data)
    • 2 - call to refresh SSTables on the node (transition from 1 after the data is copied)
    • 3 - done (transition from 2 when the refresh is done)
      scylla_manager_<clone_restore_or_other_name>_state{source_cluster_id, cluster_id, snapshot, keyspace, column_family, node_id}
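
A minimal sketch of the bookkeeping behind the two gauges above, written in Python only for illustration (it is not Scylla Manager code and does not use a Prometheus client). The names `Labels`, `RestoreState` and `RestoreMetrics` are hypothetical. It assumes the total size comes from the manifest and the copied byte count comes from polling the Rclone job, and that "remaining" is simply total minus copied, clamped at zero.

```python
from dataclasses import dataclass
from enum import IntEnum


class RestoreState(IntEnum):
    """Gauge values as listed in the issue."""
    COPYING = 1     # SM started to copy the data
    REFRESHING = 2  # refresh SSTables was called on the node
    DONE = 3        # refresh finished


@dataclass(frozen=True)
class Labels:
    """Label set shared by both metrics (hashable, so usable as a dict key)."""
    source_cluster_id: str
    cluster_id: str
    snapshot: str
    keyspace: str
    column_family: str
    node_id: str


class RestoreMetrics:
    """In-memory stand-in for the *_remaining_bytes and *_state gauges."""

    def __init__(self):
        self._total = {}     # Labels -> bytes to copy (from the manifest)
        self.remaining = {}  # Labels -> value of the remaining_bytes gauge
        self.state = {}      # Labels -> value of the state gauge

    def set_total(self, labels, total_bytes):
        # Before any copy progress is known, everything is still remaining.
        self._total[labels] = total_bytes
        self.remaining[labels] = total_bytes

    def update_copied(self, labels, copied_bytes):
        # Called after each periodic poll of the copy job.
        # Clamp at zero in case the reported count exceeds the manifest size.
        self.remaining[labels] = max(self._total[labels] - copied_bytes, 0)

    def set_state(self, labels, new_state):
        current = self.state.get(labels)
        if new_state == RestoreState.COPYING:
            allowed = current != RestoreState.COPYING   # from any other state
        elif new_state == RestoreState.REFRESHING:
            allowed = current == RestoreState.COPYING   # only from 1
        else:
            allowed = current == RestoreState.REFRESHING  # only from 2
        if not allowed:
            raise ValueError(f"invalid transition {current!r} -> {new_state!r}")
        self.state[labels] = new_state


if __name__ == "__main__":
    m = RestoreMetrics()
    labels = Labels("src", "dst", "snap", "ks", "tbl", "node1")
    m.set_total(labels, 1000)
    m.set_state(labels, RestoreState.COPYING)
    m.update_copied(labels, 400)
    print(m.remaining[labels], int(m.state[labels]))
```

Rejecting invalid transitions is a design choice of this sketch; the issue only lists which transitions are expected, not what should happen on an unexpected one.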
@karol-kokoszka karol-kokoszka added this to the 1-1 Restore milestone Jan 13, 2025