-
Notifications
You must be signed in to change notification settings - Fork 114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(traces): OTeL Traces implementation(duty flow) #1980
Draft
oleg-ssvlabs
wants to merge
17
commits into
stage
Choose a base branch
from
traces
base: stage
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
34dc244
to
819f619
Compare
c79a8cc
to
f63cc5e
Compare
* chore(networkconfig): add new SSV Labs bootnodes * Update mainnet.go * Update holesky.go
…tivation (#1689) * fix: (EventHandler) update non-committee shares upon liquidation/reactivation --------- Co-authored-by: olegshmuelov <[email protected]>
Co-authored-by: Matus Kysel <[email protected]> Co-authored-by: rehs0y <[email protected]>
…adata sync (#1805) * rename setupEventHandling to syncContractEvents * refactor beaconprotocol.UpdateValidatorsMetadata to MetadataFetcher * don't pass logger to operatorNode.Start() * don't pass logger to reportOperators * don't pass logger to p2pNetwork Setup and Start * fix TestSetupValidatorsExporter * reduce errors text in fetchAndUpdateValidatorsMetadata * extract a package for metadata updating; start it before p2p setup * minor cleanup * pass context * get rid of update metadata loop in validator controller * remove unused code from validator controller * fix tests for StartValidators * initialize metadata updater before validator controller * various fixes in metadata updater * sharesStorage -> shareStorage * remove redundant comment * avoid blocking on channel send * return shares instead of nil on timeout * fix TODO's; add tests * fix linter * review comments and some code improvements * review comments [2] * minor improvements * move metadata updater inside validator * review comments [3] * add comments * add another comment * network/p2p: extract logger changes to another PR * network/p2p: revert leftovers * resolve a busy loop * remove logic with indices diff * wrap context in reportIndicesChange * review comments * ValidatorSyncer -> Syncer * fix comment * rename receiver * get rid of fetcher * fix TestUpdateValidatorMetadata * NewValidatorSyncer -> NewSyncer * minor renames * add a comment in HandleMetadataUpdates * revert removal of active index comparison * add self subnets logic missed on merge conflicts * fix leftovers * apply changes from #1969 * filter shares by own subnets * use fixed subnets * improve the last batch sleep comment * minor rename * comments * comment * comment * logs --------- Co-authored-by: zippoxer <[email protected]>
Co-authored-by: moshe-blox <[email protected]> Co-authored-by: Matus Kysel <[email protected]>
zktaiga
reviewed
Feb 27, 2025
cmd.Parent().Short, | ||
cmd.Parent().Version, | ||
observability.WithMetrics()) | ||
observability.WithMetrics(), | ||
observability.WithTraces("stage-alloy.alloy.svc:4317", true)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Despite this being a stale PR that we'll soon revive, this should be configured via env vars.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll take it in consideration
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description/Questions/Suggestions
Are both logs and events needed for recording of the same type of event? Should it be either-or? Example.
Is
Duty ID
as a Trace Attribute necessary? Currently bothSlot
andEpoch
attributes are added separately. Would it be better to just add aCommittee
attribute instead?Duty flows are separated into
Committee
versus everything else (method/function level). This is reflected in different span names. Should we keep separating them on a span level, or should we use the same names with attributes that help differentiate duties (likessv.runner.role,
etc)Span name examples:
ssv.validator.execute_committee_duty
ssv.validator.execute_duty
ssv.validator.start_committee_duty
ssv.validator.start_duty
There are three statuses for spans:
Ok
,Error
, andUnset
. Ideally, all spans should have their status explicitly set to eitherOk
orError
—Unset
is not expected. Please use Grafana UI and verify if we receive any of the spans inUnset
status.Look into namespaces for metrics, traces, and attributes. Namespaces should be consistent across all observability "primitives". There is a chance we have some inconsistencies
Example:
ssv.validator.duty
vs.ssv.duty
. IfDuty
belongs toValidator
, usessv.validator.duty
everywhere.OpenTelemetry Specification explains why
OK
status is used without a message. ("Description MUST be IGNORED for StatusCode Ok & Unset values."). Even if the message is set for OK statuses, it will be ignored by OTeL and not displayed in Grafana (yeah, SDKs could have been better here)Some libraries needs to be updated for proper context propagation, especially for methods that perform I/O (e.g., HTTP calls). Example:
p2p.Broadcast()
.Some enums in the libraries used by SSV Node lack a
String()
method, which complicates logging and tracing.Example:
types.PartialSigMsgType
(ssv-spec
lib). Something that should potentially be implemented by these libraries (we own source code)Should these enums be moved to the
spec types
package instead?