Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Access] Add registerDB pruning module #6397

Open
wants to merge 74 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 19 commits
Commits
Show all changes
74 commits
Select commit Hold shift + click to select a range
5584146
Added new cli flags to access and observer nodes
UlyanaAndrukhiv Aug 15, 2024
fd5fa65
Updated last commit
UlyanaAndrukhiv Aug 15, 2024
8aab5d9
Added skeleton of register db pruner module
UlyanaAndrukhiv Aug 16, 2024
08274d1
Added missed logic for checkPrune implementation for register db pruning
UlyanaAndrukhiv Aug 20, 2024
61d3be2
Added pruner integration to AN and ON
UlyanaAndrukhiv Aug 20, 2024
8910083
Added prune up to height implementation
UlyanaAndrukhiv Aug 21, 2024
b863b61
Updated iteration over the register keys, moved update first height t…
UlyanaAndrukhiv Aug 22, 2024
fc3c44c
Updated collecting keys to remove
UlyanaAndrukhiv Aug 22, 2024
64e9c9b
Added tests for register pruner
UlyanaAndrukhiv Aug 23, 2024
177651c
Added basic metrics for pruner
Guitarheroua Aug 23, 2024
33fe4a1
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Aug 23, 2024
2d847ba
Added aditional metrics
Guitarheroua Aug 26, 2024
1018ba2
Added missing metric
Guitarheroua Aug 26, 2024
0e8cc4c
Added throttle delay to prevent excessive system load during pruning
UlyanaAndrukhiv Aug 26, 2024
afe9e56
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
UlyanaAndrukhiv Aug 26, 2024
fec4ba2
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
UlyanaAndrukhiv Aug 26, 2024
b9247b6
Updated checkPrune implementation
UlyanaAndrukhiv Aug 26, 2024
97a046d
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
UlyanaAndrukhiv Aug 26, 2024
0e7f9e5
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
UlyanaAndrukhiv Aug 27, 2024
8080a15
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
UlyanaAndrukhiv Aug 29, 2024
b57bc4e
Added first batch of suggested changes according to comments
UlyanaAndrukhiv Aug 29, 2024
3ca7883
Removed ComponentManager
UlyanaAndrukhiv Aug 29, 2024
1ff10bc
Simplified prune throttle delay implementation and updated error hand…
UlyanaAndrukhiv Aug 29, 2024
25e1356
Removed unnecessary checks for metrics and updated tests
UlyanaAndrukhiv Aug 29, 2024
66e810b
Removed extra scope according to comments
UlyanaAndrukhiv Aug 29, 2024
0aada71
Fixed check for pruning
UlyanaAndrukhiv Aug 29, 2024
f039d15
Updated godoc
UlyanaAndrukhiv Aug 29, 2024
b6c8040
Added more comments
UlyanaAndrukhiv Aug 29, 2024
db82c62
Updated comments for register priner tests
UlyanaAndrukhiv Aug 29, 2024
dc984cf
Added functional test for error handling for register pruner, added m…
UlyanaAndrukhiv Aug 30, 2024
84a8f9a
Merged with master
UlyanaAndrukhiv Aug 30, 2024
bf15f92
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Sep 3, 2024
919f03b
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
UlyanaAndrukhiv Sep 9, 2024
9ed6ca8
Added basic integration test functionality
Guitarheroua Sep 10, 2024
2cb1264
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Sep 10, 2024
f040c83
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
UlyanaAndrukhiv Sep 12, 2024
fc4d6cb
Merged with master
UlyanaAndrukhiv Sep 13, 2024
87fe8fe
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Sep 19, 2024
d70106a
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Sep 19, 2024
f276544
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Sep 19, 2024
7ffc1a4
db path fix
Guitarheroua Sep 23, 2024
66418af
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Sep 24, 2024
ebcc13f
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Sep 24, 2024
10cb2f2
Fixed crashes in tests
Guitarheroua Sep 26, 2024
23f1e36
Merge branch 'master' of github.com:The-K-R-O-K/flow-go into UlyanaAn…
Guitarheroua Oct 9, 2024
d251445
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Oct 17, 2024
5833735
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Oct 30, 2024
c238bac
Fixed remarks
Guitarheroua Nov 1, 2024
e41451c
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 1, 2024
74c4f97
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 3, 2024
c4bf9f3
changed naming of test data
Guitarheroua Nov 4, 2024
d086c1b
Added log for total keys pruned and duration
Guitarheroua Nov 4, 2024
16e7887
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Nov 4, 2024
0f3b5aa
Added comments and clarify check for pruning keys
Guitarheroua Nov 4, 2024
4961567
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 5, 2024
a67e776
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 7, 2024
c1c274d
fixed remarks
Guitarheroua Nov 7, 2024
3e0a2f9
revert unnecessary changes
Guitarheroua Nov 7, 2024
b2ec85e
fixed metrics and tests
Guitarheroua Nov 7, 2024
ce04abe
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 8, 2024
b2574ab
Fixed remarks
Guitarheroua Nov 12, 2024
fdae17c
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 12, 2024
65da21d
Fixed remarks
Guitarheroua Nov 13, 2024
8145af3
Merge branch 'UlyanaAndrukhiv/6068-registerDB-pruning-module' of gith…
Guitarheroua Nov 13, 2024
0dd31f1
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 13, 2024
2775174
fixed tests
Guitarheroua Nov 13, 2024
a77a746
fixed linter issue
Guitarheroua Nov 14, 2024
250627a
added more test cases
Guitarheroua Nov 14, 2024
6b51f5d
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 14, 2024
ab4bacd
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 19, 2024
b3775a3
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Nov 20, 2024
f375cd0
fixed integration test
Guitarheroua Nov 25, 2024
496e3db
Merge branch 'master' of github.com:The-K-R-O-K/flow-go into UlyanaAn…
Guitarheroua Nov 25, 2024
1d0e753
Merge branch 'master' into UlyanaAndrukhiv/6068-registerDB-pruning-mo…
Guitarheroua Dec 2, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
170 changes: 114 additions & 56 deletions cmd/access/node_builder/access_node_builder.go
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,7 @@ import (
"github.com/onflow/flow-go/module/execution"
"github.com/onflow/flow-go/module/executiondatasync/execution_data"
execdatacache "github.com/onflow/flow-go/module/executiondatasync/execution_data/cache"
"github.com/onflow/flow-go/module/executiondatasync/pruner"
edpruner "github.com/onflow/flow-go/module/executiondatasync/pruner"
edstorage "github.com/onflow/flow-go/module/executiondatasync/storage"
"github.com/onflow/flow-go/module/executiondatasync/tracker"
finalizer "github.com/onflow/flow-go/module/finalizer/consensus"
Expand Down Expand Up @@ -173,6 +173,9 @@ type AccessNodeConfig struct {
programCacheSize uint
checkPayerBalance bool
versionControlEnabled bool
registerDBPruningEnabled bool
registerDBPruneTickerInterval time.Duration
registerDBPruneThrottleDelay time.Duration
}

type PublicNetworkConfig struct {
Expand Down Expand Up @@ -264,8 +267,8 @@ func DefaultAccessNodeConfig() *AccessNodeConfig {
executionDataIndexingEnabled: false,
executionDataDBMode: execution_data.ExecutionDataDBModeBadger.String(),
executionDataPrunerHeightRangeTarget: 0,
executionDataPrunerThreshold: pruner.DefaultThreshold,
executionDataPruningInterval: pruner.DefaultPruningInterval,
executionDataPrunerThreshold: edpruner.DefaultThreshold,
executionDataPruningInterval: edpruner.DefaultPruningInterval,
registersDBPath: filepath.Join(homedir, ".flow", "execution_state"),
checkpointFile: cmd.NotSet,
scriptExecutorConfig: query.NewDefaultConfig(),
Expand All @@ -276,6 +279,9 @@ func DefaultAccessNodeConfig() *AccessNodeConfig {
programCacheSize: 0,
checkPayerBalance: false,
versionControlEnabled: true,
registerDBPruningEnabled: false,
registerDBPruneTickerInterval: pstorage.DefaultPruneTickerInterval,
registerDBPruneThrottleDelay: pstorage.DefaultPruneThrottleDelay,
}
}

Expand All @@ -287,43 +293,46 @@ type FlowAccessNodeBuilder struct {
*AccessNodeConfig

// components
FollowerState protocol.FollowerState
SyncCore *chainsync.Core
RpcEng *rpc.Engine
FollowerDistributor *consensuspubsub.FollowerDistributor
CollectionRPC access.AccessAPIClient
TransactionTimings *stdmap.TransactionTimings
CollectionsToMarkFinalized *stdmap.Times
CollectionsToMarkExecuted *stdmap.Times
BlocksToMarkExecuted *stdmap.Times
TransactionMetrics *metrics.TransactionCollector
RestMetrics *metrics.RestCollector
AccessMetrics module.AccessMetrics
PingMetrics module.PingMetrics
Committee hotstuff.DynamicCommittee
Finalized *flow.Header // latest finalized block that the node knows of at startup time
Pending []*flow.Header
FollowerCore module.HotStuffFollower
Validator hotstuff.Validator
ExecutionDataDownloader execution_data.Downloader
PublicBlobService network.BlobService
ExecutionDataRequester state_synchronization.ExecutionDataRequester
ExecutionDataStore execution_data.ExecutionDataStore
ExecutionDataBlobstore blobs.Blobstore
ExecutionDataCache *execdatacache.ExecutionDataCache
ExecutionIndexer *indexer.Indexer
ExecutionIndexerCore *indexer.IndexerCore
ScriptExecutor *backend.ScriptExecutor
RegistersAsyncStore *execution.RegistersAsyncStore
Reporter *index.Reporter
EventsIndex *index.EventsIndex
TxResultsIndex *index.TransactionResultsIndex
IndexerDependencies *cmd.DependencyList
collectionExecutedMetric module.CollectionExecutedMetric
ExecutionDataPruner *pruner.Pruner
ExecutionDatastoreManager edstorage.DatastoreManager
ExecutionDataTracker tracker.Storage
versionControl *version.VersionControl
FollowerState protocol.FollowerState
SyncCore *chainsync.Core
RpcEng *rpc.Engine
FollowerDistributor *consensuspubsub.FollowerDistributor
CollectionRPC access.AccessAPIClient
TransactionTimings *stdmap.TransactionTimings
CollectionsToMarkFinalized *stdmap.Times
CollectionsToMarkExecuted *stdmap.Times
BlocksToMarkExecuted *stdmap.Times
TransactionMetrics *metrics.TransactionCollector
RestMetrics *metrics.RestCollector
RegisterDBPrunerMetrics *metrics.RegisterDBPrunerCollector
AccessMetrics module.AccessMetrics
PingMetrics module.PingMetrics
Committee hotstuff.DynamicCommittee
Finalized *flow.Header // latest finalized block that the node knows of at startup time
Pending []*flow.Header
FollowerCore module.HotStuffFollower
Validator hotstuff.Validator
ExecutionDataDownloader execution_data.Downloader
PublicBlobService network.BlobService
ExecutionDataRequester state_synchronization.ExecutionDataRequester
ExecutionDataStore execution_data.ExecutionDataStore
ExecutionDataBlobstore blobs.Blobstore
ExecutionDataCache *execdatacache.ExecutionDataCache
ExecutionIndexer *indexer.Indexer
ExecutionIndexerCore *indexer.IndexerCore
ScriptExecutor *backend.ScriptExecutor
RegistersAsyncStore *execution.RegistersAsyncStore
Reporter *index.Reporter
EventsIndex *index.EventsIndex
TxResultsIndex *index.TransactionResultsIndex
IndexerDependencies *cmd.DependencyList
collectionExecutedMetric module.CollectionExecutedMetric
ExecutionDataPruner *edpruner.Pruner
ExecutionDatastoreManager edstorage.DatastoreManager
ExecutionDataTracker tracker.Storage
versionControl *version.VersionControl
RegisterDB *pebble.DB
RegisterDBPrunerDependencies *cmd.DependencyList

// The sync engine participants provider is the libp2p peer store for the access node
// which is not available until after the network has started.
Expand Down Expand Up @@ -542,6 +551,10 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
requesterDependable := module.NewProxiedReadyDoneAware()
builder.IndexerDependencies.Add(requesterDependable)

// setup dependency chain to ensure register db pruner starts after the indexer
indexerDependable := module.NewProxiedReadyDoneAware()
builder.RegisterDBPrunerDependencies.Add(indexerDependable)

executionDataPrunerEnabled := builder.executionDataPrunerHeightRangeTarget != 0

builder.
Expand Down Expand Up @@ -771,16 +784,16 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
}

var err error
builder.ExecutionDataPruner, err = pruner.NewPruner(
builder.ExecutionDataPruner, err = edpruner.NewPruner(
node.Logger,
prunerMetrics,
builder.ExecutionDataTracker,
pruner.WithPruneCallback(func(ctx context.Context) error {
edpruner.WithPruneCallback(func(ctx context.Context) error {
return builder.ExecutionDatastoreManager.CollectGarbage(ctx)
}),
pruner.WithHeightRangeTarget(builder.executionDataPrunerHeightRangeTarget),
pruner.WithThreshold(builder.executionDataPrunerThreshold),
pruner.WithPruningInterval(builder.executionDataPruningInterval),
edpruner.WithHeightRangeTarget(builder.executionDataPrunerHeightRangeTarget),
edpruner.WithThreshold(builder.executionDataPrunerThreshold),
edpruner.WithPruningInterval(builder.executionDataPruningInterval),
)
if err != nil {
return nil, fmt.Errorf("failed to create execution data pruner: %w", err)
Expand Down Expand Up @@ -845,16 +858,16 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
// Note: using a DependableComponent here to ensure that the indexer does not block
// other components from starting while bootstrapping the register db since it may
// take hours to complete.

pdb, err := pstorage.OpenRegisterPebbleDB(builder.registersDBPath)
var err error
builder.RegisterDB, err = pstorage.OpenRegisterPebbleDB(builder.registersDBPath)
if err != nil {
return nil, fmt.Errorf("could not open registers db: %w", err)
}
builder.ShutdownFunc(func() error {
return pdb.Close()
return builder.RegisterDB.Close()
})

bootstrapped, err := pstorage.IsBootstrapped(pdb)
bootstrapped, err := pstorage.IsBootstrapped(builder.RegisterDB)
if err != nil {
return nil, fmt.Errorf("could not check if registers db is bootstrapped: %w", err)
}
Expand Down Expand Up @@ -886,7 +899,7 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
}

rootHash := ledger.RootHash(builder.RootSeal.FinalState)
bootstrap, err := pstorage.NewRegisterBootstrap(pdb, checkpointFile, checkpointHeight, rootHash, builder.Logger)
bootstrap, err := pstorage.NewRegisterBootstrap(builder.RegisterDB, checkpointFile, checkpointHeight, rootHash, builder.Logger)
if err != nil {
return nil, fmt.Errorf("could not create registers bootstrap: %w", err)
}
Expand All @@ -899,7 +912,7 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
}
}

registers, err := pstorage.NewRegisters(pdb)
registers, err := pstorage.NewRegisters(builder.RegisterDB)
if err != nil {
return nil, fmt.Errorf("could not create registers storage: %w", err)
}
Expand Down Expand Up @@ -993,8 +1006,31 @@ func (builder *FlowAccessNodeBuilder) BuildExecutionSyncComponents() *FlowAccess
return nil, err
}

// add indexer into ReadyDoneAware dependency passed to pruner. This allows the register db pruner
// to wait for the indexer to be ready before starting.
indexerDependable.Init(builder.ExecutionIndexer)

return builder.ExecutionIndexer, nil
}, builder.IndexerDependencies)
}, builder.IndexerDependencies).
DependableComponent("register db pruner", func(node *cmd.NodeConfig) (module.ReadyDoneAware, error) {
if !builder.registerDBPruningEnabled {
return &module.NoopReadyDoneAware{}, nil
}

registerDBPruner, err := pstorage.NewRegisterPruner(
node.Logger,
builder.RegisterDB,
pstorage.WithPrunerMetrics(builder.RegisterDBPrunerMetrics),
//pstorage.WithPruneThreshold(builder.registerDBPruneThreshold),
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

WithPruneThreshold is temporarily commented out and will be re-enabled once PR #6345 is merged.

pstorage.WithPruneThrottleDelay(builder.registerDBPruneThrottleDelay),
pstorage.WithPruneTickerInterval(builder.registerDBPruneTickerInterval),
)
if err != nil {
return nil, fmt.Errorf("failed to create register db pruner: %w", err)
}

return registerDBPruner, nil
}, builder.RegisterDBPrunerDependencies)
}

if builder.stateStreamConf.ListenAddr != "" {
Expand Down Expand Up @@ -1121,10 +1157,11 @@ func FlowAccessNode(nodeBuilder *cmd.FlowNodeBuilder) *FlowAccessNodeBuilder {
dist := consensuspubsub.NewFollowerDistributor()
dist.AddProposalViolationConsumer(notifications.NewSlashingViolationsConsumer(nodeBuilder.Logger))
return &FlowAccessNodeBuilder{
AccessNodeConfig: DefaultAccessNodeConfig(),
FlowNodeBuilder: nodeBuilder,
FollowerDistributor: dist,
IndexerDependencies: cmd.NewDependencyList(),
AccessNodeConfig: DefaultAccessNodeConfig(),
FlowNodeBuilder: nodeBuilder,
FollowerDistributor: dist,
IndexerDependencies: cmd.NewDependencyList(),
RegisterDBPrunerDependencies: cmd.NewDependencyList(),
}
}

Expand Down Expand Up @@ -1297,6 +1334,8 @@ func (builder *FlowAccessNodeBuilder) extraFlags() {
"execution-data-db",
defaultConfig.executionDataDBMode,
"[experimental] the DB type for execution datastore. One of [badger, pebble]")

// Execution data pruner
flags.Uint64Var(&builder.executionDataPrunerHeightRangeTarget,
"execution-data-height-range-target",
defaultConfig.executionDataPrunerHeightRangeTarget,
Expand All @@ -1310,6 +1349,20 @@ func (builder *FlowAccessNodeBuilder) extraFlags() {
defaultConfig.executionDataPruningInterval,
"duration after which the pruner tries to prune execution data. The default value is 10 minutes")

// RegisterDB pruning
flags.BoolVar(&builder.registerDBPruningEnabled,
"registerdb-pruning-enabled",
defaultConfig.registerDBPruningEnabled,
"whether to enable the pruning for register db")
Guitarheroua marked this conversation as resolved.
Show resolved Hide resolved
flags.DurationVar(&builder.registerDBPruneThrottleDelay,
"registerdb-prune-throttle-delay",
defaultConfig.registerDBPruneThrottleDelay,
"delay for controlling a pause between batches of registers inspected and pruned")
Guitarheroua marked this conversation as resolved.
Show resolved Hide resolved
flags.DurationVar(&builder.registerDBPruneTickerInterval,
"registerdb-prune-ticker-interval",
defaultConfig.registerDBPruneTickerInterval,
"duration after which the pruner tries to prune data. The default value is 10 minutes")
Guitarheroua marked this conversation as resolved.
Show resolved Hide resolved

// Execution State Streaming API
flags.Uint32Var(&builder.stateStreamConf.ExecutionDataCacheSize, "execution-data-cache-size", defaultConfig.stateStreamConf.ExecutionDataCacheSize, "block execution data cache size")
flags.Uint32Var(&builder.stateStreamConf.MaxGlobalStreams, "state-stream-global-max-streams", defaultConfig.stateStreamConf.MaxGlobalStreams, "global maximum number of concurrent streams")
Expand Down Expand Up @@ -1673,11 +1726,16 @@ func (builder *FlowAccessNodeBuilder) Build() (cmd.Node, error) {
builder.RestMetrics = m
return nil
}).
Module("register db metrics", func(node *cmd.NodeConfig) error {
builder.RegisterDBPrunerMetrics = metrics.NewRegisterDBPrunerCollector()
return nil
}).
Module("access metrics", func(node *cmd.NodeConfig) error {
builder.AccessMetrics = metrics.NewAccessCollector(
metrics.WithTransactionMetrics(builder.TransactionMetrics),
metrics.WithBackendScriptsMetrics(builder.TransactionMetrics),
metrics.WithRestMetrics(builder.RestMetrics),
metrics.WithRegisterDBPrunerMetrics(builder.RegisterDBPrunerMetrics),
)
return nil
}).
Expand Down
Loading
Loading