We are interested in displaying the following metrics: jobs processed, average elapsed time, and average lag (from job created to job started).
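
The three metrics can be derived from per-job timestamps. The following is a minimal sketch under stated assumptions: `JobTimes` and `summarize_jobs` are hypothetical names used for illustration only, not part of the existing API, and timestamps are assumed to be seconds since the epoch.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, Iterable, Optional


@dataclass
class JobTimes:
    """Hypothetical per-job timestamps in seconds; not an existing library type."""
    created_at: float
    started_at: Optional[float] = None   # None until the job is picked up
    finished_at: Optional[float] = None  # None while the job is still running


def summarize_jobs(jobs: Iterable[JobTimes]) -> Dict[str, Optional[float]]:
    """Compute jobs processed, average elapsed time and average lag."""
    all_jobs = list(jobs)
    started = [j for j in all_jobs if j.started_at is not None]
    finished = [j for j in started if j.finished_at is not None]
    # Elapsed: start -> stop, only for finished jobs.
    elapsed = [j.finished_at - j.started_at for j in finished]
    # Lag: created -> started, for every job that has started.
    lag = [j.started_at - j.created_at for j in started]
    return {
        "jobs_processed": len(finished),
        "avg_elapsed": mean(elapsed) if elapsed else None,
        "avg_lag": mean(lag) if lag else None,
    }
```

Averages are reported as `None` when no job has started or finished yet, so a display layer can omit them instead of showing a misleading zero.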
Implementation
- You'll need to use package state to store extract metrics (`ExtractInfo`) and normalize metrics.
- If those elements are not present in the state, you must fall back gracefully, i.e. report only file-level progress. Job processing must stay plain: if there are files, they are processed even when the state is not present.
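
A minimal sketch of the graceful fallback described above. The state layout (a dict with an `extract_metrics` key mapping resource names to row totals) and both function names are assumptions made for illustration; the real package state may be shaped differently.

```python
from typing import Any, Mapping, Optional


def expected_rows(state: Optional[Mapping[str, Any]], resource: str) -> Optional[int]:
    """Return the total row count for a resource, or None if the state lacks it."""
    if not state:
        return None
    # "extract_metrics" is a hypothetical key, assumed for this sketch.
    metrics = state.get("extract_metrics") or {}
    total = metrics.get(resource)
    return total if isinstance(total, int) else None


def format_progress(
    resource: str,
    rows_done: int,
    files_done: int,
    files_total: int,
    state: Optional[Mapping[str, Any]] = None,
) -> str:
    """Report row-level progress when totals are known, file-level otherwise."""
    total = expected_rows(state, resource)
    if total:
        return f"{resource}: {rows_done}/{total} rows ({rows_done / total:.0%})"
    # Fallback: no usable metrics in state, report only file progress.
    return f"{resource}: {files_done}/{files_total} files"
```

The reporting helper only reads the state; it never decides whether files get processed, which keeps job processing independent of whether the state exists.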
ADDITIONAL THOUGHTS (@IlyaFaer):
There are two different cases:
- We extract and then normalize data. In this case we can take the row count from `ExtractInfo`.
- We normalize data that was extracted earlier.
Background
Progress reporting in the `normalize` and `load` steps is far from perfect.
- `normalize`: we report progress at the file level, but it is only updated when a worker process finishes
- `load`: the reported metrics do not survive restarts (see implement `LoadInfo` and `ExtractInfo` missing tracing #853)
Tasks
Step 1. Fix `normalize`:
- use `extract` metrics (per job and resource) to correctly report processed rows per resource (where we have the total number of records as well)
Step 2. Fix `load`:
- implement `LoadInfo` and `ExtractInfo` missing tracing #853: use package state to track the elapsed times (task created, start, stop of job)