-
BTW, this is something that should have improved significantly recently, thanks to the back-end work done by @lamppu.
Yes, displaying time series in general is on our internal agenda as well. I'd also like to see e.g. how analyzer and scan times evolved across runs on a given repository.
-
Gathering statistics of various kinds has always been a goal for ORT Server. Your idea is similar to something I had in mind regarding a general optimization of the database: currently, the schema is rather normalized, and most entities are deduplicated. Especially as the database grows in size, this is likely to have a negative impact on ORT runs, since INSERT operations become slower (due to the additional queries needed to check for the presence of entities during deduplication).

To prevent this, there could be different schemas: a rather temporary one for ORT runs currently in progress, and one storing the relevant results. In the first schema, no deduplication is done; all data gathered during a run is simply inserted. This should be fast, and it should also make it easy for a processing step to access the data of previous steps from the current run. After the completion of the run, or on a periodic schedule, the run schema is transformed asynchronously into the results schema, which is designed to support queries for the relevant statistics efficiently. The transformation can be expensive, but since it runs in the background, users should not be impacted. It would even be possible to use different databases for these purposes, or maybe specialized NoSQL databases if required by use cases.
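To make this a bit more concrete, here is a minimal Kotlin sketch of how such a split could look; all names (`RunStore`, `ResultsStore`, `transformCompletedRun`) are hypothetical and not part of the current code base:

```kotlin
import java.util.UUID

/** Append-only store backing the "run" schema: no deduplication, so INSERTs stay cheap. */
interface RunStore {
    fun insertPackage(runId: UUID, pkg: PackageRecord)    // plain insert, no existence check
    fun packagesOfRun(runId: UUID): List<PackageRecord>   // cheap access for later processing steps
}

/** Deduplicated, query-optimized store backing the "results" schema. */
interface ResultsStore {
    fun upsertPackage(pkg: PackageRecord): Long           // deduplication happens here, off the hot path
    fun linkRunToPackage(runId: UUID, packageId: Long)
}

data class PackageRecord(val identifier: String, val purl: String)

/**
 * Background job: after a run has finished (or on a schedule), move its data from the
 * run schema into the results schema. This can be expensive, but no user is waiting on it.
 */
fun transformCompletedRun(runId: UUID, runStore: RunStore, resultsStore: ResultsStore) {
    for (pkg in runStore.packagesOfRun(runId)) {
        val packageId = resultsStore.upsertPackage(pkg)
        resultsStore.linkRunToPackage(runId, packageId)
    }
}
```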
-
I agree with the general idea of having a denormalized read model for these kinds of requests. For the implementation, we should try to keep it simple; for example, I would like to avoid the mentioned hybrid approach.
-
Motivation
Pain: Currently, the ORT Server UI dashboard displays key statistics such as the number of vulnerabilities, issues, rule violations, and packages. However, retrieving this data from the backend is quite slow, as it requires extensive queries to the database. This results in users having to wait several seconds for the UI activity indicators to disappear before the numbers are displayed.
Once the numbers of vulnerabilities, issues, rule violations, and packages are there, they are nice to look at, but wouldn't it be even nicer to see how these numbers have developed over time, e.g. by displaying a historical trend graph in the background?
From → To: screenshots comparing the current dashboard (plain numbers) with a version showing historical trend graphs (images omitted).
High-Level Solution Idea
Add a basic event log to the system
We could start by introducing a basic event log (as a database table) to track important system activities; a rough table sketch follows below. Initial events could be:
- Whenever a scan is started or finished, an event is written to the event log.
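If I remember correctly, ORT Server's persistence layer is based on the Exposed DSL; assuming that, a minimal sketch of such an event log table might look as follows (table and column names are purely illustrative):

```kotlin
import org.jetbrains.exposed.sql.Table
import org.jetbrains.exposed.sql.javatime.timestamp

// Hypothetical append-only event log; every row records one system activity.
object EventLogTable : Table("event_log") {
    val id = long("id").autoIncrement()
    val eventType = varchar("event_type", 64)               // e.g. "SCAN_STARTED", "SCAN_FINISHED"
    val payload = text("payload")                            // JSON with run / repository identifiers etc.
    val createdAt = timestamp("created_at")
    val processedAt = timestamp("processed_at").nullable()   // null = not yet picked up by the batch job

    override val primaryKey = PrimaryKey(id)
}
```

Keeping the table append-only means writing an event is a single cheap INSERT on the hot path of a run.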
Asynchronously process these events (batch processing)
We could have a periodic job that scans the event log for events that are not yet processed and processes them.
Processing could mean, for example, aggregating the logged events into the pre-calculated statistics used below (e.g. counts of vulnerabilities, issues, rule violations, and packages over time) and storing them in dedicated tables.
It should also be possible to change the batch-processing code, reset the batch-processing database tables, and re-process all (or parts) of the event log. A sketch of such a job is shown below.
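As a rough sketch of that periodic job (the repository interfaces and the daily-count aggregation are just assumptions to illustrate the idea):

```kotlin
import java.time.Instant
import java.time.temporal.ChronoUnit

// Hypothetical shapes; none of these types exist in ORT Server today.
data class Event(val id: Long, val type: String, val payload: String, val createdAt: Instant)

interface EventLogRepository {
    fun fetchUnprocessed(limit: Int): List<Event>
    fun markProcessed(ids: List<Long>, processedAt: Instant)
}

interface StatisticsRepository {
    /** Upsert a pre-aggregated row, e.g. "scans finished per day". */
    fun addToDailyCount(metric: String, day: Instant, delta: Int)
}

/** Periodic job: fold unprocessed events into pre-calculated statistics tables. */
fun processEventBatch(events: EventLogRepository, stats: StatisticsRepository, batchSize: Int = 500) {
    val batch = events.fetchUnprocessed(batchSize)
    if (batch.isEmpty()) return

    batch
        .groupBy { it.type to it.createdAt.truncatedTo(ChronoUnit.DAYS) }
        .forEach { (key, group) ->
            val (type, day) = key
            // Each event contributes a delta, so re-processing the log is only correct
            // after the statistics tables have been reset, as described above.
            stats.addToDailyCount(metric = type, day = day, delta = group.size)
        }

    events.markProcessed(batch.map { it.id }, Instant.now())
}
```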
Optimize UI dashboard performance by using these pre-calculated numbers
To ensure both responsiveness and accuracy, the UI dashboard could use a hybrid approach inspired by the Lambda Architecture: serve the pre-calculated numbers from the batch-processed tables immediately, and top them up with the small delta derived from events the batch job has not processed yet. A sketch of this read path is shown below.
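As a sketch of the hybrid read path (again with made-up repository interfaces and parameters):

```kotlin
// Hypothetical read model combining the batch view with a small real-time delta.
data class DashboardStatistics(
    val vulnerabilities: Long,
    val issues: Long,
    val ruleViolations: Long,
    val packages: Long,
)

interface PrecalculatedStatsRepository {
    /** Cheap single-row lookup of the totals produced by the batch job. */
    fun currentTotals(organizationId: Long): DashboardStatistics
}

interface RecentEventsRepository {
    /** Small delta derived from events the batch job has not processed yet. */
    fun unprocessedDelta(organizationId: Long): DashboardStatistics
}

/** Batch view + speed view: responsive and still accurate, without heavy ad-hoc queries. */
fun dashboardStatistics(
    organizationId: Long,
    precalculated: PrecalculatedStatsRepository,
    recent: RecentEventsRepository,
): DashboardStatistics {
    val base = precalculated.currentTotals(organizationId)
    val delta = recent.unprocessedDelta(organizationId)
    return DashboardStatistics(
        vulnerabilities = base.vulnerabilities + delta.vulnerabilities,
        issues = base.issues + delta.issues,
        ruleViolations = base.ruleViolations + delta.ruleViolations,
        packages = base.packages + delta.packages,
    )
}
```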
Feedback & Next Steps
I’d love to hear your thoughts on this approach. Are there any potential challenges or alternative solutions we should consider? Would this enhancement align with ORT Server’s long-term goals?