-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Evaluating the Kedro-Viz experience for large pipelines #1726
Comments
I think the only details I'm missing from this are two things:
I see in the linked data that you talk about projects with 25 - 35 nodes but is that a large pipeline? The reason I ask this is because your quotes talk about pipelines with 1,500 nodes experiencing the slow down. |
Updated the research objective section with this:
|
Related: kedro-org/kedro#3790 |
This is not priority for now. There are existing tickets on creating complex/larger pipelines to demo kedro/kedro-viz against. We have solved a major issue on this #1673 using Stateful URLS. We can reopen if necessary. |
Description
This research study is aimed at investigating the experience of technical users when rendering their pipelines in Kedro-Viz. These technical users include: Data Scientists, Data Engineers, and Machine Learning Engineers.
Research Objective
The primary objective is to assess the overall experience of Kedro-Viz for technical users with large pipelines.
It would seek to identify their workflow, pain points, and unmet needs.
A large project is defined as one with 1000 nodes (this is the current size warning on Kedro-Viz).
As shown in the graph below, this make up only 5% of Kedro projects. This is still important as this (internal) user group are active Kedro-Viz users.
The hypothesis are:
Supporting Data
Quantitative
link to chart
The graph above shows the proportion of users, for each node count, as an indication of their project size (Jan 2023 - Jan 2024).
There is a significant proportion of users with 25 - 35 nodes in their project, but very little (5%) with 1000 nodes and above.
Qualitative
From Slack:
This was also the premise behind creating a massive test pipeline for the kedro team here.
Method
We aim to speak to 5-8 internal/external technical users on zoom, that use Kedro-Viz.
Participants
The technical users include: Data Scientists, Data Engineers, and Machine Learning Engineers.
Who will we be speaking to?
Interview Guide (45 mins)
User details (10 mins)
User motivation and workflow (10 mins)
User pain points and workflow (20 mins)
User recommendations & Wrap up (5 mins)
What decisions will this research enable?
Research Outcomes
The text was updated successfully, but these errors were encountered: