
feat: Add deployment config #77

Open · wants to merge 9 commits into main
Conversation

@ayirr7 (Member) commented Mar 19, 2025

Adds deployment configuration files that operate on a per-pipeline basis. Each file has an env section for general configuration (e.g. topics) and a pipeline section for the local/dev configs, and allows additional runtime-specific sections; for example, there is a flink section.
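
A minimal sketch of the shape described above, assuming hypothetical key and topic names (myinput, myreduce, bootstrap_servers, etc.); the actual schema in this PR may differ:

```yaml
env:                      # general configuration, e.g. stream-to-topic mapping
  topics:
    myinput: my-input-topic

pipeline:                 # local / dev configuration
  parallelism: 1

flink:                    # runtime-specific section for the FlinkAdapter
  parallelism: 2
  sources_config:
    myinput:
      bootstrap_servers: localhost:9092   # assumed field, for illustration
  reduce_config:
    myreduce:
      parallelism: 3
```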

Adds a flag to the runner (-c) to denote container mode. This is useful for toggling between the local/dev configuration and the container/non-local configuration. For the FlinkAdapter, this means we can run the same example locally and in container mode down the road. Open to other suggestions for how to express this in the config format, etc.
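
A hedged sketch of how the runner flag might look, assuming an argparse-based CLI and PyYAML; the actual runner code may differ:

```python
import argparse

import yaml  # PyYAML


def main() -> None:
    parser = argparse.ArgumentParser(description="Run a pipeline")
    parser.add_argument("config", help="path to the deployment config file")
    parser.add_argument(
        "-c", "--container",
        action="store_true",
        help="container mode: use the container/non-local configuration",
    )
    args = parser.parse_args()

    with open(args.config) as f:
        config = yaml.safe_load(f)

    # In container mode use the runtime-specific section (here: flink);
    # otherwise fall back to the local/dev "pipeline" section.
    section = config["flink"] if args.container else config["pipeline"]
    print(section)


if __name__ == "__main__":
    main()
```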

  myreduce:
    parallelism: 3

flink:
@ayirr7 (Member, Author) commented Mar 20, 2025

I'd like us to move towards runtime-specific overrides (right now the runtime section is a full copy of the general pipeline config). I wanted to use Pydantic for some utils that make operations like deep updates of dictionaries easier, but I can't manage to install it.
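
A minimal deep-merge sketch, assuming plain dicts loaded from YAML and no Pydantic; the function name deep_update is hypothetical:

```python
def deep_update(base: dict, overrides: dict) -> dict:
    """Return a copy of base with overrides merged in recursively."""
    merged = dict(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Recurse into nested dicts so an override only replaces
            # the keys it actually sets.
            merged[key] = deep_update(merged[key], value)
        else:
            merged[key] = value
    return merged


# e.g. deep_update(config["pipeline"], config.get("flink", {})) would let
# the flink section override only the keys it sets.
```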

@ayirr7 changed the title from "wip: deployment config" to "feat: Add deployment config" on Mar 20, 2025
@ayirr7 (Member, Author) commented Mar 20, 2025

Haven't split the PR yet because I think seeing the whole diff is useful.

@ayirr7 ayirr7 marked this pull request as ready for review March 20, 2025 06:44
@fpacifici (Collaborator) left a comment

A couple of high-level suggestions (a sketch of both follows this list):

  • Define a schema for the config so we can validate it upfront rather than fail when a missing field is loaded. You can use jsonschema (yes, it is YAML, but JSON is a subset, so you can load the YAML and validate the resulting structure with jsonschema).
  • Let's have a config class that provides some solid abstractions to the specific components (I'd recommend TypedDicts, which are lightweight but still type-checkable).
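
A sketch of both suggestions, assuming the config shape from the PR description; the schema and field names are illustrative, not the final design:

```python
from typing import TypedDict

import jsonschema
import yaml

# Validate upfront so a missing field fails with a clear error
# instead of a KeyError deep inside the runner.
CONFIG_SCHEMA = {
    "type": "object",
    "required": ["env", "pipeline"],
    "properties": {
        "env": {"type": "object"},
        "pipeline": {"type": "object"},
        "flink": {"type": "object"},
    },
}


class FlinkConfig(TypedDict, total=False):
    """Lightweight, type-checkable view of the flink section."""

    parallelism: int
    sources_config: dict
    reduce_config: dict


def load_config(path: str) -> dict:
    with open(path) as f:
        config = yaml.safe_load(f)  # JSON is a subset of YAML
    jsonschema.validate(config, CONFIG_SCHEMA)
    return config
```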

@@ -10,7 +10,7 @@ x-flink-config: &flink-config |
     jobmanager.memory.process.size: 1024m
     jobmanager.rpc.address: jobmanager
     taskmanager.memory.process.size: 1024m
-    taskmanager.numberOfTaskSlots: 2
+    taskmanager.numberOfTaskSlots: 10
Collaborator
?

Member Author
I did this so I can test out parallelism of steps with Flink

Collaborator
ok

Collaborator
I think we are missing the concept of pipeline segments to define where we break chains.
Plus, I am not sure the adapter-specific config should be allowed to override any parameter. For example, the flink section should not be able to override the Kafka broker config. This makes me think there should be a separation between adapter-specific config and generic config.

What about something like this:

```mermaid
graph TD;
   subgraph pipeline
     subgraph segment
        subgraph step
            common_config
            subgraph adapter_config
                adapter1
                adapter2
            end
         end
     end
  end
```

Here a pipeline is composed of segments (or chains), and the steps of a chain are wired up together in the same worker.
In each chain we provide config for each step (that needs config).
Each step config can have a common config and a list of config elements per adapter.

There should be a way to provide some common environment, like you did, to map streams to topics. A sketch of this layout follows.
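
A hypothetical YAML rendering of that hierarchy; all names below are illustrative:

```yaml
pipeline:
  segments:
    - name: segment1            # steps in a segment are wired into one worker
      steps:
        - name: myreduce
          common_config:        # generic config, not overridable per adapter
            group_by: key
          adapter_config:       # adapter-specific config elements
            flink:
              parallelism: 3
```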

Member Author
Will it be the case that every Step will have access to the chain or segment it belongs to?

flink:
  parallelism: 2
  sources_config:
    myinput:
Collaborator
In #79 I abstracted away the type of source/sink, so we should pick the right source type in configuration. You should have a representation of a KafkaSource (with Kafka parameters), opening up the possibility of other source types. Something like the sketch below.
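
A hypothetical sketch of a typed source config; the field names are illustrative:

```yaml
sources_config:
  myinput:
    type: kafka                        # selects a KafkaSource implementation
    bootstrap_servers: localhost:9092  # Kafka-specific parameters
    topic: my-input-topic
```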


reduce_config:
  myreduce:
    parallelism: 3
Collaborator
Isn't parallelism a property of a segment that contains a sequence of steps?
