Ingest LAMP Alerts #101

Open
devinmatte opened this issue May 17, 2024 · 1 comment
@devinmatte

https://performancedata.mbta.com/lamp/tableau/alerts/LAMP_RT_ALERTS.parquet

This file (it's 84 MB) should be processed in a daily job that splits the alerts into daily files/Dynamo tables.
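
A minimal sketch of the splitting step, using only the standard library. It assumes rows arrive as dicts (for example from `pandas.read_parquet(...).to_dict("records")`), that the split key is `active_period.start_timestamp`, and that this column holds epoch seconds. None of that is confirmed here, and `split_by_day` is a hypothetical name.

```python
from collections import defaultdict
from datetime import datetime, timezone

# Assumption: alerts are bucketed by the start of their active period,
# and the *_timestamp columns are epoch seconds.
DATE_COLUMN = "active_period.start_timestamp"


def split_by_day(rows, date_column=DATE_COLUMN):
    """Group alert rows (dicts) by the UTC calendar day of `date_column`."""
    by_day = defaultdict(list)
    for row in rows:
        ts = row.get(date_column)
        if ts is None:
            continue  # a row without a start time cannot be assigned to a day
        day = datetime.fromtimestamp(ts, tz=timezone.utc).date().isoformat()
        by_day[day].append(row)
    return dict(by_day)
```

Each key of the returned dict could then become one daily file or one partition key. The sketch buckets by UTC day for simplicity; a real job would probably want the local service day instead.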

Once processed, the dashboard can pull alerts from here and fall back to v3 alerts for the current day.
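
The fallback rule could look roughly like this. `alert_source` and the `"lamp"` / `"v3"` labels are hypothetical names for illustration, and the sketch assumes the daily job has already processed every day before today.

```python
from datetime import date


def alert_source(requested: date, today: date) -> str:
    """Pick where the dashboard reads alerts for `requested`."""
    # Assumption: the daily job only covers completed days, so today
    # (or any later date) still has to come from the v3 alerts API.
    if requested >= today:
        return "v3"
    return "lamp"
```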

We still have v2 alerts we ingested to fall back on until May 2024 if we need to, but ideally this removes the need for them

@maxtkc

maxtkc commented Oct 2, 2024

Is the goal to have both daily files and Dynamo, or just Dynamo tables? For Dynamo, would it be a single table with one item per row of the parquet table? Do we want to keep all of the columns?

For reference, these are the columns:

cause
cause_detail
effect
effect_detail
severity_level
severity
alert_lifecycle
duration_certainty
header_text.translation.text
description_text.translation.text
service_effect_text.translation.text
timeframe_text.translation.text
recurrence_text.translation.text
created_datetime
created_timestamp
last_modified_datetime
last_modified_timestamp
last_push_notification_datetime
last_push_notification_timestamp
closed_datetime
closed_timestamp
active_period.start_datetime
active_period.start_timestamp
active_period.end_datetime
active_period.end_timestamp
informed_entity.route_id
informed_entity.route_type
informed_entity.direction_id
informed_entity.stop_id
informed_entity.facility_id
informed_entity.activities

I think a DynamoDB batch write is capped at 16 MB (and 25 items) per request, so the daily job may need to batch write only the past two days, with the older alerts written separately in a one-time backfill.
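
A rough sketch of how the daily job could stay under those limits: keep only recently modified rows, then write them in groups of at most 25. Filtering on `last_modified_timestamp` (assumed to be epoch seconds) is a guess at the right column, and `recent_rows` and `chunked` are hypothetical helpers.

```python
from itertools import islice

# BatchWriteItem accepts at most 25 put/delete requests per call.
MAX_BATCH_ITEMS = 25


def recent_rows(rows, cutoff_ts, column="last_modified_timestamp"):
    """Keep rows whose `column` (assumed epoch seconds) is at or after cutoff_ts."""
    return [r for r in rows if r.get(column) is not None and r[column] >= cutoff_ts]


def chunked(items, size=MAX_BATCH_ITEMS):
    """Yield successive lists of at most `size` items."""
    it = iter(items)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch
```

If the job uses boto3, `Table.batch_writer()` already buffers and chunks puts, so the manual chunking would only matter when calling `batch_write_item` directly.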
