Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: How do i use Ruptures to detect large data streaming? #320

Closed
nntp4 opened this issue Jan 17, 2024 · 2 comments
Closed

Question: How do i use Ruptures to detect large data streaming? #320

nntp4 opened this issue Jan 17, 2024 · 2 comments

Comments

@nntp4
Copy link

nntp4 commented Jan 17, 2024

Description

I need to process more than 10 TB of data in Kafka clusters per day.
In other words how do I use ruptures with the distributed system to process large data streaming?

@tg12
Copy link

tg12 commented Jan 17, 2024

This question doesn't make much sense. It's time to close the book on this one. This is largely for Offline Change Point Detection project, but honestly, this isn't the place for it. Change point detection methods fall into two categories: online methods, which spot changes in real-time, and offline methods, which look back after all data is in. If you want to dive deeper, check out this Wikipedia page on change detection: https://en.wikipedia.org/wiki/Change_detection. Try looking for Bayesian Online Changepoint Detection if you are interested in streaming.

@nntp4
Copy link
Author

nntp4 commented Jan 18, 2024

@tg12 THX your reply, I close this issue now

@nntp4 nntp4 closed this as completed Jan 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants