Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🚀 feat(roadmap): add tools related to CDC, logging, streaming platform, data integration, and web development frameworks (#9) #9

Merged
merged 14 commits into from
May 29, 2024

Conversation

FaridFArab
Copy link
Contributor

No description provided.

@FaridFArab
Copy link
Contributor Author

Can anyone check and merge this please ?

@mostafaghadimi
Copy link
Contributor

Hi @FaridFArab jan, thank you for your contribution. Yes, of course. Your invaluable contribution will be reviewed soon and will let you know.

Copy link
Contributor

@niyushabaghayi niyushabaghayi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @FaridFArab for your contribution, I think the changes are good to be added.

Go:
- Fiber
- Gin
Java:
- Spring
- SpringBoot
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's Spring Boot I think, please add a space.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @niyushabaghayi. I did it and it is OK now.

Copy link
Contributor

@mostafaghadimi mostafaghadimi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @FaridFArab jan for your invaluable contribution.

Your PR was reviewed by our colleagues, and it can be merged after applying a few changes.

Yours,
Mostafa

roadmap.yaml Outdated
Comment on lines 38 to 43
ETL:
- Petl
- Pandas
- PySpark
- Requests
- Beautiful Soup
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is a specific section for ETL and some of these frameworks (libraries) are not ETL inherently. IMO, I suggest to remove this part, because it's not necessary to have this granularity for roadmap! You can also contribute on our wiki project.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes I agree with you @mostafaghadimi but we don't mention any ETL frameworks there. I think we can keep them now and if we add more tools and libraries we divide them into the appropriate places.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They are related somehow but not specifically developed for ETL.
As an example, beautiful soup and requests only help us with E (extract) and have nothing to do with T (transform) and L (load).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK I will remove Requests and Beautiful Soup from there.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@FaridFArab, Let's remove ETL section from this part.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

roadmap.yaml Outdated
- Flask
- Django
- FastAPI
Web Development and API:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this line should be removed (it is under web framework and API development section and caused redundancy)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I separated Python frameworks and libraries.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now this section looks like:

Web Frameworks and API Development:
…
      Web Development and API:

Which caused redundancy. It’s data engineering roadmap and we can keep related topics with higher of granularity.

Do you agree with that Farid jan? In case of any disagreement feel free to share your opinion with us.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK I will remove them.

roadmap.yaml Outdated
Comment on lines 184 to 187
- Kafka Connect
- Schema Registry
- KSQL
- Kafka Rest
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are they the core components of the Kafka? No. I think they can be placed in another section or may be we can elaborate these components on wiki.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes I think in data engineering world we have to use this tools and a company like Confluent included all of them into a Kafka cluster. I will add them into the wiki but I think we can keep them.

Copy link
Contributor

@mostafaghadimi mostafaghadimi May 26, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Of course, about 80 percent of top 500 fortunate companies use Kafka and related tools like schema registry in their stacks. I totally agree with you on the importance of these tools. However, mentioning them under Kafka can misguide our readers.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So what's your idea for them ?

Copy link
Contributor

@mostafaghadimi mostafaghadimi May 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It can be added in a separated section or can be added and dive deeply to it on wiki!

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

roadmap.yaml Outdated
- Great Expectations
- Soda Core
- Data Security:
Hashicorp Vault
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It should be a list, as well.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do you mean ? need changes ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It should be like the following:

    - Data Security:
        - Hashicorp Vault

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

roadmap.yaml Outdated
Comment on lines 179 to 180
- Connector:
- Kafka Connect
Copy link
Contributor

@mostafaghadimi mostafaghadimi May 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the term connector frequently used and understood in the context of messaging and streaming platforms?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes connector used for connect 2 platforms together and here I mean Kafka connect

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about the other tools like NATS and Pulsar? Is it still in common?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

and apply this one to another PR, please!

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

roadmap.yaml Outdated
Comment on lines 66 to 67
- KSQL
- Kafka Stream
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's review the roadmap again, which is accessible through this link. Here we don't introduce any tools. The tools are introduced in its dedicated section (data processing).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I removed them.

roadmap.yaml Outdated
Comment on lines 178 to 179
- Schema Control:
- Schema Registry
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am of the opinion that schema control is a distinct section. From a bird's-eye view perspective, it should be positioned as an independent section. It's not only related to the messaging and streaming platform.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let me separate it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's finalize this PR and have it in another PR.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's finalize this PR and have it in another PR.

Do you mean I make my change to another PR ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm, I mean just remove the schema control from this PR and add it in another PR to discuss about it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ahaa OK.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

Copy link
Contributor

@mostafaghadimi mostafaghadimi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @FaridFArab jan, That's LGTM.

@mostafaghadimi mostafaghadimi changed the title Improvement and Add more items 🚀 feat(roadmap): add tools related to CDC, logging, streaming platform, data integration, and web development frameworks May 29, 2024
@mostafaghadimi mostafaghadimi changed the title 🚀 feat(roadmap): add tools related to CDC, logging, streaming platform, data integration, and web development frameworks 🚀 feat(roadmap): add tools related to CDC, logging, streaming platform, data integration, and web development frameworks (#9) May 29, 2024
@mostafaghadimi mostafaghadimi merged commit dc682d4 into data-burst:main May 29, 2024
1 check passed
@mostafaghadimi
Copy link
Contributor

Well done @FaridFArab jan, welcome to DataBurst. <3

mostafaghadimi pushed a commit that referenced this pull request May 30, 2024
…m, data integration, and web development frameworks (#9) (#9)

* (Improvement) - Add Webframework and correct some of them.

* (Improvement) - Add CDC Debezium, DataQuality frameworks, DataLineage tools, pipeline Orchestrations and NATS streaming platform.

* (Improvement) - Add Hashicorp Vault to Data Security.

* Update roadmap.yaml
mostafaghadimi pushed a commit that referenced this pull request May 30, 2024
…m, data integration, and web development frameworks (#9) (#9)

* (Improvement) - Add Webframework and correct some of them.

* (Improvement) - Add CDC Debezium, DataQuality frameworks, DataLineage tools, pipeline Orchestrations and NATS streaming platform.

* (Improvement) - Add Hashicorp Vault to Data Security.

* Update roadmap.yaml
mostafaghadimi pushed a commit that referenced this pull request May 30, 2024
…m, data integration, and web development frameworks (#9) (#9)

* (Improvement) - Add Webframework and correct some of them.

* (Improvement) - Add CDC Debezium, DataQuality frameworks, DataLineage tools, pipeline Orchestrations and NATS streaming platform.

* (Improvement) - Add Hashicorp Vault to Data Security.

* Update roadmap.yaml
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants