Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Marquez: The complete OpenLineage solution #116

Open
amotl opened this issue May 7, 2024 · 4 comments
Open

Marquez: The complete OpenLineage solution #116

amotl opened this issue May 7, 2024 · 4 comments
Labels
pitch A pitch for adding a new integration item, after evaluating it, and writing a tutorial

Comments

@amotl
Copy link
Member

amotl commented May 7, 2024

About

Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem's metadata. It maintains the provenance of how datasets are consumed and produced, provides global visibility into job runtime and frequency of dataset access, centralization of dataset lifecycle management, and much more. Marquez was released and open sourced by WeWork.

Highlights

One Source of Truth

Marquez enables consuming, storing, and visualizing OpenLineage metadata from across an organization, serving use cases including data governance, data quality monitoring, and performance analytics.

One Service for Lineage

  • Real-time metadata server
  • Unified visual graph
  • Flexible Lineage API

Resources

@amotl amotl changed the title Marquez Marquez: The complete OpenLineage solution May 7, 2024
@amotl
Copy link
Member Author

amotl commented May 7, 2024

Is it true that you may have worked with that application a while ago, @hlcianfagna? If you can recall the outcome, do you remember if it is suitable to be used together with CrateDB?

/cc @wierdvanderhaar

@hlcianfagna
Copy link
Contributor

It is true, yes, and it was working fine with CrateDB, but we did not arrive to publish an article on it, let me share with you what I got by private message.

@amotl
Copy link
Member Author

amotl commented May 7, 2024

Wonderful. Thanks for your reply. Maybe @matkuliak could slot it in, to converge your outcome into a corresponding tutorial/documentation, in the same spirit like/after finishing crate/cratedb-guide#75?

@amotl
Copy link
Member Author

amotl commented May 7, 2024

Hi. I converted a document shared by @hlcianfagna (thanks!) into Markdown format, and added it as an article on the community forum, still in "unlisted" mode. It will need verification, possible updates, and a few formatting improvements. If someone wants to pick up from there...?

-- OpenLineage with Airflow, Marquez, and CrateDB

@amotl amotl added the pitch A pitch for adding a new integration item, after evaluating it, and writing a tutorial label May 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pitch A pitch for adding a new integration item, after evaluating it, and writing a tutorial
Projects
None yet
Development

No branches or pull requests

2 participants