Data Lineage
On this page
If the origin and history of data are untraceable, it compromises data quality and security, leading to inefficient data analysis and troubleshooting. To mitigate this, fully managed Flink offers a feature to view data lineage. This lineage illustrates the relationships formed during data generation, processing, transmission, and consumption. It outlines the flow and dependencies among metadata and highlights the ties between metadata and both streaming and batch deployments. Understanding data lineage gives you a comprehensive insight into your data's journey. It also equips you with vital information for metadata management, data analysis, governance, and strategic decision-making. The following table describes the benefits of data lineage.
Data Lineage Model
The following figure shows the model of data lineage.

The diagram above contains nodes (entities) and edges (relationships). The combination of entities and relationships is displayed as data lineage.
Limits
- To see data lineage from the metadata perspective, you must use a catalog. However, to see it from the deployment perspective, you don't need a catalog.
- You can view and search data lineage for SQL deployments only.
- You must start an SQL deployment at least once to view its data lineage. After you cancel the deployment, it retains the most recent data lineage.