What Is Data Lineage, And Why Does It Matter?

Original article in English, about 1,300 words, roughly a 5-minute read. Published:

If you’ve ever had conversations with data professionals, you’ve probably heard “data lineage” pop up quite a few times. So what is data lineage all about, and why is it important?

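Data lineage is the process of tracking and visualizing how data flows and is transformed as it moves through a data pipeline or system. It gives a detailed view of where data originates, where it moves, and how it is transformed, which helps organizations improve data quality and ensure compliance. Data lineage matters for maintaining data quality, meeting regulatory requirements, troubleshooting, impact analysis, risk management, auditing and governance, and improving operational efficiency. Commonly used data lineage tools include Collibra, Informatica Axon, IBM InfoSphere Information Governance Catalog, Apache Atlas, and Erwin Data Intelligence.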
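
To make the idea concrete, here is a minimal sketch of lineage as a directed graph, where nodes are datasets and edges are transformations. It is an illustration only and does not reflect how any of the tools named above work internally. The `LineageGraph` class, its methods, and the dataset names (`crm.customers`, `mart.revenue_by_region`, and so on) are all hypothetical.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Toy lineage graph: nodes are dataset names, edges are transformations."""

    def __init__(self):
        self._downstream = defaultdict(set)  # source -> datasets derived from it
        self._upstream = defaultdict(set)    # target -> datasets it is built from
        self._transform = {}                 # (source, target) -> description

    def add_edge(self, source, target, transform):
        """Record that `target` is produced from `source` by `transform`."""
        self._downstream[source].add(target)
        self._upstream[target].add(source)
        self._transform[(source, target)] = transform

    def _walk(self, start, neighbors):
        # Breadth-first traversal that collects every reachable dataset.
        seen, queue = set(), deque([start])
        while queue:
            node = queue.popleft()
            for nxt in neighbors.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def origins(self, dataset):
        """Every dataset that feeds `dataset`, directly or indirectly."""
        return self._walk(dataset, self._upstream)

    def impact(self, dataset):
        """Every dataset affected if `dataset` changes (impact analysis)."""
        return self._walk(dataset, self._downstream)


# Hypothetical pipeline: two sources feed a reporting table and a dashboard.
lineage = LineageGraph()
lineage.add_edge("crm.customers", "staging.customers", "deduplicate")
lineage.add_edge("staging.customers", "mart.revenue_by_region", "join and aggregate")
lineage.add_edge("erp.orders", "mart.revenue_by_region", "join and aggregate")
lineage.add_edge("mart.revenue_by_region", "dashboard.sales", "visualize")

print(sorted(lineage.origins("mart.revenue_by_region")))
print(sorted(lineage.impact("crm.customers")))
```

Walking the graph upstream answers "where did this number come from?", which supports troubleshooting and audits. Walking it downstream answers "what breaks if I change this table?", which is the impact analysis mentioned above.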
