Implementing Data Catalog Capabilities
Details
Without a Data Catalog, Data projects such as Data Governance, AI, Analytics, Data Lakes will not be successful. Without a Data Catalog the Data Lake is just a swamp, no one can find anything, and security and privacy regulations are beyond reach.
The creation of a Data Catalog may be triggered and justified to support many different business drivers, especially in Data Governance and Analytics. However, successful implementation to support those drivers can be elusive. What is needed to implement the needed capabilities fast to meet the business demand?
This session will cover strategies and priorities for a successful Data Catalog implementation
Analysts and industry experts agree that a huge risk to any data-centric project is the lack of metadata management for physical inventory, business meaning, access security, audit trail, and general asset management. The Data Catalog is a critical factor to a Data Governance and Analytics program success that must be implemented as additional pieces to the distributed data environment.
It also enables knowledge of where data lives, including PII data; this knowledge is a key part of complying with privacy regulations.
Topics covered include:
- The Data Lake / Big Data Analytics environment and the Data Catalog
- What data do we have? – managing business metadata (meaning), technical metadata (inventory), operational metadata, data lineage
- What granularity / detail is needed and realistic?
- Managing data security and privacy regulatory requirements
- What can be automated (and what can’t)?
- What should be prioritized?
- What people resources are needed?
- A look at the Alation Data Catalog (DAMA Philadelphia does not endorse specific products or services but will be providing a look at how this vendor implements some of these ideas in their product)
