Schema on Read Doesn’t Mean Governance on Read
Brian Cox
CapTech Consulting


Monday, March 30, 2015
12:00 PM - 12:30 PM

Level:  Intermediate

Companies today face growing pressure to capitalize on Big Data, which is becoming a cost of doing business rather than a competitive advantage. They are making substantial investments to collect and analyze the ever-growing volume of available data.

This pressure has led many companies to make decisions without planning ahead for data management and governance, which hamstrings their ability to mine the data they are collecting. The basic principles of data management and governance not only still apply but become even more important in Big Data implementations.

Drawing on practical project examples, this presentation will discuss the critical data governance and metadata best practices that drive Big Data success. These include, but are not limited to:

  • The ability to capture custom business metadata
  • Automatic cataloging of Hadoop technical metadata
  • Search capability to enable data discovery in the data hub / lake
  • Full data lineage at both the file and column level

Mr. Cox has 15 years of hands-on experience with data technologies, ranging from transactional systems to enterprise data warehouses. His breadth of knowledge includes technical design, data modeling, development, training, testing and system implementation. He is a key thought leader in CapTech's Big Data Practice, working to evolve traditional warehousing skills into ones useful in the ever-changing Big Data landscape. Mr. Cox trains and mentors consultants on both existing and new technologies within the ETL and Big Data space, including Ab Initio, SSIS, Informatica, MapReduce/Hadoop, Pig and Hive.
