How do you view data governance and modeling in a big and fast data world?

Seth Sanusi
Mar 27, 2021
2 min read

Published on 2018-06-27 17:36 (LinkedIn: sethsanu)

Depending on who you ask there are three or more “V”’s of Big Data attributes: Volume, Velocity, and Variety (additionally Veracity, Value, Vicinity and any other pertinent V noun). Enterprise Data governance programs are set up to ensure the quality, safekeeping and optimal utilization of data. The principles of good data governance can be mapped to each “V” of big data:

· Volume: With larger amounts of structured and unstructured data being utilized, a data governance program should have a good inventory or both data and metadata. Dynamic data catalogs, data dictionaries and data profiles are becoming more commonplace in large scale data lakes just as they are in relational and analytical platforms.

· Velocity: As organizations ingest fast moving streams of data through IoT devices, social media and globally distributed applications, it’s important to manage the practicality of data in motion. Some consuming systems may need real time streams of data, while others may only require daily aggregations. Creating an understanding of how data is consumed by each application has value; do you know where your lambda architecture needs are?

· Variety: The schema of right now, may not be the same schema of tomorrow. The format of unstructured data may change, structured data platforms may migrate. Tools are available that can swiftly scan data for changes in structure and content and alert or adapt downstream data consumers. Well designed consuming systems are reactive and resolute to changing data with late binding schema designs or written for adaptability. The dashboard that consumes relational SQL data can be updated for Polybase queries against a data lake with little code churn.

There are additional governance considerations per industry, region and technology set. For a more comprehensive perspective, I highly recommend Sunil Soares's book "Big Data Governance: An Emerging Imperative" available from Amazon:

https://www.amazon.com/Big-Data-Governance-Emerging-Imperative/dp/1583473777

38 Comments

Orion Hunter

12 hours ago

I found the information on this home service providers platform very useful for comparing local businesses and checking reviews before making a booking decision for household services.

peterlenb

Apr 29

In an era of college football defined by the transfer portal's chaos and NIL deals that would make Fortune 500 executives blush, loyalty has become the rarest of commodities. Jeremiah Smith, the Ohio State wide receiver universally regarded as the best player in college football, recently turned down a transfer offer exceeding $10 million to remain a Buckeye . It was a decision that stunned the sport—and one that cemented his legacy before he ever plays another down. Jeremiah Smith Ohio State Jersey

peterlenb

Apr 27

In the annals of American sports, no family name carries more weight than Manning. From Archie's heroic days in a New Orleans Saints uniform to Peyton's five MVP awards and two Super Bowl rings, to Eli's two Super Bowl victories over Tom Brady, the Manning dynasty has defined quarterback excellence for three generations. Now, the torch has passed to Arch Manning—the 6-foot-4, 219-pound redshirt junior at the University of Texas who carries the weight of his family's legacy while determined to write his own chapter. Arch Manning Texas Jersey

peterlenb

Apr 26

From the heavy cotton jerseys of the 1980s to the high-tech, skin-tight uniforms of today, NCAA basketball and football apparel have always been about more than just covering the players. They are a canvas for tradition, a battleground for corporate innovation, and a multi-billion dollar statement of identity. The story of the NCAA jersey is a fascinating intersection of technology, marketing, culture, and strict—but evolving—regulation. Johnny Manziel Texas A&M Jersey

peterlenb

Apr 25

Iamaleava has played in 29 games and made 25 starts over three seasons at UCLA (2025) and Tennessee (2023-24) … has completed 449-of-702 pass attempts (64.0%) for 4,858 yards and 34 touchdowns with 12 interceptions in his career … has rushed 241 times for 934 yards and 10 touchdowns … totaled 2,930 passing yards and 21 touchdowns on 241-of-379 passing (64.0%) with five interceptions during his time at Tennessee … rushed for 435 yards on 129 attempts with six touchdowns as a Volunteer … career-long completion in 86 yards, occurring in Week 12 of 2024 at Vanderbilt … career-long rush is 52 yards, coming against Penn State in Week 5 of 2025 … career high in passing yards is 314…

Post: Blog2_Post

How do you view data governance and modeling in a big and fast data world?

Recent Posts

38 Comments

Subscribe Form