SDS 499: Data Meshes and Data Reliability — with Barr Moses

Topics covered
Popular Clips
Episode Highlights
Data Reliability
explains data reliability as akin to software service uptime, emphasizing its critical role in modern data usage. She highlights the necessity of maintaining data pipelines to ensure continuous access and decision-making capabilities. underscores the importance of this concept, noting that "the stakes are higher now" as data becomes increasingly mission-critical 1. Moses's effective communication skills are praised, reflecting her ability to convey complex technical content clearly 2.
Observability Pillars
To ensure data reliability, Moses introduces the five pillars of data observability: freshness, volume, schema, distribution, and lineage. These pillars provide a comprehensive view of data health, enabling organizations to maintain high data quality. She explains, "if you can automatically collect information, monitor those five pillars, you can actually have a holistic, unified view of the health of your data" 3. This approach draws from software engineering practices, leveraging tools like Snowflake and Databricks to enhance data processing and accuracy 4.
Accuracy Challenges
Moses discusses the challenges of maintaining data accuracy in complex organizational environments. She notes that manual verification is no longer feasible due to the vast number of data sources. "The question is, as an analogy, sort of call this kind of like data downtime," she explains, highlighting the need for rapid detection of data issues 5. Her background in the Israeli Air Force instilled a strong sense of responsibility for data accuracy, emphasizing the importance of minimizing defects in data-driven decisions 6.
Related Episodes


SDS 609: Data Mesh — with Zhamak Dehghani
Answers 383 questions

SDS 595: Data Engineering 101 — with Joe Reis and Matt Housley
Answers 383 questions

SDS 541: Data Observability — with Dr. Kevin Hu
Answers 383 questions

SDS 619: Tools for Deploying Data Models into Production — with Erik Bernhardsson
Answers 383 questions

SDS 587: Data Engineering for Data Scientists — with Mark Freeman
Answers 383 questions
SDS 468: The History of Data — with Jon Krohn
Answers 383 questions

SDS 487: Fixing Dirty Data — with Susan Walsh
Answers 383 questions

SDS 479: Knowledge Graphs — with Maureen Teyssier
Answers 383 questions

SDS 545: Scaling Data-Intensive Real-Time Applications — with Matthew Russell
Answers 383 questions

SDS 485: Financial Data Engineering — with Doug Eisenstein
Answers 383 questions

SDS 555: Sports Analytics and 66 Days of Data with @KenJee_ds
Answers 383 questions

SDS 493: Bringing Data to the People — with Anjali Shrivastava
Answers 383 questions

SDS 539: Interpretable Machine Learning — with Serg Masís
Answers 383 questions













