Published Jan 18, 2022

SDS 541: Data Observability — with Dr. Kevin Hu

Jon Krohn is joined by Dr. Kevin Hu to unpack the critical role of data observability in modern industries, illuminating his innovative work at Metaplane and the intersection of SQL and visualization tools for superior data analysis. With insights from his Y Combinator journey, Hu explores the dynamic landscape of startup culture and the challenges of transitioning from academia to entrepreneurship.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

  • Data Observability

    Data observability is crucial for understanding the state of data infrastructure, akin to how software observability works for software systems. explains that while software engineers have tools to monitor their infrastructure, data teams often lack the metadata needed to confidently assess their data's status 1. This gap is where data observability tools like Metaplane come in, providing visibility into data systems.

    We strive to provide data teams as much visibility into the state of their data as software engineering teams have into the state of their software infrastructure.

    ---

    By focusing on metrics, metadata, lineage, and logs, Metaplane helps identify data quality issues and anomalies, ensuring data reliability and integrity 2.

       

    Metaplane Approach

    Metaplane's approach to data observability is designed to address the challenges data teams face in maintaining data quality. highlights how Metaplane acts as a first alert system, detecting anomalies and providing critical insights into data freshness and accuracy 3. This proactive monitoring is essential as data is now integral to various organizational functions, from AI and machine learning to customer interactions.

    We make so many more data assets, whether they're the tables or downstream dashboards, that no one can audit everything.

    ---

    By offering detailed information on data lineage and potential impacts, Metaplane empowers teams to prioritize and address issues effectively 4.

       

    Use Cases

    Metaplane's tools have demonstrated significant impact across diverse industries, from e-commerce to B2B SaaS. shares a case where Metaplane alerted an e-commerce company about data freshness issues due to AWS outages, enabling timely resolution 3. Such real-time insights prevent downstream disruptions and enhance operational efficiency.

    Let Metaplane do that for you. No one does.

    ---

    The platform's ability to detect skewed data from instrumentation changes further underscores its value in maintaining data integrity across complex systems 4.

Related Episodes