Published Oct 18, 2024

828: Are “Citizen Data Scientists” A Myth? — with Keith McCormick

Jon Krohn and Keith McCormick delve into the evolving landscape of data science, discussing the balance between automated tools like AutoML and expert guidance, and debating the practicality of 'citizen data scientists' utilizing low code platforms such as KNIME and IBM SPSS Modeler for impactful enterprise solutions.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

  • AutoML Efficiency

    AutoML is seen as a tool that enhances efficiency in data science but doesn't replace the need for expert knowledge. emphasizes that while AutoML can speed up processes, it cannot fully automate the machine learning lifecycle, which still requires expert oversight 1. He argues that the notion of democratizing data science through low code and no code tools is more of a marketing strategy than a practical reality 1.

    In the hands of an expert, AutoML is a workforce multiplier, just like the best of GenAI done well. It speeds things. It saves you a few minutes here, a few minutes there. Next thing you know, you're working twice as fast.

    ---

    and Keith also discuss the importance of understanding the limitations of AutoML, especially when explaining its capabilities to management 2.

       

    Team Roles

    The introduction of AutoML is reshaping the roles within data science teams, but it doesn't eliminate the need for skilled professionals. Keith points out that while AutoML can handle certain tasks, it cannot replace the nuanced understanding that expert data scientists bring to the table 1. He stresses that AutoML should be viewed as a tool that complements, rather than replaces, the expertise of data teams 1.

    You don't fire the team. You still need the team. You just change the way they work.

    ---

    adds that the myth of the "citizen data scientist" is challenged by the reality that effective data science requires a deep understanding of both the tools and the data 3.

Related Episodes