Episode 408: Mike McCourt on Voice and Speech Analysis

Topics covered
Popular Clips
Episode Highlights
Model Selection
Choosing the right machine learning model for voice data is a nuanced process. emphasizes starting with the simplest model possible to establish a baseline, which can then be iterated upon to address specific challenges 1. He shares an example from his experience analyzing the Federalist Papers to determine authorship, where he began with a basic statistical model and gradually refined it 1.
My conclusion was that Madison had written almost all of the disputed ones.
---
This approach highlights the importance of simplicity and adaptability in model selection.
Data Sensitivity
Audio quality and call length present significant challenges in voice data analysis. explains that varying call qualities, from crystal clear to garbled, require models that can handle diverse audio inputs 2. Additionally, call length affects model sensitivity; shorter calls may not provide enough data, while longer calls can introduce noise 3.
If your model is really sensitive to patterns, so that even in a really noisy transcript with bad audio quality, it can still pick out the relevant patterns.
---
Finding a balance in sensitivity is crucial for effective analysis.
Supervised vs Unsupervised
The complexity of phone call analysis often requires a blend of supervised and unsupervised learning techniques. describes how businesses provide examples of calls to train models, but the diversity in language and personal expression makes it challenging to rely solely on supervised learning 4. By combining both approaches, they can identify common themes across calls while accounting for individual variations 5.
We use a combination of unsupervised learning and a supervised algorithm.
---
This hybrid method allows for more nuanced and accurate call analysis.
Related Episodes


Episode 20: Interview Michael Stal
Answers 383 questions

SE-Radio-Episode-286-Katie-Malone-Intro-to-Machine-Learning
Answers 383 questions
Episode 115: Architecture Analysis
Answers 383 questions

Episode 130: Code Visualization with Michele Lanza
Answers 383 questions

Episode 86: Interview Dave Thomas
Answers 383 questions

Episode 111: About Us 2008
Answers 383 questions

Episode 36: Interview Guy Steele
Answers 383 questions

Episode 59: Static Code Analysis
Answers 383 questions

seradio-episode130-codeVisualizationWithMicheleLanza.mp3
Answers 383 questions

Episode 479: Luis Ceze on the Apache TVM Machine Learning Compiler
Answers 383 questions

Episode 39: Interview Steve Vinoski
Answers 383 questions

Episode 193: Apache Mahout
Answers 383 questions

Episode 395: Katharine Jarmul on Security and Privacy in Machine Learning
Answers 383 questions

Episode 206: Ken Collier on Agile Analytics
Answers 383 questions

Episode 403: Karl Hughes on Speaking at Tech Conferences
Answers 383 questions














