653: Efficiently Glean-ing Insights from Vast Data Warehouses — with Carlos Aguilar

Topics covered
Popular Clips
Episode Highlights
Data Tools
Carlos Aguilar, founder and CEO of Glean, explains how their platform leverages existing data warehousing tools to provide powerful analytics. By utilizing technologies like Snowflake and Google BigQuery, Glean performs live computations and data profiling without extracting data, ensuring fast and efficient performance 1. Additionally, DuckDB serves as an in-process columnar database, enabling isolated testing and lightweight demos without relying on external dependencies 2. This approach allows Glean to maintain a lightweight transformation library while still delivering robust analytics capabilities.
We're just relying on that data warehouse.
---
Visualization
Glean's visualization tools, such as D3.js, enable users to create sophisticated visual representations of their data. Carlos highlights the importance of scalability and compatibility with advanced analytics functions in data warehouses like BigQuery and Snowflake 3. The platform's lightweight semantic layer allows for quick setup and easy integration, making it accessible for both technical and non-technical users 4. This ensures that users can efficiently explore and visualize data without extensive configuration.
Glean tries to have that sort of lightweight semantic layer.
---
Integration
Glean's APIs and integration strategies are designed to facilitate seamless data exploration and insights extraction. The platform supports collaborative data insights by providing tools for both technical and non-technical stakeholders, enabling them to work together effectively 5. DuckDB's integration with Apache Arrow APIs further enhances Glean's capabilities, allowing for fast and efficient data serialization and computation 2. This combination of tools and strategies ensures that Glean can cater to diverse user needs while maintaining high performance.
We try to create incredible tools for each of those types of stakeholders.
---
Related Episodes


SDS 535: How to Found, Grow, and Sell a Data Science Start-up — with Austin Ogilvie
Answers 383 questions

682: Business Intelligence Tools — with Mico Yuk
Answers 383 questions

645: Machine Learning for Video Games — with Carly Taylor
Answers 383 questions

679: The A.I. and Machine Learning Landscape — with investor George Mathew
Answers 383 questions

631: Data Analytics Career Orientation — with @LukeBarousse
Answers 383 questions

SDS 435: Scaling Up Machine Learning — with Erica Greene
Answers 383 questions

SDS 587: Data Engineering for Data Scientists — with Mark Freeman
Answers 383 questions

SDS 495: Successful AI Projects and AI Startups — with Greg Coquillo
Answers 383 questions

741: How to Visualize Data Effectively — with Prof. Alberto Cairo
Answers 383 questions

803: How to Thrive in Your (Data Science) Career — with Daliana Liu
Answers 383 questions

677: Digital Analytics — with Avinash Kaushik
Answers 383 questions

841: AI Vision, Agents and Business Value — with Andrew Ng
Answers 383 questions

657: How to Learn Data Engineering — with Andreas Kretz (@andreaskayy)
Answers 383 questions













