669: Streaming, reactive, real-time machine learning — with Adrian Kosowski

Topics covered
Popular Clips
Episode Highlights
Transformers
Transformers in data processing play a crucial role in transforming data structures efficiently. explains that these transformers differ from those in AI, like GPT models, as they focus on converting one data format to another within data flows 1. This process is essential for streaming and reactive data processing, allowing for seamless data transformation and logic expression.
A transformer is something that transforms one kind of data into another kind of data. In our case, it's a box which transforms tables into other tables.
---
Kosowski highlights the flexibility of these operations, which can iterate transformations until convergence, making them adaptable for various data engineering tasks 2.
Energy Efficiency
Energy efficiency in machine learning is becoming increasingly important, especially in distributed systems. Kosowski emphasizes that these systems optimize energy by performing local computations, reducing the need for extensive communication and computation 3. This approach mirrors natural systems, like ants, which optimize energy use by relying on external storage, such as pheromones, to minimize internal processing.
The incremental way of computation, and also streaming computations, are more energy efficient.
---
Kosowski suggests that lightweight models with the ability to add or delete data points offer an edge in energy efficiency, presenting a rich area for further research 3.
Related Episodes


671: Cloud Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions

661: Designing Machine Learning Systems — with Chip Huyen
Answers 383 questions

649: Introduction to Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions

SDS 435: Scaling Up Machine Learning — with Erica Greene
Answers 383 questions

632: Liquid Neural Networks — with Adrian Kosowski
Answers 383 questions

826: In Case You Missed It in September 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions

SDS 571: Collaborative, No-Code Machine Learning — with Tim Kraska
Answers 383 questions

657: How to Learn Data Engineering — with Andreas Kretz (@andreaskayy)
Answers 383 questions

SDS 599: MLOps: Machine Learning Operations — with @Miki_ML
Answers 383 questions

699: The Modern Data Stack — with Harry Glaser
Answers 383 questions

645: Machine Learning for Video Games — with Carly Taylor
Answers 383 questions

819: PyTorch: From Zero to Hero — with Luka Anicin
Answers 383 questions

SDS 605: Upskilling in Data Science and Machine Learning — with Kian Katanforoosh
Answers 383 questions














