Published Sep 3, 2019

SE-Radio Episode 346: Stephan Ewen on Streaming Architecture

Join Stephan Ewen as he explores the transformative architecture of Apache Flink, revealing its groundbreaking approach to unifying batch and stream processing while predicting the future of stream technologies. Discover the advantages and challenges of stream processing, including real-time data handling and resource efficiency.
Episode Highlights
Software Engineering Radio - the podcast for professional software developers logo

Popular Clips

Episode Highlights

  • Processing Comparison

    In the realm of data processing, the choice between batch and stream processing is pivotal. highlights that stream processing offers significant advantages by minimizing delays in data computation, allowing for real-time data processing. This immediacy is crucial as it ensures data remains fresh and valuable, which is often not the case with batch processing that introduces delays due to its periodic nature 1. Ewen explains that stream processing aligns more naturally with how data is produced, continuously and in an unbounded fashion, making it a more intuitive approach compared to the arbitrary boundaries imposed by batch processing 2.

    Stream processing really helps with immediately reacting to data when it's still very fresh, when it's very valuable.

    ---

    This natural alignment with data production is driving the growing interest in stream processing technologies.

       

    Historical Shift

    The evolution from batch to stream processing is rooted in technological advancements and changing data needs. Historically, batch processing was the norm due to the immaturity of stream processing technologies, which required data to be segmented into finite sets for analysis 2. notes that as stream processing technologies have matured, there is a shift back to treating data as continuous streams, which is a more natural representation of how data is produced 3.

    As stream processing technology continues to mature, we actually see that a lot of people are very interested in again, adopting this quite natural paradigm of processing a stream as a stream.

    ---

    This shift is driven by the need for real-time insights and the ability to handle increasing volumes of data efficiently.

Related Episodes