Apache Flink

Apache Flink is distributed runtime that continuously reads data from sources like Kafka, transforms it using SQL or its API, and writes the results to destinations like Iceberg tables (stored as Parquet), databases, or other Kafka topics. It runs as a cluster to scale out to high volumes and to be resilient, and it correctly handles failure recovery to avoid duplicate writes.

Glenn's Digital Garden

Explorer

Apache Flink

Graph View