Why Choose One: Multi-Engine Analytics with Apache Wayang #bbuzz

More: https://2026.berlinbuzzwords.de/sessi... Speaker: Zoi Kaoudi, Haralampos Gavriilidis Choosing the best engine for each data task sounds right, but in modern data stacks doing so requires expertise and effort. Apache Wayang, a recently graduated TLP, addresses this by decoupling logical dataflows from execution engines. From big data platforms to SQL and ML engines, Wayang enables cross-platform execution that maximizes performance. Modern analytics pipelines frequently span databases, big data engines, and machine learning frameworks. Connecting these systems manually leads to complex orchestration, high data movement cost, and platform-specific rewrites. This challenge also appears in agent driven workflows where different steps of a task naturally map to different engines. Apache Wayang is a recently graduated Apache Top Level Project that provides a unified data analytics framework for cross-platform execution. Pipelines are expressed with platform independent operators using Java, Scala, Python, or SQL APIs. A cross-platform optimizer then maps operators to execution backends such as Spark, Flink, JDBC databases, and ML systems, and produces execution plans that may span multiple engines. It models operator and data movement cost and supports runtime re optimization when estimates are wrong. In practice, this lets developers write a pipeline once and run it efficiently across multiple engines without hard-wiring platform choices. The talk is technical and system focused, aimed at practitioners working with heterogeneous data stacks. It has three parts: 1. Motivation (10-15min) Why single engine execution is often not enough. Concrete ETL, ML, and agent-based workflows that require multiple systems and create optimization and integration challenges. 2. System architecture and optimizer (20-25min) Wayang’s platform agnostic plans, operator mappings, cross-platform data movement handling, and stage-based execution model. How the cost-based optimizer inflates plans, evaluates alternatives, and selects mixed engine execution strategies. Brief coverage of SQL, ML, and multi-language UDF support. 3. Project history, status, and next steps (5-10min) From multi year cross-platform analytics research to Apache and recent Top Level Project graduation. Extensibility for new platforms and current work on improved cost models and optimizer enhancements. Attendees will gain a practical understanding of how cross-platform analytics can be executed efficiently and how to design pipelines that are not locked to a single processing engine. ### Follow us on Social Media and join the Community! Mastodon: https://floss.social/@berlinbuzzwords LinkedIn:   / berlin-buzzwords   Website: https://berlinbuzzwords.de Mail: [email protected] Berlin Buzzwords is an event by Plain Schwarz – https://plainschwarz.com

Shailesh Kumar Singh – Constant-Time Aggregations with Star-Tree in OpenSearch #bbuzz
▶︎

Shailesh Kumar Singh – Constant-Time Aggregations with Star-Tree in OpenSearch #bbuzz

Jarek Potiuk – Empowering OSS maintainers in the age of AI #bbuzz
▶︎

Jarek Potiuk – Empowering OSS maintainers in the age of AI #bbuzz

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Varant Zanoyan – Real-Time ML Pipelines: Feature Chaining with Chronon #bbuzz
▶︎

Varant Zanoyan – Real-Time ML Pipelines: Feature Chaining with Chronon #bbuzz

Blue Ridge Ruby 2026 - LLM Telemetry as a First-Class Rails Concern by  David Paluy
▶︎

Blue Ridge Ruby 2026 - LLM Telemetry as a First-Class Rails Concern by David Paluy

Why Aliens Would NEVER Invade Africa
▶︎

Why Aliens Would NEVER Invade Africa

The NoSQL Lie That Keeps Developers Overbuilding
▶︎

The NoSQL Lie That Keeps Developers Overbuilding

Hartmut Armbruster – What If We've Been Scaling Stream Processing Wrong All Along #bbuzz
▶︎

Hartmut Armbruster – What If We've Been Scaling Stream Processing Wrong All Along #bbuzz

10 Images | Coastal Citrus Floral Summer Paintings Screensaver l Frame TV ART |
▶︎

10 Images | Coastal Citrus Floral Summer Paintings Screensaver l Frame TV ART |

Gülçin Yıldırım Jelinek – What you should know about constraints in PostgreSQL 18 #bbuzz
▶︎

Gülçin Yıldırım Jelinek – What you should know about constraints in PostgreSQL 18 #bbuzz

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

FIFA World Cup Uncut | 8 Minutes of Unforgettable Madness | Brazil vs Germany (2014 Semi-Final)
▶︎

FIFA World Cup Uncut | 8 Minutes of Unforgettable Madness | Brazil vs Germany (2014 Semi-Final)

NYC's Joyous Knicks Victory Celebration vs. Trump's Joyless White House UFC Fight | The Daily Show
▶︎

NYC's Joyous Knicks Victory Celebration vs. Trump's Joyless White House UFC Fight | The Daily Show

How ASML Makes Chips Faster With Its New $400 Million High NA Machine
▶︎

How ASML Makes Chips Faster With Its New $400 Million High NA Machine

Ex-Google Recruiter Explains Why "Lying" Gets You Hired
▶︎

Ex-Google Recruiter Explains Why "Lying" Gets You Hired

The FDE Playbook for AI Startups with Bob McGrew
▶︎

The FDE Playbook for AI Startups with Bob McGrew

Rahul Goswami – Zero downtime index upgrade in Apache Solr #bbuzz
▶︎

Rahul Goswami – Zero downtime index upgrade in Apache Solr #bbuzz

How I’m Preparing For The “Supercycle”
▶︎

How I’m Preparing For The “Supercycle”

Designing Data-intensive Applications with Martin Kleppmann
▶︎

Designing Data-intensive Applications with Martin Kleppmann

Vintage Painting | TV Frame Screensaver Art for TV Wallpaper | TV Artwork
▶︎

Vintage Painting | TV Frame Screensaver Art for TV Wallpaper | TV Artwork