PyCon.DE 2017 Alexander Bauer - Large-scale machine learning pipelines using Luigi,...n

Large-scale machine learning pipelines using Luigi, PySpark and scikit-learn Alexander Bauer Alexander Bauer holds a Ph.D. in computer science. He has around 10 years industry experience, currently leading a team of data scientists at Lidl, one of the largest global discount supermarket chains. He is a Kaggle Master and regular speaker at the Frankfurt Predictive Analytics Meetup. He believes in agile software development practices and promotes Python as a primary language for data science applications in production. Abstract Tags: data-science analytics python machine learning For prescriptive analytics applications, data science teams need to design, build and maintain complex machine learning pipelines. In this talk, we demonstrate how such pipelines can be implemented in a robust, scalable and extensible manner using Python, Luigi, PySpark and scikit-learn. Description Data science teams working on real-world prescriptive analytics applications face the challenge to design, build and maintain considerably complex machine learning pipelines on a daily basis. Such pipelines include parsing data from multiple data sources, extracting relevant predictive features, executing training, validation, prediction steps and finally optimizing actions to meet desired business outcome so that they can be shared and visualized to business users. In this talk, we demonstrate how such pipelines can be implemented end-to- end in a robust, scalable and extensible manner using Python, Luigi, PySpark and scikit-learn. We will share our lessons learned from using this framework in a real-world demand forecasting use case. Recorded at PyCon.DE 2017 Karlsruhe: pycon.de Video editing: Sebastian Neubauer & Andrei Dan Tools: Blender, Avidemux & Sonic Pi

PyCon.DE 2017 Nils Braun - Time series feature extraction with tsfresh - “get rich or die..
▶︎

PyCon.DE 2017 Nils Braun - Time series feature extraction with tsfresh - “get rich or die..

Peter Owlett - Lessons from 6 months of using Luigi in production
▶︎

Peter Owlett - Lessons from 6 months of using Luigi in production

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains
▶︎

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

Dylan Barth, Stuart Coleman: A beginner's guide to building data pipelines with Luigi
▶︎

Dylan Barth, Stuart Coleman: A beginner's guide to building data pipelines with Luigi

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

The French Do Not Care About Work
▶︎

The French Do Not Care About Work

The FULL VIDEO of Trump they didn’t want released
▶︎

The FULL VIDEO of Trump they didn’t want released

Yann LeCun: World Models: Enabling the next AI revolution
▶︎

Yann LeCun: World Models: Enabling the next AI revolution

"Dynamic Data Pipelining with Luigi" - Trey Hakanson (Pyohio 2019)
▶︎

"Dynamic Data Pipelining with Luigi" - Trey Hakanson (Pyohio 2019)

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service
▶︎

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

AlphaFold - The Most Useful Thing AI Has Ever Done
▶︎

AlphaFold - The Most Useful Thing AI Has Ever Done

What to teach when AI writes the code | Rainer Stropek | TEDxLinz
▶︎

What to teach when AI writes the code | Rainer Stropek | TEDxLinz

Semiconductors explained in 16 mins | Chris Miller
▶︎

Semiconductors explained in 16 mins | Chris Miller

Build Large-Scale Data Analytics and AI Pipeline Using RayDP
▶︎

Build Large-Scale Data Analytics and AI Pipeline Using RayDP

Think Fast, Talk Smart: Communication Techniques
▶︎

Think Fast, Talk Smart: Communication Techniques

6 Tips on Being a Successful Entrepreneur | John Mullins | TED
▶︎

6 Tips on Being a Successful Entrepreneur | John Mullins | TED

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026
▶︎

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Conan O’Brien Mocks Trump At Harvard Commencement | Crowd Erupts During Viral Speech
▶︎

Conan O’Brien Mocks Trump At Harvard Commencement | Crowd Erupts During Viral Speech

Conquering the Queue: Lessons from processing one billion Celery tasks
▶︎

Conquering the Queue: Lessons from processing one billion Celery tasks