Designing data pipelines for analytics and machine learning in industrial settings
Machine learning has made it possible for technologists to do amazing things with data. Its arrival coincides with the evolution of networked manufacturing systems driven by IoT. In this presentation we’ll examine the rise of IoT and ML from a practitioners perspective to better understand how applications of AI can be built in industrial settings. We'll walk through a case study that combines multiple IoT and ML technologies to monitor and optimize an industrial heating and cooling HVAC system. Through this instructive example you'll see how the following components can be put into action: 1. A StreamSets data pipeline that sources from MQTT and persists to OpenTSDB 2. A TensorFlow model that predicts anomalies in streaming sensor data 3. A Spark application that derives new event streams for real-time alerts 4. A Grafana dashboard that displays factory sensors and alerts in an interactive view By walking through this solution step-by-step, you'll learn how to build the fundamental capabilities needed in order to handle endless streams of IoT data and derive ML insights from that data: 1. How to transport IoT data through scalable publish/subscribe event streams 2. How to process data streams with transformations and filters 3. How to persist data streams with the timeliness required for interactive dashboards 4. How to collect labeled datasets for training machine learning models At the end of this presentation you will have learned how a variety of tools can be used together to build ML enhanced applications and data products for instrumented manufacturing systems. Speakers IAN DOWNARD Sr. Developer Evangelist MapR WILLIAM OCHANDARENA Senior Director of Product Management MapR

Foundations of streaming SQL: stream & table theory

AWS Summit ANZ 2022 - End-to-end MLOps for architects (ARCH3)

How to Build Data Pipelines for ML Projects (w/ Python Code)

The rise of big data governance: insight on this emerging trend from active open source initiatives

Programable Logic Controller Basics Explained - automation engineering

Should You Still Become a Software Engineer in 2026? GitHub VP

How to Create a High-Performing IoT Data Pipeline: Best Practices

But what is a neural network? | Deep learning chapter 1

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

Introduction to Jupyter Lab for Python

OpenCV Course - Full Tutorial with Python

The World's Most Important Machine

Integrating Apache Phoenix with Distributed Query Engines

What is Data Pipeline | How to design Data Pipeline ? - ETL vs Data pipeline (2025)

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Streaming Machine Learning with Apache Kafka and TensorFlow

Data Pipelines Explained

Complete Terraform Course - From BEGINNER to PRO! (Learn Infrastructure as Code)

NestJS Full Course for Beginners in 2026 | Build a Production-Ready API

