Stop Rebuilding Your AI Pipelines: The Hidden 90% of Production AI Infrastructure - Maher Hanafi

Stop Rebuilding Your Pipelines: Scalable Data Architecture Patterns for Production AI Most AI teams focus on models. The teams that succeed in production focus on infrastructure. In this talk from the Data Infrastructure Summit 2026, Maher Hanafi (SVP of Engineering at Betterworks) shares lessons learned building and operating AI systems at scale—including the reliability failures, architectural challenges, and hidden infrastructure requirements that emerge after the first successful launch. The talk explores why production AI is far more than model selection and prompt engineering, and why the real work happens in the layers beneath the surface: reliability, evaluation, observability, security, retrieval, governance, and operational feedback loops. Topics covered: • Why Betterworks chose an inference-only, model-agnostic strategy • The correct optimization sequence for production AI systems • Reliability lessons from real-world deployments • Offline inference and pre-processing strategies • In-flight batching and GPU utilization optimization • Building Responsible AI guardrails without sacrificing latency • Prompt management as production infrastructure • Custom evaluation pipelines and quality gates • RBAC-aware retrieval in multi-tenant systems • Designing AI platforms that work across both monoliths and microservices • The hidden 90% of AI architecture that users never see If you're building AI products, GenAI platforms, RAG systems, agentic workflows, or enterprise AI infrastructure, this session provides practical architectural patterns that can help you avoid common reliability, security, and scaling pitfalls. Speaker: Maher Hanafi SVP of Engineering, Betterworks Event: Data Infrastructure Summit 2026 #AIEngineering #GenAI #MachineLearning #DataEngineering #MLOps #RAG #LLM #SoftwareArchitecture #PlatformEngineering #AIInfrastructure #ArtificialIntelligence #EnterpriseAI

Google & AWS Veteran: What Top Tier Software Architects Actually Do
▶︎

Google & AWS Veteran: What Top Tier Software Architects Actually Do

Generative AI Deep Dive: Advancing from Proof of Concept to Production by Maher Hanafi
▶︎

Generative AI Deep Dive: Advancing from Proof of Concept to Production by Maher Hanafi

Doubling the productivity of your engineering team using AI (Brian Scanlan)
▶︎

Doubling the productivity of your engineering team using AI (Brian Scanlan)

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Is the AI Boom About to COLLAPSE?
▶︎

Is the AI Boom About to COLLAPSE?

Surviving the AI Trust Gap: How Engineering Leaders Must Adapt to AI Code Generation - Maher Hanafi
▶︎

Surviving the AI Trust Gap: How Engineering Leaders Must Adapt to AI Code Generation - Maher Hanafi

The French Do Not Care About Work
▶︎

The French Do Not Care About Work

How SpaceX Humiliated Wall Street
▶︎

How SpaceX Humiliated Wall Street

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit
▶︎

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Anthopic, OpenAI Should Not Be Allowed to IPO, Says Ed Zitron
▶︎

Anthopic, OpenAI Should Not Be Allowed to IPO, Says Ed Zitron

How ASML Makes Chips Faster With Its New $400 Million High NA Machine
▶︎

How ASML Makes Chips Faster With Its New $400 Million High NA Machine

We Saw What AI Data Centers Don't Want You to See
▶︎

We Saw What AI Data Centers Don't Want You to See

Don't learn AI Agents without Learning these Fundamentals
▶︎

Don't learn AI Agents without Learning these Fundamentals

The most rational take on AI you’ll hear this year
▶︎

The most rational take on AI you’ll hear this year

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026
▶︎

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026

Real World Experiments in Reshaping Your Org, Interviews, & Infrastructure to Build AI Enabled Teams
▶︎

Real World Experiments in Reshaping Your Org, Interviews, & Infrastructure to Build AI Enabled Teams

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G
▶︎

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G

How We Cut LLM Latency By 70% With NVIDIA TensorRT-LLM. MLOps Community - Maher Hanafi, SVP of Eng
▶︎

How We Cut LLM Latency By 70% With NVIDIA TensorRT-LLM. MLOps Community - Maher Hanafi, SVP of Eng

Something is jamming GPS over Europe. Here's what we found
▶︎

Something is jamming GPS over Europe. Here's what we found