Accelerating the ML Lifecycle with an Enterprise-Grade Feature Store
Productionizing real-time ML models poses unique data engineering challenges for enterprises that are coming from batch-oriented analytics. Enterprise data, which has traditionally been centralized in data warehouses and optimized for BI use cases, must now be transformed into features that provide meaningful predictive signals to our ML models. Enterprises face the operational challenges of deploying these features in production: building the data pipelines, then processing and serving the features to support production models. ML data engineering is a complex and brittle process that can consume upwards of 80% of our data science efforts, all too often grinding ML innovation to a crawl. Based on our experience building the Uber Michelangelo platform, and currently building next-generation ML infrastructure for Tecton.ai, we’ll share insights on building a feature platform that empowers data scientists to accelerate the delivery of ML applications. Spark and DataBricks provide a powerful and massively scalable foundation for data engineering. Building on this foundation, a feature platform extends your data infrastructure to support ML-specific requirements. It enables ML teams to track and share features with a version-control repository, process and curate feature values to have a single source of centralized data, and instantly serve features for model training, batch, and real-time predictions. Atlassian will join us to provide first-hand perspective from an enterprise who has successfully deployed a feature platform in production. The platform powers real-time, ML-driven personalization and search services for a popular SaaS application. Connect with us: Website: https://databricks.com Facebook: / databricksinc Twitter: / databricks LinkedIn: / databricks Instagram: / databricksinc Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. https://databricks.com/databricks-nam...

Rethinking Feature Stores

Michelangelo: Uber's machine learning platform - Achal Shah

Enable Production ML with Databricks Feature Store

Building a Real-Time Feature Store at iFood

How Netflix Built Their ML Infrastructure

MLOps on Databricks: A How-To Guide

Michelangelo - Machine Learning @Uber

ML System Design: Feature Store

MLflow: An Open Platform to Simplify the Machine Learning Lifecycle

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

What is Feature Store in Machine Learning | #Mlopstutorial #featurestore #machinelearning

Ultimate Guide to AI Infrastructure | Community Webinar

Introducing MLflow for End-to-End Machine Learning on Databricks

Something is jamming GPS over Europe. Here's what we found

AWS re:Invent 2020: Building end-to-end ML workflows with Kubeflow Pipelines

AWS re:Invent 2020: Amazon SageMaker Feature Store: Store, discover, & share features for ML apps

Seamless MLOps with Seldon and MLflow

Feature Store for Machine Learning: The Beginner's Guide (+Feast)

The Feature Store - Jim Dowling

