Bring Satellite and Drone Imagery into your Data Science Workflows

Bring Satellite and Drone Imagery into your Data Science Workflows Jason Brown A presentation from ApacheCon @Home 2020 https://apachecon.com/acah2020/ Overhead imagery from satellites and drones have entered the mainstream of how we explore, understand, and tell stories about our world. They are undeniable and arresting descriptions of cultural events, environmental disasters, economic shifts, and more. Data scientists recognize that their value goes far beyond anecdotal storytelling. It is unstructured data full of distinctive patterns in a high dimensional space. With machine learning, we can extract structured data from the vast set of imagery available. RasterFrames extends Apache Spark SQL with a strong Python API to enable processing of satellite, drone, and other spatial image data. This talk will discuss the fundamentals ideas to make sense of this imagery data. We will discuss how RasterFrames custom DataSource exploits convergent trends in how public and private providers publish images. Through deep Spark SQL integration, RasterFrames lets users consider imagery and other location-aware data sets in their existing data pipelines. RasterFrames builds on Apache licensed tech stack, fully supports Spark ML and interoperates smoothly with scikit-learn, TensorFlow, Keras, and PyTorch. To crystallize these ideas, we will discuss a practical data science case study using overhead imagery in PySpark. Jason is a Senior Data Scientist at Astraea, Inc. applying machine learning to Earth-observing data to provide actionable insights to clients' and partners' challenges. He brings a background in mathematical modeling and statistics together with an appreciation for data visualization, geography, and software development.

Massively Scalable Real-time Geospatial Anomaly Detection with Apache Kafka and Cassandra
▶︎

Massively Scalable Real-time Geospatial Anomaly Detection with Apache Kafka and Cassandra

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026
▶︎

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Exposing The Dark Side of America's AI Data Center Explosion | View From Above | Business Insider
▶︎

Exposing The Dark Side of America's AI Data Center Explosion | View From Above | Business Insider

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup
▶︎

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup

Databricks Live Bootcamp | Day1: Introduction & Data Analytics
▶︎

Databricks Live Bootcamp | Day1: Introduction & Data Analytics

How the Electrical Grid Is Being Rebuilt for AI | Bloomberg Primer
▶︎

How the Electrical Grid Is Being Rebuilt for AI | Bloomberg Primer

The Truth About U.S. Drones: A Full Industry Breakdown
▶︎

The Truth About U.S. Drones: A Full Industry Breakdown

Something is jamming GPS over Europe. Here's what we found
▶︎

Something is jamming GPS over Europe. Here's what we found

The World's Most Important Machine
▶︎

The World's Most Important Machine

What Ukraine’s Drone-on-Drone Warfare Is Really Like | Crossfire | Daily Mail
▶︎

What Ukraine’s Drone-on-Drone Warfare Is Really Like | Crossfire | Daily Mail

Skill Issue: Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI
▶︎

Skill Issue: Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

How to Actually Build Mobile Apps with AI in 2026 | A Complete Beginner's Tutorial
▶︎

How to Actually Build Mobile Apps with AI in 2026 | A Complete Beginner's Tutorial

Full App Building Course with Cursor (3+ Hours)
▶︎

Full App Building Course with Cursor (3+ Hours)

How ASML Makes Chips Faster With Its New $400 Million High NA Machine
▶︎

How ASML Makes Chips Faster With Its New $400 Million High NA Machine

Complete GitHub Actions Course - From BEGINNER to PRO
▶︎

Complete GitHub Actions Course - From BEGINNER to PRO

Above the Cloud: Building Data Centers in Space - Richard Campbell - NDC Copenhagen 2026
▶︎

Above the Cloud: Building Data Centers in Space - Richard Campbell - NDC Copenhagen 2026

Yann LeCun: World Models: Enabling the next AI revolution
▶︎

Yann LeCun: World Models: Enabling the next AI revolution

System Design Explained: APIs, Databases, Caching, CDNs, Load Balancing & Production Infra
▶︎

System Design Explained: APIs, Databases, Caching, CDNs, Load Balancing & Production Infra

Python Project | Python Projects For Beginners | Python Project Tutorial | Intellipaat
▶︎

Python Project | Python Projects For Beginners | Python Project Tutorial | Intellipaat