Zhao & Qiao - Graph Neural Networks for Real World Fraud Detection | PyData Amsterdam 2023
Fraud is a major problem for financial services companies. As fraudsters change tactics, our detection methods need to get smarter. Graph neural networks (GNNs) are a promising model to improve detection performance. Unlike traditional machine learning models or rule-based engines, GNNs can effectively learn from subtle relationships by aggregating neighborhood information in the financial transaction networks. However, it remains a challenge to adopt this new approach in production. The goal of this talk is to share best practices for building a production ready GNN solution and hopefully spark your interest to apply GNNs to your own use cases. In this talk, we focus on suspicious account detection for online marketplaces. These platforms allow users to set up shops and sell products with little friction. Unfortunately, this attracts fraudsters who abuse these platforms. We use GNNs to do supervised learning based on accounts previously flagged as fraudulent, so that we can learn from both account properties and the relationship between accounts. However, productionizing GNNs is a big challenge. Addressing this challenge purely using open source packages is the main focus of this talk. We first give an overview of GNN-based fraud detection. Then we deep dive into utilizing PySpark and GraphFrames to build a transaction graph in a scalable way and convert it to DGL (Deep Graph Library) format. Next we share our experiences of setting up training and inference graphs in different time intervals, and deploying the end-to-end model pipeline in Airflow. Attendees are required to have a basic understanding of machine learning. In this informative talk, they will gain insights into fraud detection's challenges and learn best practices to productionize GNNs. Bios: Feng Zhao Feng is a senior data scientist at Adyen. He is passionate about solving real business problems using innovative AI/machine learning approaches. He received his Ph.D. from the National University of Singapore. Tingting Qiao Senior data scientist in Adyen, working in the Score team focusing on fraud detection. Having PhD background in computer vision and natural language processing using deep neural networks. Familiar with prediction models, such as regression, classification models, etc., as well as the latest research techniques, such as adversarial learning, neural networks etc. Several years of experience with popular deep learning frameworks. === www.pydata.org PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome! 00:10 Help us add time stamps or captions to this video! See the description for details. Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVi...

Tables as Code: The Journey from Ad-hoc Scripts to Maintainable ETL Workflows at Booking.com

An Introduction to Graph Neural Networks: Models and Applications

Gábor Szárnyas - DuckDB: The Power of a Data Warehouse in your Python Process

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Something is jamming GPS over Europe. Here's what we found

Ilya Sutskever – We're moving from the age of scaling to the age of research

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

Mini Lecture: Social Network Analysis for Fraud Detection

But what is a neural network? | Deep learning chapter 1

Deep Dive into LLMs like ChatGPT

Yann LeCun: World Models: Enabling the next AI revolution

The Story of Python and how it took over the world | Python: The Documentary

Preventing Abuse Using Unsupervised Learning | Fighting Abuse @Scale 2019

Building a Fraud Detection Platform using AI and Big Data

Gradient descent, how neural networks learn | Deep Learning Chapter 2

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs

Graph Transformers: What every data scientist should know, from Stanford, NVIDIA, and Kumo

AWS ML Summit 2021 | Deep Graph Library: Deep Graph learning at scale

Machine Learning for Everybody – Full Course

