Red Teaming Large Language Models - Armin Buescher - NDC Security 2024

This talk was recorded at NDC Security in Oslo, Norway. #ndcsecurity #ndcconferences #security #ai #developer #softwaredeveloper Attend the next NDC conference near you: https://ndcconferences.com https://ndc-security.com/ Subscribe to our YouTube channel and learn every day: /‪@NDC‬ Follow our Social Media! / ndcconferences / ndc_conferences / ndc_conferences As machine learning models become increasingly integrated into our digital infrastructure, evaluating their vulnerabilities is essential for both security and ethical reasons. Large language models (LLMs) are no exception. While they represent a revolutionary leap in natural language tasks, LLMs pose unique security and ethical challenges, including the potential to generate misleading, harmful, or biased content as well as leak confidential data, denial of service, or even cause remote code execution. This talk provides an in-depth look into red-teaming LLMs as an evaluation methodology to expose these vulnerabilities. By focusing on case studies and practical examples, we will differentiate between structured red team exercises and isolated adversarial attacks, such as model jailbreaks. Attendees will gain insights into the types of vulnerabilities that red teaming can reveal in LLMs, as well as potential strategies for mitigating these risks. The session aims to equip professionals with the knowledge to better evaluate the security and ethical dimensions of deploying Large Language Models in their organizations.

Incidents and incident handling @ VG.no - Audun Ytterdal - NDC Security 2024

Incidents and incident handling @ VG.no - Audun Ytterdal - NDC Security 2024

Demystifying Process Address Space: Heap, Stack, and Beyond - Piotr Wierciński - NDC TechTown 2024

Demystifying Process Address Space: Heap, Stack, and Beyond - Piotr Wierciński - NDC TechTown 2024

How hacking works - Espen Sande-Larsen - NDC TechTown 2023

How hacking works - Espen Sande-Larsen - NDC TechTown 2023

HiddenLayer Webinar: A Guide to AI Red Teaming

HiddenLayer Webinar: A Guide to AI Red Teaming

The Past, Present, and Future of Cross-Site/Cross-Origin Request Forgery - Philippe de Ryck

The Past, Present, and Future of Cross-Site/Cross-Origin Request Forgery - Philippe de Ryck

OWASP's Top 10 Ways to Attack LLMs: AI Vulnerabilities Exposed

OWASP's Top 10 Ways to Attack LLMs: AI Vulnerabilities Exposed

Why Large Language Models Hallucinate

Why Large Language Models Hallucinate

DuckDB, Apache Arrow, & the Future of Data Engineering w/ Rusty Conover | S2E3

DuckDB, Apache Arrow, & the Future of Data Engineering w/ Rusty Conover | S2E3

Unlocking The Secrets Of TLS - Scott Helme - NDC Security 2024

Unlocking The Secrets Of TLS - Scott Helme - NDC Security 2024

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI Systems

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI Systems

How to Train LLMs to "Think" (o1 & DeepSeek-R1)

How to Train LLMs to "Think" (o1 & DeepSeek-R1)

Gaspard Baye - Hacking GenAI with LLM Red Teaming and Beyond

Gaspard Baye - Hacking GenAI with LLM Red Teaming and Beyond

Implicit Conversions Considered Harmful - Jason Turner - NDC TechTown 2025

Implicit Conversions Considered Harmful - Jason Turner - NDC TechTown 2025

Red Teaming o1 Part 1/2–Automated Jailbreaking w/ Haize Labs' Leonard Tang, Aidan Ewart& Brian Huang

Red Teaming o1 Part 1/2–Automated Jailbreaking w/ Haize Labs' Leonard Tang, Aidan Ewart& Brian Huang

Linux user namespaces: a blessing and a curse - Ignat Korchagin - NDC TechTown 2024

Linux user namespaces: a blessing and a curse - Ignat Korchagin - NDC TechTown 2024

AI for Red Team & Malware Development

AI for Red Team & Malware Development

Memory Safety: Rust vs. C - Robert Seacord - NDC TechTown 2024

Memory Safety: Rust vs. C - Robert Seacord - NDC TechTown 2024

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Richie Lee - LLM Security 101 - An Introduction to AI Red Teaming | PyData Amsterdam 2024

Richie Lee - LLM Security 101 - An Introduction to AI Red Teaming | PyData Amsterdam 2024

Why Inference is hard..

Why Inference is hard..