WWDC25: Read documents using the Vision framework | Apple
Learn about the latest advancements in the Vision framework. We’ll introduce RecognizeDocumentsRequest, and how you can use it to read lines of text and group them into paragraphs, read tables, etc. And we’ll also dive into camera lens smudge detection, and how to identify potentially smudged images in photo libraries or your own camera capture pipeline. Explore related documentation, sample code, and more: Classifying Images with Vision and Core ML: https://developer.apple.com/documenta... Vision: https://developer.apple.com/documenta... Recognizing tables within a document: https://developer.apple.com/documenta... Image Classification with Vision and CoreML: https://developer.apple.com/sample-co... Discover Swift enhancements in the Vision framework: https://developer.apple.com/videos/pl... Detect animal poses in Vision: https://developer.apple.com/videos/pl... Explore 3D body pose and person segmentation in Vision: https://developer.apple.com/videos/pl... Discover machine learning & AI frameworks on Apple platforms: https://developer.apple.com/videos/pl... 00:00 - Introduction 01:22 - Reading documents 13:35 - Camera lens smudge detection 17:59 - Hand pose update More Apple Developer resources: Video sessions: https://apple.co/VideoSessions Documentation: https://apple.co/DeveloperDocs Forums: https://apple.co/DeveloperForums App: https://apple.co/DeveloperApp

WWDC25: Meet the Foundation Models framework | Apple

WWDC21: Extract document data using Vision | Apple

Inside Apple Intelligence and Xcode: Special Presentation | WWDC26

WWDC24: Discover Swift enhancements in the Vision framework | Apple

WWDC25: Deep dive into the Foundation Models framework | Apple

WWDC25: Bring advanced speech-to-text to your app with SpeechAnalyzer | Apple

WWDC26: Run local agentic AI on the Mac using MLX | Apple

WWDC25: Optimize SwiftUI performance with Instruments | Apple

WWDC25: Get started with MLX for Apple silicon | Apple

WWDC26: Bring an LLM provider to the Foundation Models framework | Apple

WWDC23: Explore 3D body pose and person segmentation in Vision | Apple

WWDC26: Create UI prototypes using agents in Xcode | Apple

WWDC25: Share visionOS experiences with nearby people | Apple

WWDC25: Code-along: Explore localization with Xcode | Apple

WWDC25: Code-along: Bring on-device AI to your app using the Foundation Models framework | Apple

WWDC25: Improve memory usage and performance with Swift | Apple

WWDC23: What’s new in VisionKit | Apple

Code-along: Start building with Swift and SwiftUI | Meet with Apple

WWDC25: Use structured concurrency with Network framework | Apple

