Turning 2D into depth images

Most cameras just record colour but now the 3D shapes of objects, captured through only a single lens, can be accurately estimated using new software developed by UCL computer scientists. The method, published at CVPR 2017, gives state-of-the-art results and works with existing photos, allowing any camera to map the depth for every pixel it captures. The technology has a wide variety of applications, from augmented reality in computer games and apps, to robot interaction, and self-driving cars. Historical images and videos can also be analysed by the software, which is useful for reconstruction of incidents or to automatically convert 2D films into immersive 3D. Inferring object-range from a simple image by using real-time software has a whole host of potential uses. Depth mapping is critical for self-driving cars to avoid collisions, for example. Currently, car manufacturers use a combination of laser-scanners and/or radar sensors, which have limitations. They all use cameras too, but the individual cameras couldn’t provide meaningful depth information. The new software was developed using machine learning methods and has been trained and tested in outdoor and urban environments. It successfully estimates depths for thin structures such as street signs and poles, as well as people and cars, and quickly predicts a dense depth map for each 512 x 256 pixel image, running at over 25 frames per second. Currently, depth mapping systems rely on bulky binocular stereo rigs or a single camera paired with a laser or light-pattern projector that don’t work well outdoors because objects move too fast and sunlight dwarfs the projected patterns. There are other machine-learning based systems also seeking to get depth from single photographs, but those are trained in different ways, with some needing elusive high-quality depth information. The new technology doesn’t need real-life depth datasets, and outperforms all the other systems. Once trained, it runs in the field by processing one normal single-lens photo after another. Understanding the shape of a scene from a single image is a fundamental problem. A 360 degree depth map would be fantastically useful – it could drive wearable tech to assist disabled people with navigation, or to map real-life locations for virtual reality gaming, for example. At the moment, the software requires a desktop computer to process individual images, but they plan on miniaturising it, so it can be run on hand-held devices such as phones and tablets, making it more accessible to app developers. It has also optimised only for outdoor use, so the next step is to train it on indoor environments. The team has patented the technology for commercial use through UCL Business, but has made the code free for academic use. Funding for the research was kindly provided by the Engineering and Physical Sciences Research Council.

DeMoN: Depth and Motion Network for Learning Monocular Stereo

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Stereo 3D Vision (How to avoid being dinner for Wolves) - Computerphile

Stereo 3D Vision (How to avoid being dinner for Wolves) - Computerphile

Depth Images

Depth Images

CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction

CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction

VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - SIGGRAPH2017

VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - SIGGRAPH2017

432Hz - Fall Into Deep Sleep in 3 Minutes, Heal All Damage In The Body and Spirit, Relieve Stress #2

432Hz - Fall Into Deep Sleep in 3 Minutes, Heal All Damage In The Body and Spirit, Relieve Stress #2

Did AI Just Kill VFX?

Did AI Just Kill VFX?

Estimating Depth Map With Machine Learning

Estimating Depth Map With Machine Learning

How computers learn to recognize objects instantly | Joseph Redmon

How computers learn to recognize objects instantly | Joseph Redmon

EGGN 512 - Lecture 23-1 Epipolar and Essential

EGGN 512 - Lecture 23-1 Epipolar and Essential

Flow State Music | No Lyrics Creative Flow Music - Ultimate Work Flow Music For Focus Mode

Flow State Music | No Lyrics Creative Flow Music - Ultimate Work Flow Music For Focus Mode

How are holograms possible?

How are holograms possible?

Purple Particles and Textures Background video | Footage | Screensaver

Purple Particles and Textures Background video | Footage | Screensaver

Unsupervised Monocular Depth Estimation With Left-Right Consistency

Unsupervised Monocular Depth Estimation With Left-Right Consistency

Stop Prompting Claude. Use Karpathy's Method Instead.

Stop Prompting Claude. Use Karpathy's Method Instead.

الرقية الشرعية للشفاءمن السحروالعين والحسد حصن من الشيطان رقية البيت والاولاد بصوت القارئ سعيد حمدان

الرقية الشرعية للشفاءمن السحروالعين والحسد حصن من الشيطان رقية البيت والاولاد بصوت القارئ سعيد حمدان

Distance to objects using single vision camera.

Distance to objects using single vision camera.

Depth Maps and 6DoF from Stereo 360 Images

Depth Maps and 6DoF from Stereo 360 Images

Peaceful Aquarium 8K – Mesmerizing Marine Life & Calming Ocean Ambience

Peaceful Aquarium 8K – Mesmerizing Marine Life & Calming Ocean Ambience

Consistent Video Depth Estimation

Consistent Video Depth Estimation