Microsoft’s computer vision app for the blind and poor-sighted, Seeing AI, just became more useful for those moments when you’re less interested in navigating the world than learning about what’s on your phone. The company has updated the iOS app with an option to explore photos by touching them. Tap your finger on an image and you’ll hear a description of both the objects in that scene as well as their spatial relationship. You can get descriptions for photos taken through Seeing AI’s Scene channel, but they’ll also be available for pictures in your camera roll as well as other apps (through options menus).
It’s powered by machine learning, of course, specifically object and scene recognition. All you need to do is take a photo or open one up in the viewer and tap anywhere on it.
“This new feature enables users to tap their finger to an image on a touch-screen to hear a description of objects within an image and the spatial relationship between them,” wrote Seeing AI lead Saqib Shaikh in a blog post. “The app can even describe the physical appearance of people and predict their mood.”
Because there’s facial recognition built in as well, you could very well take a picture of your friends and hear who’s doing what and where, and whether there’s a dog in the picture (important) and so on. This was possible on an image-wide scale already, as you can see in this image:
But the app now lets users tap around to find where objects are — obviously important to understanding the picture or recognizing it from before. Other details that may not have made it into the overall description may also appear on closer inspection, such as flowers in the foreground or a movie poster in the background.
In addition to this, the app now natively supports the iPad, which is certainly going to be nice for the many people who use Apple’s tablets as their primary interface for media and interactions. Lastly, there are a few improvements to the interface so users can order things in the app to their preference.