Apple’s newest AI study unlocks street view for blind users - 9to5Mac

There’s no shortage of rumors about Apple’s plans to release camera-equipped wearables. And while it’s easy to get fatigued by yet another wave of upcoming AI-powered hardware, one powerful use case often gets lost in the shuffle: accessibility.

SceneScout, a new research prototype from Apple and Columbia University, isn’t a wearable. Yet. But it hints at what AI could eventually unlock for blind and low-vision users.

As Apple’s and Columbia University’s researchers explain it:

To try to close this gap, the researchers present this project that combines Apple Maps APIs with a multimodal large language model to provide interactive, AI-generated descriptions of street view images.

Instead of just relying on turn-by-turn directions or landmarks, users can explore an entire route or virtually explore a neighborhood block by block, with street-level descriptions that are tailored to their specific needs and preferences.

The system supports two main modes:

Route Preview, which lets users get a sense of what they’ll encounter along a specific path. That means sidewalk quality, intersections, visual landmarks, what a bus stop looks like, etc.

Virtual Exploration, which is more open-ended. Users describe what they’re searching for (like a quiet residential area with access to parks), and the AI helps them navigate intersections and explore in any direction based on that intent.

Behind the scenes, SceneScout grounds a GPT-4o-based agent within real-world map data and panoramic images from Apple Maps. It simulates a pedestrian’s view, interprets what’s visible, and outputs structured text, broken into short, medium, or long descriptions. The web interface, designed with screen readers in mind, presents all of this in a fully accessible format.

The first tests showed promise, but also important (and dangerous) shortcomings

The research team ran a study with 10 blind or low-vision users, most of whom were proficient with screen readers and worked in tech.

Participants used both Route Preview and Virtual Exploration, and gave the experience high marks for usefulness and relevance. The Virtual Exploration mode was especially praised, as many said it gave them access to information they would normally have to ask others about.

Still, there were important shortcomings.

While about 72% of the generated descriptions were accurate, some included subtle hallucinations, like claiming a crosswalk had audio signals when it didn’t, or even mislabeling street signs. And while most of the information was stable over time, a few descriptions referenced outdated or transient details like construction zones or parked vehicles.

Participants also pointed out that the system occasionally made assumptions, both about the user’s physical abilities and about the environment itself.

Several users emphasized the need for more objective language and better spatial precision, especially for last-meter navigation. Others wished the system could adapt more dynamically to their preferences over time, instead of relying on static keywords.

SceneScout obviously isn’t a shipping product, and it explores the collaboration between a multimodal large language model and the Apple Maps API, rather than real-time, computer vision-based, in-situ world navigation.

But one could easily draw a line from one to the other. In fact, that is brought up towards the end of the study:

As with other studies published on arXiv, SceneScout: Towards AI Agent-driven Access to Street View Imagery for Blind Users hasn’t been peer-reviewed. Still, it is absolutely worth your time if you’d like to know where AI, wearables, and computer vision are inevitably heading.
