Apple and a research team from Columbia University have jointly developed an AI prototype system called SceneScout. The system is designed to provide street-view navigation assistance for the blind and low-vision (BLV) community, helping them get around more effectively in daily life.

SceneScout combines the Apple Maps API with a multimodal large language model (built on GPT-4o) to generate personalized environmental descriptions. This allows users to receive more intuitive, concrete navigation information, improving their travel experience. The related research paper has been posted on the preprint platform arXiv and has not yet undergone peer review.
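
Neither the paper's code nor its exact API wiring is public, but the pipeline it describes, street-level imagery in, pedestrian-perspective text out, can be sketched roughly as follows. This is a minimal illustration in Python: `fetch_lookaround_image` is a hypothetical stand-in for the Apple Maps imagery step, and the prompt wording is invented for illustration.

```python
import base64
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def fetch_lookaround_image(lat: float, lon: float) -> bytes:
    """Hypothetical stand-in for retrieving street-level imagery
    (e.g., via Apple Maps) at the given coordinates."""
    raise NotImplementedError("imagery source not publicly documented")

def describe_scene(lat: float, lon: float) -> str:
    """Send a street-view image to a multimodal model and ask for a
    pedestrian-perspective description suited to BLV users."""
    image_b64 = base64.b64encode(fetch_lookaround_image(lat, lon)).decode()
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe this scene from a pedestrian's point of "
                         "view for a blind user: sidewalk condition, "
                         "crossings, and nearby landmarks."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content
```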


The system's core functionality has two parts. The first is Route Preview, which lets users learn about conditions along a planned route in advance, such as sidewalk quality, intersection characteristics, and nearby bus stops. This information is especially valuable for blind users, who can build a mental picture of their surroundings before setting out.
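
As a rough sketch of how such a preview could be assembled (again an assumption, not the paper's documented implementation), one might sample waypoints along a route from a routing service and run the scene-description step above at each one:

```python
def preview_route(waypoints: list[tuple[float, float]]) -> list[str]:
    """Build a step-by-step textual preview of a walking route by
    describing the scene at each sampled waypoint. Reuses the
    describe_scene() sketch above; the waypoint list would come from
    a routing service such as the Apple Maps API."""
    return [
        f"Step {i + 1}: {describe_scene(lat, lon)}"
        for i, (lat, lon) in enumerate(waypoints)
    ]
```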

The second is Virtual Exploration, which lets users explore open-ended scenarios on demand. For example, a user can ask the system for "a quiet residential area near the park," and the system will provide directional guidance matching the request. SceneScout interprets what is visible from a pedestrian's perspective and generates structured text, available in short, medium, and long formats that adapt to various screen readers, making the output easy for blind users to consume.
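
The short, medium, and long levels could plausibly be implemented as a verbosity hint folded into the prompt. The word budgets below are illustrative guesses, not values from the paper:

```python
LENGTH_HINTS = {
    "short": "one sentence of at most 20 words",
    "medium": "a short paragraph of two to three sentences",
    "long": "a detailed paragraph covering all visible features",
}

def explore(query: str, lat: float, lon: float, length: str = "medium") -> str:
    """Answer an open-ended exploration query at a chosen verbosity
    so the result reads well through a screen reader."""
    image_b64 = base64.b64encode(fetch_lookaround_image(lat, lon)).decode()
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": f"The user is looking for: {query}. Describe what "
                         f"is visible from a pedestrian's viewpoint and "
                         f"whether it matches. "
                         f"Length: {LENGTH_HINTS[length]}."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content
```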

In the testing phase, the researchers recruited 10 blind and low-vision users, most of whom had backgrounds in the technology industry. In the evaluation, 72% of the AI-generated descriptions were judged accurate. Feedback on the Virtual Exploration mode was especially positive: participants said the feature could effectively replace traditional ways of gathering this information, making their daily travel considerably easier.

Key Points:

🌍 SceneScout, a system developed by Apple and Columbia University, provides street view navigation assistance for visually impaired users.

📊 The system combines the Apple Maps API with a multimodal large language model to generate personalized environmental descriptions.

👥 In testing, 72% of AI-generated descriptions were judged accurate, and the Virtual Exploration feature received high praise from users.