Google announced that NotebookLM now supports images as a data source. After users upload blackboard notes, scanned textbook pages, or street photos, the system automatically performs OCR and semantic analysis, letting users search the images' content directly in natural language. The feature is available for free on all platforms. Google said that in the coming weeks it will add local processing options to reduce the need to upload sensitive material to the cloud.
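NotebookLM's internal pipeline is not public, but the "local processing" idea Google alludes to roughly means running the OCR step on the user's own machine before anything is sent anywhere. Below is a minimal sketch of such a local step, assuming the open-source pytesseract and Pillow packages (plus a Tesseract installation); the file name is hypothetical and this is not what NotebookLM actually uses.

```python
# Minimal local-OCR sketch: the raw photo never leaves the machine.
# Assumes: pip install pytesseract pillow, and Tesseract installed locally.
from PIL import Image
import pytesseract


def extract_text_locally(image_path: str) -> str:
    """Run OCR on an image file on the local machine and return the text."""
    image = Image.open(image_path)
    return pytesseract.image_to_string(image)


if __name__ == "__main__":
    # Hypothetical file name, for illustration only.
    print(extract_text_locally("blackboard_notes.jpg"))
```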


The new NotebookLM is built around a multimodal model that can distinguish handwritten from printed regions, extract table structures, and automatically link them to existing text, audio, and video notes. Google demonstrated several use cases: photograph classroom board notes and ask "How is the formula in the lower left corner derived?" and the system immediately locates the formula and generates a step-by-step explanation; scan page 127 of a textbook and query its table cell values directly; upload a menu from a street coffee shop and pull out the price of a latte. A sketch of this ask-a-question-about-an-image pattern follows below.
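NotebookLM exposes no public API for this feature, so the following is only an illustrative sketch of the general pattern of asking a natural-language question about an uploaded image, using Google's public google-generativeai package. The model name, environment variable, and photo file name are assumptions for illustration, not details from the announcement.

```python
# Sketch: ask a multimodal model a question about a photo of board notes.
# Assumes: pip install google-generativeai pillow, and a GOOGLE_API_KEY env var.
import os

from PIL import Image
import google.generativeai as genai

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
model = genai.GenerativeModel("gemini-1.5-flash")  # assumed model name

board_photo = Image.open("classroom_board.jpg")  # hypothetical photo
response = model.generate_content(
    [board_photo, "How is the formula in the lower left corner derived?"]
)
print(response.text)
```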

Google said that within 48 hours of launch, educational accounts had uploaded more than 500,000 pages of images, a 340% increase over the prior period. The company plans to integrate a real-time capture interface for AR glasses into NotebookLM next year, enabling "ask anything you see." For now, image processing draws on the existing free quota, and Google has not said whether a paid acceleration tier will be introduced.