Research from Renmin University: Caution Advised in Data Augmentation for Contrastive Learning

The latest research from Renmin University warns that caution is needed in data augmentation for contrastive learning. Strongly aligned positive samples may harm the generalization ability of contrastive learning. While stronger data augmentation can enhance downstream task performance, alignment performance decreases. The study reveals the mechanisms by which data augmentation affects contrastive learning and proposes seeking better data augmentation strategies from an information-theoretic and spectral perspective.

Tencent Releases New Patent on Training Large Language Models to Enhance Generalization and Accuracy

Recently, Tencent Technology (Shenzhen) Co., Ltd. published a patent regarding a training method and related equipment for large language models on the Tianyancha app. The patent is titled 'Training Method, Device, Computer Equipment, and Storage Medium for Large Language Models' and aims to enhance the learning capacity and accuracy of large language models through innovative training methods. In the training process of large language models, traditional methods often rely on a single text summary, which may lead to model overfitting and negatively impact the accuracy and diversity of generated content. However, Tencent's new...

Microsoft Launches LLM2CLIP: New AI Technology Supports Image Understanding with Language Models

In today's technology landscape, CLIP (Contrastive Language-Image Pre-training) is an important multimodal foundational model. It combines visual signals and text signals into a shared feature space using contrastive learning loss on a large-scale dataset of image-text pairs. As a retriever, CLIP supports various tasks such as zero-shot classification, detection, segmentation, and image-text retrieval. Meanwhile, as a feature extractor, it performs well in nearly all...

Small yet Beautiful! HKU's Latest Recommendation System EasyRec Insights User Voices through Text

EasyRec is a recommendation system based on language models, developed by a team from the University of Hong Kong. Its uniqueness lies in analyzing emotional and detailed user behavior stories through a text behavior alignment framework to predict user preferences without requiring large amounts of user data. The system combines contrastive learning and collaborative language models, enabling accurate predictions of preferences for new users and new products, particularly excelling in zero-shot recommendation scenarios. EasyRec's plug-and-play features make it easy to integrate into existing recommendation systems, enhancing performance. The paper showcases EasyRec's performance across multiple...

New Breakthrough in Embodied Intelligence: Ant Group Open Sources LingBot-Vision, Enabling Robots to Have a Sense of Space

Ant Group's Robbyant opensources the LingBot-Vision model family, which achieves outstanding performance in dense space perception tasks through self-supervised vision Transformers and innovative boundary modeling. It surpasses large models with several times more parameters in multiple metrics, breaking the limitations of existing visual foundation models that focus heavily on object recognition, making precise perception of physical space by robots a reality.

United States Approves GPT-5.6, OpenAI Launches Several Major Models This Week

The U.S. Commerce Department has lifted restrictions on the release of OpenAI's GPT-5.6 model, approving its wide public deployment. OpenAI will launch GPT-5.6 Sol this Thursday, alongside two new models, Terra and Luna. The model was previously under temporary national security controls, which have now ended.....