Research from Renmin University: Caution Advised in Data Augmentation for Contrastive Learning


Recently, Tencent Technology (Shenzhen) Co., Ltd. published a patent regarding a training method and related equipment for large language models on the Tianyancha app. The patent is titled 'Training Method, Device, Computer Equipment, and Storage Medium for Large Language Models' and aims to enhance the learning capacity and accuracy of large language models through innovative training methods. In the training process of large language models, traditional methods often rely on a single text summary, which may lead to model overfitting and negatively impact the accuracy and diversity of generated content. However, Tencent's new...
In today's technology landscape, CLIP (Contrastive Language-Image Pre-training) is an important multimodal foundational model. It combines visual signals and text signals into a shared feature space using contrastive learning loss on a large-scale dataset of image-text pairs. As a retriever, CLIP supports various tasks such as zero-shot classification, detection, segmentation, and image-text retrieval. Meanwhile, as a feature extractor, it performs well in nearly all...
EasyRec is a recommendation system based on language models, developed by a team from the University of Hong Kong. Its uniqueness lies in analyzing emotional and detailed user behavior stories through a text behavior alignment framework to predict user preferences without requiring large amounts of user data. The system combines contrastive learning and collaborative language models, enabling accurate predictions of preferences for new users and new products, particularly excelling in zero-shot recommendation scenarios. EasyRec's plug-and-play features make it easy to integrate into existing recommendation systems, enhancing performance. The paper showcases EasyRec's performance across multiple...
OpenAI completed a $122 billion financing round, valuing the company at $852 billion, setting a new record for the company's fundraising. The funds will be used for AI chip development, data center construction, and talent recruitment, paving the way for its IPO this year. SoftBank, Andreessen Horowitz, among others, led the round, with tech giants such as Amazon, NVIDIA, and Microsoft participating.
Recently, the source code of the AI company Anthropic's Claude Code tool was accidentally leaked. Developer Chaofan Shou posted a compressed package containing over 500,000 lines of TypeScript code on Twitter, which quickly attracted global attention, with views exceeding 5.3 million within a few hours. This leak incident exposed potential vulnerabilities in Anthropic's code management, becoming a hot topic in the tech circle.