Research from Renmin University: Caution Advised in Data Augmentation for Contrastive Learning


Recently, a patent from Tencent Technology (Shenzhen) Co., Ltd. covering a training method and related equipment for large language models was disclosed on the Tianyancha app. The patent, titled 'Training Method, Device, Computer Equipment, and Storage Medium for Large Language Models', aims to improve the learning capacity and accuracy of large language models through new training methods. When training large language models, traditional methods often rely on a single text summary, which can lead to overfitting and reduce the accuracy and diversity of generated content. However, Tencent's new...
In today's technology landscape, CLIP (Contrastive Language-Image Pre-training) is an important multimodal foundation model. It maps visual and text signals into a shared feature space by training with a contrastive learning loss on a large-scale dataset of image-text pairs. As a retriever, CLIP supports various tasks such as zero-shot classification, detection, segmentation, and image-text retrieval. Meanwhile, as a feature extractor, it performs well in nearly all...
EasyRec is a recommendation system based on language models, developed by a team from the University of Hong Kong. It stands out by using a text-behavior alignment framework to analyze detailed, emotionally rich user behavior narratives and predict user preferences without requiring large amounts of user data. The system combines contrastive learning with collaborative language models, enabling accurate preference predictions for new users and new products, and it performs particularly well in zero-shot recommendation scenarios. EasyRec's plug-and-play design makes it easy to integrate into existing recommendation systems to improve their performance. The paper showcases EasyRec's performance across multiple...
xAI recently released an early version of its agentic command-line tool, "Grok Build", designed to help developers simplify coding, build applications, and automate workflows. It is currently available only to SuperGrok Heavy subscribers and can be accessed via x.ai/cli. The tool is positioned as a smart development assistant that offers more advanced features than a traditional command line.
Qwen APP has entered into a strategic cooperation with the National Medical Products Administration Information Center, fully integrating millions of authoritative national-level data records on medicines, cosmetics, and medical devices. The move aims to address the 'hallucination' problem in AI health consultations through real-time verification against authoritative databases, improving the accuracy of information and providing precise medication guidance and ingredient analysis for tens of millions of users. It marks a crucial step in the compliance and professional development of domestic large models in vertical fields.