ZhiYuan Research Institute Releases 1 Billion Parameter General 3D Vision Model Uni3D


ZhiYuan Research Institute has released the new-generation multimodal foundation model Emu2, pushing the boundaries of multimodal in-context learning. Emu2 surpasses Flamingo-80B and IDEFICS-80B, demonstrating excellent performance on few-shot multimodal understanding tasks, and achieves top results across several few-shot understanding, visual question answering, and image generation benchmarks. Emu2-Chat accurately follows text-and-image instructions, while Emu2-Gen produces flexible, controllable, high-quality images.
ZhiYuan Research Institute has open-sourced JudgeLM, an evaluation model that can efficiently assess various large models and assign scores. Its cost is about 1/120 that of GPT-4, while its agreement with reference answers exceeds 90%, approaching human-level consistency. JudgeLM can be applied in assessment scenarios ranging from pure text to multimodal contexts, producing both scores and the reasoning behind them. The institute has also released training and validation datasets to support in-depth research on large-model evaluation.
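The "score plus justification" workflow described above can be sketched as a small judge-prompt builder and parser. This is an illustrative approximation, not JudgeLM's actual prompt format or scoring scale; the template, the 1-10 scale, and the reply format are all assumptions.

```python
# Minimal sketch of an LLM-as-judge scoring flow, in the spirit of JudgeLM.
# The prompt template and 1-10 scale are illustrative assumptions, not
# JudgeLM's published format; no real model is called here.
import re


def build_judge_prompt(question: str, answer: str, reference: str) -> str:
    """Assemble a grading prompt pairing a candidate answer with a reference."""
    return (
        "You are an impartial judge. Rate the answer from 1 to 10.\n"
        f"Question: {question}\n"
        f"Reference answer: {reference}\n"
        f"Candidate answer: {answer}\n"
        "Reply as: Score: <n>. Reason: <why>."
    )


def parse_judgement(judge_output: str):
    """Extract the numeric score and the justification from the judge's reply."""
    match = re.search(r"Score:\s*(\d+)\.?\s*Reason:\s*(.*)", judge_output, re.S)
    if not match:
        return None
    return int(match.group(1)), match.group(2).strip()


# Example with a canned judge reply (standing in for real model output):
reply = "Score: 8. Reason: Correct and concise, minor omission."
print(parse_judgement(reply))  # (8, 'Correct and concise, minor omission.')
```

Keeping the reply format machine-parseable is what makes batch evaluation cheap: the judge model is called once per answer and the score is recovered with a single regular expression.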
ZhiYuan Research Institute has unveiled Aquila2-34B, the new open-source bilingual 34-billion-parameter model in its Wudao·Tianying (Aquila) series, which excels in reasoning, generalization, and more. The institute has also released a comprehensive open-source toolkit to promote collaborative innovation in large-model research. Aquila2-34B surpasses other open-source foundation models in overall capability, and the ZhiYuan team developed the NLPE method to enhance the model's extrapolation capabilities.
The ZhiYuan Research Institute has released MTP, the world's largest open-source training dataset of related Chinese-English text pairs for semantic vector models, comprising 300 million pairs. Drawn from multiple sources, the dataset covers types such as Q&A, comments, and news, and provides an important foundation for training embedding models. The institute stated that such data plays a crucial role in training large models and will promote collaborative innovation in artificial intelligence; the release is expected to ease the shortage of training data for Chinese models.
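A common recipe for turning a text-pair dataset like MTP into a semantic vector model is in-batch contrastive learning: each pair's two texts should embed close together, while the other pairs in the batch serve as negatives. The sketch below uses random vectors in place of encoder outputs, and the temperature value is an illustrative assumption, not a published hyperparameter.

```python
# Sketch of in-batch contrastive (InfoNCE) loss over paired embeddings,
# the standard recipe for training semantic vector models on text pairs.
# Random vectors stand in for encoder outputs; the temperature of 0.05
# is an illustrative assumption.
import numpy as np


def info_nce_loss(query_emb: np.ndarray, pos_emb: np.ndarray,
                  temperature: float = 0.05) -> float:
    """Each query's positive is its paired text; other in-batch pairs are negatives."""
    q = query_emb / np.linalg.norm(query_emb, axis=1, keepdims=True)
    p = pos_emb / np.linalg.norm(pos_emb, axis=1, keepdims=True)
    logits = q @ p.T / temperature               # (batch, batch) cosine similarities
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.diag(log_probs).mean())     # diagonal holds the true pairs


rng = np.random.default_rng(0)
queries = rng.normal(size=(4, 8))
positives = queries + 0.01 * rng.normal(size=(4, 8))  # near-identical pairs
print(info_nce_loss(queries, positives))              # low loss for aligned pairs
```

The scale of the dataset matters here because every additional pair in a batch contributes negatives for all the others, so large pair collections directly improve the contrastive signal.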