University of California, Santa Cruz Develops Open Source Multimodal Model MiniGPT-5


European AI company Mistral AI released the new open-source coding model family Devstral2, including a 123B parameter flagship version and a 24B lightweight version, along with a complementary command-line tool Mistral Vibe CLI that supports automated programming. The model achieved 72.2 points on the SWE-bench benchmark, approaching the performance of top closed-source models, and the API is currently freely available, offering strong support for developers.
ByteDance's Seed team launched Seedream4.0, a next-gen image generation model with enhanced multimodal capabilities, supporting text-to-image, image-to-image, and multi-image editing for flexible creative experiences.....
Step-Audio2mini, an open-source audio model by Step Fun, achieves SOTA in benchmarks. It excels in speech understanding, translation, and emotion analysis with unified audio inference and generation.....
The Nanometer AI Super Search Intelligent Entity under the 360 Company has undergone a major update, adding new features such as multimodal content generation, cross-domain professional search, and smarter task preview functions. From one-click generation of PPTs, PDF reports to automatically integrating videos, voiceover scripts, and storyboard planning, Nano AI redefines the boundary of AI search and creation with more efficient and intuitive experiences. AIbase comprehensively organizes the latest social media dynamics to help you deeply understand the latest breakthroughs of Nano AI. Multimodal Generation: Handle everything from PPTs to videos with one click.
Recently, Tencent's Hunyuan Video Model has officially begun recruiting testing partners on Platform X, marking a critical testing stage for this cutting-edge AI video generation technology. According to official sources, there is a high probability that the model will be open-sourced after the testing concludes, contributing its technological achievements to the global AI community. The Hunyuan Video Model is an important innovation by Tencent in the field of AI video generation, boasting over 13B parameters, making it one of the largest video generation models among open-source models.