Stability AI Releases Two Japanese Language Models

Stability AI Japan has recently released two Japanese language models, "Japanese Stable LM 3B-4E1T" and "Japanese Stable LM Gamma 7B". Both are built on English language models and were further pre-trained on large amounts of Japanese and English data to strengthen their Japanese language processing. The former has 3 billion parameters and the latter has 7 billion. The release is a notable step for natural language processing in Japan, giving users stronger performance in models small enough to be portable. Performance evaluations show that the 3-billion-parameter model performs well across multiple tasks despite its smaller size, while the 7-billion-parameter model scores higher still.

GPT-4 Level! VITA-1.5: Real-time Visual and Voice Interaction with 1.5 Seconds Interaction Delay

The VITA-MLLM team recently announced VITA-1.5, an upgrade to VITA-1.0 aimed at improving the real-time responsiveness and accuracy of multimodal interaction. VITA-1.5 supports both English and Chinese and achieves significant improvements on multiple performance metrics, giving users a smoother interaction experience. The interaction delay has been cut from 4 seconds to just 1.5 seconds, making the lag far less noticeable during voice interactions.

Is OpenAI Nervous After the Release of Doubao Video Model PixelDance? Major Upgrade for Sora Announced

OpenAI has caused a stir in the video AI field with plans for a significant upgrade to Sora, which was released in February of this year. According to a report from The Information, the main goal of the upgrade is to substantially improve Sora's performance so that it can generate longer, higher-quality video more quickly. Sora's initial version had several pressing problems. It took over 10 minutes to generate a video, which is too slow for practical use.

AI Developers Rejoice! Google Gemini 1.5 Upgrade: Performance Soars, Prices Halved

Google today announced the newly upgraded Gemini model series, which includes Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002. The update significantly improves performance and also brings a steep price cut, creating a buzz in the AI development community. The most eye-catching change is the price: the usage cost of the new models has dropped by more than 50%. At the same time, performance has improved markedly.

Progress in Copyright Infringement Cases Faced by AI Image Generation Companies May Favor Artists

In a pivotal copyright infringement case, a judge ruled that visual artists' lawsuit against well-known AI image and video generation companies can proceed to the discovery phase. The artists accuse the companies, which include Midjourney, Runway, Stability AI, and DeviantArt, of using their works without permission to train AI models. The crux of the case is whether the AI companies' conduct constitutes "induced infringement." The judge indicated that the allegations are strong enough to warrant further investigation.

Stability AI Launches New AI Model Stable Fast 3D: Generate 3D Images in Half a Second with 1200 Times Speed Improvement

Stability AI has announced a new generative AI technology called Stable Fast 3D, which can quickly generate 3D images from a single image. It takes only half a second, a processing speed 1200 times faster than previous methods. Stable Fast 3D is the result of a collaboration between Stability AI and 3D modeling provider Tripo AI. It handles high resolutions efficiently by using an enhanced transformer network together with new material and lighting estimation methods, and it reduces aliasing artifacts.