Stability AI Releases Two Japanese Language Models


Recently, the VITA-MLLM team announced the launch of VITA-1.5, an upgraded version based on VITA-1.0, aimed at enhancing the real-time and accuracy of multimodal interaction. VITA-1.5 not only supports English and Chinese but also achieves significant improvements in multiple performance metrics, providing users with a smoother interaction experience. In VITA-1.5, the interaction delay has been greatly reduced from 4 seconds to just 1.5 seconds, making it almost imperceptible for users during voice interactions.
Recently, OpenAI has caused a stir in the video AI field as they are set to make significant upgrades to Sora, which was released in February of this year. According to reports from the Information Daily, the main goal of this upgrade is to significantly enhance Sora's performance, enabling it to generate longer and higher quality video content more quickly. Reflecting on the initial performance of Sora, there were indeed many pressing issues that needed addressing. The original version took over 10 minutes to generate a video, an efficiency level that clearly fails to meet practical application needs.
Google today announced the launch of the newly upgraded Gemini model series, which includes Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002. This update not only significantly enhances performance but also brings surprising price discounts, undoubtedly creating a buzz in the AI development community. Firstly, the most eye-catching aspect is the substantial price reduction. The usage cost of the new models has been halved, with a drop of over 50%. At the same time, performance has seen remarkable improvements.
In a pivotal copyright infringement lawsuit, a judge ruled that visual artists' lawsuits against well-known AI image and video generation companies can proceed and enter the discovery phase. The artists accuse these companies of using their works without permission to train AI models, including Midjourney, Runway, Stability AI, and DeviantArt. The crux of the case is whether the AI companies constitute 'induced infringement.' The judge indicated that the allegations are strong enough to warrant further investigation.
Stability AI has announced a new generative AI technology called Stable Fast 3D, capable of quickly generating 3D images from a single image, processing speeds improved by 1200 times compared to previous methods, taking only half a second. Stable Fast 3D is based on the collaboration between Stability AI and 3D modeling provider Trip AI, and achieves efficient processing of high resolutions using enhanced transformer networks and innovative material and lighting estimation methods, reducing aliasing artifacts.