Trained on 14 trillion tokens: Falcon3 challenges mainstream open-source AI models

The Technology Innovation Institute (TII) in Abu Dhabi has released Falcon3, its next-generation open-source AI model. Trained on 14 trillion tokens, more than double the training data of its predecessor Falcon2, and built on an optimized architecture, it is designed to run on consumer-grade hardware, where TII says it sets a new performance record.

The Falcon3 series comes in four sizes: 1B, 3B, 7B, and 10B. Each is available as a base version and as an Instruct version optimized for dialogue. The models are designed primarily for English, French, Spanish, and Portuguese, but all of them can handle most commonly used languages.

In third-party language model evaluations on Hugging Face, Falcon3 outperformed mainstream open-source models, including Meta's Llama-3.1-8B, Qwen2.5-7B, Mistral's NeMo-12B, and Google's Gemma2-9B, demonstrating strong competitiveness.

Falcon3 scores higher on the relevant benchmarks than similarly sized competitors from Mistral, Alibaba, Meta, and Google. | Image: Technology Innovation Institute

TII emphasizes ease of use: the models are compatible with standard APIs and libraries, and resource-optimized quantized versions are provided for specific hardware configurations. The institute has also launched a free chatbot so that users can test the models and provide feedback. Its interface closely follows ChatGPT's design and includes similar features, such as project folders.

Looking ahead, TII plans to expand the capabilities of the Falcon3 series in early 2025, introducing multimodal models that support image, video, and voice processing. Currently, all models are available for free download on the Hugging Face platform under the TII Falcon license based on Apache 2.0, which includes guidelines for promoting responsible AI use.
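For readers who want to try the models through a standard library, the following is a minimal sketch, not an official TII example. It assumes the Hugging Face `transformers` package is installed and that repositories follow the naming pattern `tiiuae/Falcon3-<size>-<Base|Instruct>`; the helper names `falcon3_repo_id` and `generate` are hypothetical and introduced here only for illustration.

```python
# Sizes named in the article; the repo naming pattern below is an assumption.
SIZES = ("1B", "3B", "7B", "10B")


def falcon3_repo_id(size: str, instruct: bool = True) -> str:
    """Build an assumed Hugging Face repo ID, e.g. 'tiiuae/Falcon3-7B-Instruct'."""
    if size not in SIZES:
        raise ValueError(f"unknown Falcon3 size {size!r}; expected one of {SIZES}")
    variant = "Instruct" if instruct else "Base"
    return f"tiiuae/Falcon3-{size}-{variant}"


def generate(prompt: str, size: str = "1B", max_new_tokens: int = 64) -> str:
    """Download the chosen model on first use and return generated text."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=falcon3_repo_id(size))
    result = generator(prompt, max_new_tokens=max_new_tokens)
    return result[0]["generated_text"]


if __name__ == "__main__":
    print(falcon3_repo_id("7B"))
    # Uncomment to download the model and run it (requires network and memory):
    # print(generate("Explain what a quantized model is in one sentence."))
```

The smallest size is used as the default here simply because it is the cheapest to download; which size or quantized variant fits a given machine depends on its available memory.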

Falcon Chat has a very similar interface to ChatGPT and includes comparable features. | Image: Screenshot from THE DECODER

This release marks another significant advance for open-source AI, particularly in improving model performance on consumer-grade hardware. With the planned multimodal capabilities, Falcon3 is expected to open up new application possibilities for the open-source AI community in 2025.

RWKV: Small Team Aims to Be the Android of the AI Era with Its Large Model

Meta Intelligence OS is a startup founded by Peng Bo. It has developed a series of large models based on the open-source RWKV model and aims to become the Android of the large-model era. RWKV offers strong performance at low cost in inference tasks, which has attracted customers from industries such as finance, law, and smart hardware. The company's business model combines model customization on private data with in-house AI Agent development. By deploying large models on terminal devices, it hopes to solve the problems of API call latency and data security. RWKV versions are currently available for Windows, Mac, and Linux computers, and Android and iOS versions are in development. Meta Intelligence OS is raising funds and working with chip companies and computing power platforms to win benchmark customers. Luo Xuan said that the decisive battlefield for large models is hardware, and that both terminal devices and the cloud require dedicated chips.

Shanji AI Photography Glasses Officially Released: Starting Price 999 Yuan with Multi-Model Integration

Yesterday, Shanji Technology announced the launch of China's first mass-produced AI photography glasses, the Shanji AI Photography Glasses. The retail price is 1,499 yuan, while the first batch of 50,000 co-creation units is discounted to 999 yuan, with a promotional offer that refunds 300 yuan to buyers who use the glasses for 200 days. The Shanji AI Photography Glasses are the first in the industry to feature a Sony 16MP, 123-degree ultra-wide-angle camera module, and they are built on a flagship low-power ARM platform from UNISOC. These glasses also include a 6500mAh extended battery, supporting HI-F.

State Grid, Alibaba, and Baidu Release the 'Bright Power Large Model' with a Trillion Parameters

State Grid Corporation recently announced the launch of the 'Bright Power Large Model', China's first trillion-parameter artificial intelligence large model for the power industry, and signed a strategic cooperation framework agreement with Baidu and Alibaba. State Grid officials stated that they will work with the signing parties to build the Bright Power Large Model and to promote the integration of technological innovation and industrial innovation in energy and power.

CompassArena Upgrade: Launch of New Judge Copilot Feature

The OpenCompass team at Shanghai Artificial Intelligence Laboratory and ModelScope have jointly launched an upgrade to the large model evaluation platform CompassArena. The upgrade aims to give users a more scientific and comprehensive model evaluation experience. Since its launch, the platform has attracted a large number of community users who participate and contribute data. The team has continued to refine CompassArena based on this data, and this upgrade adds the new Judge Copilot feature and improves the ranking algorithm.

Douyin Vice President Denies a Price War for Large Models: Promoting the Inclusive Development and Application of AI Technology

Today, in response to rumors that ByteDance might start a price war over large models, Douyin Vice President Li Liang issued a statement on social media saying plainly that this is not a price war. Li Liang pointed out that the Doubao large model has reduced costs through technological innovation, with significant optimizations in algorithms, software engineering, and hardware solutions. He said that the price of 0.3 yuan per 1,000 tokens still yields a considerable gross profit and reflects a transparent pricing strategy rather than the traditional 'list price discount' model.
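To make the per-token pricing concrete, here is a small arithmetic sketch. It takes the rate of 0.3 yuan per 1,000 tokens exactly as quoted above; the function name `cost_yuan` is hypothetical, and the sketch ignores any distinction between input and output tokens, which the statement does not describe.

```python
from decimal import Decimal

# Rate as quoted in the article; treated here as a flat price per 1,000 tokens.
PRICE_PER_1K_TOKENS = Decimal("0.3")


def cost_yuan(tokens: int, price_per_1k: Decimal = PRICE_PER_1K_TOKENS) -> Decimal:
    """Return the cost in yuan for a given token count at a flat per-1,000 rate."""
    if tokens < 0:
        raise ValueError("token count must be non-negative")
    # Decimal avoids binary floating-point rounding in money arithmetic.
    return Decimal(tokens) / 1000 * price_per_1k


if __name__ == "__main__":
    for n in (2_500, 1_000_000):
        print(f"{n} tokens cost {cost_yuan(n)} yuan")
```

At this rate, cost scales linearly with usage, so one million tokens cost 1,000 times the per-1,000 price.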