Hunyuan-A13B is a newly open-sourced large language model by Tencent. With an innovative design concept, it achieves strong performance with a relatively small number of active parameters, making it especially suitable for resource-constrained environments.

QQ20250630-092741.png

This model adopts a fine-grained MoE (Mixture-of-Experts) architecture, featuring 13 billion active parameters, but the total parameter count reaches up to 800 billion. This design allows it to maintain efficiency and scalability while providing cutting-edge reasoning capabilities and support for general applications.

QQ20250630-092705.png

The core features of Hunyuan-A13B include:

  • Hybrid reasoning mode supporting fast and slow thinking: This unique reasoning mechanism allows the model to adjust its depth of thinking according to task requirements, improving the efficiency of handling complex problems.

  • Native 256K ultra-long context understanding capability: This means the model can process extremely long text inputs, making it perform well in tasks that require a lot of background information.

  • Outstanding performance in agent-related tasks: Hunyuan-A13B demonstrates strong capabilities when performing various agent (Agent)-related tasks.

To achieve efficient reasoning, Hunyuan-A13B adopts Grouped Query Attention (GQA) technology and supports multiple quantization formats. Currently, the model has open-sourced pre-trained, instruction-tuned, FP8, and INT4 quantized versions, making it convenient for developers to use.

In various benchmark tests, Hunyuan-A13B has shown strong competitiveness, especially in mathematics, science, coding, reasoning, and agent domains.

Tencent provides comprehensive support for developers, including detailed guides for interaction and model training using Hugging Face Transformers. Additionally, for model deployment, Hunyuan-A13B offers support through TensorRT-LLM, vLLM, and SGLang, and provides pre-built Docker images and quantized model deployment solutions, greatly simplifying the deployment process.

The open-sourcing of Hunyuan-A13B undoubtedly opens new possibilities for the application of large models in resource-constrained environments and brings new innovation power to the AI community.

Open-source address: https://huggingface.co/tencent/Hunyuan-A13B-Instruct

github: https://github.com/Tencent-Hunyuan/Hunyuan-A13B?tab=readme-ov-file