Lakera Launches API to Protect Large Language Models from Malicious Attacks


A study found that large language models continuously exposed to low-quality data (such as social media content) can exhibit effects resembling human brain damage, with reasoning ability declining by 23% and long-context memory by 30%. The damage is irreversible: even subsequent training on high-quality data cannot fully restore the lost performance.
Turing Award winner Yann LeCun has diverged from Meta over the direction of AI. As Meta's Chief AI Scientist, he recently and publicly criticized large language models as a dead end and advocated research into world models. Rumors of his departure have drawn attention; he previously led Meta's fundamental AI research lab, FAIR, and was considered a key intellectual advisor at the company.
Weibo's AI department has released VibeThinker-1.5B, an open-source large model with 1.5 billion parameters. The model is fine-tuned from Alibaba's Qwen2.5-Math-1.5B and performs well on math and code tasks. It is freely available on platforms such as Hugging Face under the MIT license, which permits commercial use.
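For readers who want to try the model, a minimal sketch of loading it with the Hugging Face transformers library follows; the repository id "WeiboAI/VibeThinker-1.5B" and the sample prompt are assumptions for illustration, not details confirmed in the announcement.

# Minimal sketch: load VibeThinker-1.5B from Hugging Face with transformers.
# The repo id below is an assumption; check the model's actual Hugging Face page.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "WeiboAI/VibeThinker-1.5B"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "What is the sum of the first 100 positive integers?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))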
The new version of Firefox has sparked controversy by enabling AI features by default, with users raising privacy and performance concerns. Tests show that the features significantly increase CPU and memory usage, degrading the browsing experience, and most users were unaware they had been turned on.
A study found that AI-generated social media posts can be identified with 70%-80% accuracy, far above chance. The research team tested posts from multiple large language models, revealing how readily their output can be distinguished from human writing.
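As an illustration only (not the study's actual method), the sketch below shows how a simple text classifier can be trained to separate AI-generated posts from human-written ones; the placeholder texts and labels are invented for the example.

# Illustrative sketch: a TF-IDF + logistic regression detector for AI-generated posts.
# Placeholder data; replace with a real labeled corpus to get meaningful accuracy numbers.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

posts = [
    "placeholder human post 1", "placeholder human post 2",
    "placeholder AI post 1", "placeholder AI post 2",
]
labels = [0, 0, 1, 1]  # 0 = human-written, 1 = AI-generated

X = TfidfVectorizer().fit_transform(posts)
X_train, X_test, y_train, y_test = train_test_split(
    X, labels, test_size=0.5, stratify=labels, random_state=0)
clf = LogisticRegression().fit(X_train, y_train)
print("detection accuracy:", accuracy_score(y_test, clf.predict(X_test)))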