MIT introduces Self-Distillation Fine-Tuning to reduce catastrophic forgetting; the method uses student-teacher demonstrations and requires 2.5x the compute.
Knowledge Distillation (KD) has been established as an effective technique for reducing the resource requirements of models when tackling computer vision tasks. Prior work has studied how to distill ...
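The abstract above mentions KD only at a high level. For context, here is a minimal sketch of the standard distillation objective (Hinton et al., 2015), not code from the paper being excerpted; the function name and the default values of the temperature `T` and mixing weight `alpha` are illustrative assumptions.

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Classic knowledge-distillation loss: a weighted sum of the
    hard-label cross-entropy and the KL divergence between
    temperature-softened teacher and student distributions."""
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    log_soft_student = F.log_softmax(student_logits / T, dim=-1)
    # The T**2 factor rescales gradients so the soft term's magnitude
    # stays comparable across temperatures.
    distill = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * distill + (1 - alpha) * hard
```

Higher `T` exposes more of the teacher's "dark knowledge" in the non-argmax classes; `alpha` trades that signal off against the ground-truth labels.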
The US is dominating headlines with frontier AI models, multi-billion-dollar investments and powerful chips, while China is making AI cheaper, widely deployable at home and abroad ...
Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
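The snippet above is cut off before it explains what OPCD embeds or how it trains. As background only, here is a hedged sketch of generic on-policy context distillation, the family of methods the name suggests: the same model, prompted with a context, supervises itself when prompted without it, on sequences it sampled itself. Nothing here is taken from Microsoft's recipe; the HuggingFace-style `generate`/`logits` interface, the function name, and all hyperparameters are assumptions.

```python
import torch
import torch.nn.functional as F

def context_distillation_step(model, ctx_ids, bare_ids, gen_len=64, T=1.0):
    """One generic context-distillation step, assuming a HuggingFace-style
    causal LM. 'On-policy' here means the training sequences are sampled
    from the student's own (context-free) distribution."""
    # Sample a continuation on-policy from the context-free prompt.
    with torch.no_grad():
        sampled = model.generate(bare_ids, max_new_tokens=gen_len,
                                 do_sample=True)
    cont = sampled[:, bare_ids.shape[1]:]
    c = cont.shape[1]

    # Teacher pass: same weights, but with the full context prepended.
    # logits[:, i] predicts token i+1, so the predictions for the
    # continuation tokens live at positions [-c-1:-1].
    with torch.no_grad():
        teacher_logits = model(
            torch.cat([ctx_ids, cont], dim=1)).logits[:, -c - 1:-1]

    # Student pass: no context; match the teacher's token distributions.
    student_logits = model(
        torch.cat([bare_ids, cont], dim=1)).logits[:, -c - 1:-1]

    return F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                    F.softmax(teacher_logits / T, dim=-1),
                    reduction="batchmean")
```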
Artificial intelligence developers are accusing Chinese firms of stealing their intellectual property following a spate of ‘distillation attacks’, despite their own alleged theft of training data.
Recently, two of the world's most important artificial intelligence (AI) companies, Google and OpenAI, have launched a ...
This month Anthropic and OpenAI each disclosed evidence that leading Chinese AI labs have illicitly used American models to ...
Anthropic accused three Chinese artificial intelligence enterprises of engaging in coordinated distillation campaigns, the ...
Anthropic alleges Chinese AI labs including DeepSeek, Moonshot and MiniMax used fake accounts to distill Claude, raising new concerns about AI model theft, proxies and U.S. export controls.
Anthropic said it is investing heavily in defences designed to make distillation attacks harder to execute and easier to identify.
Anthropic is accusing three Chinese artificial intelligence companies of "industrial-scale campaigns" to "illicitly extract" ...