Mercury 2 introduces diffusion LLMs to text, delivering 10x faster speeds for AI agents and production workflows without sacrificing reasoning power.
The new AI model uses diffusion reasoning to generate 1,000 tokens per second; it runs about 5x faster than Haiku, speed limits are ...