Google rolled out Gemini 3.1 Pro yesterday, touting a 77.1% score on novel logic puzzles that models can't just memorize (more than double Gemini 3 Pro's result) and record marks for expert-level scientific ...
Bruno, Fx, ActivityWatch, DDEV, and TLDR Pages are all dev tools that you should try out because they're much better than ...
Mercury 2 brings diffusion LLMs to text, running 10x faster for AI agents and production workflows without sacrificing reasoning power.
Anthropic claims Chinese AI labs ran large-scale Claude distillation attacks to steal data and bypass safeguards.