Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
The first is a Surface Go 2, a six-year-old machine that just barely squeaked in under the Windows 11 compatibility wire with ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results