Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
Have you ever felt stuck in the gap between ideas and reality? You sketch something exciting, imagine users paying for it, and then the project dies in a folder of unfinished files.