Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
The last couple of years have seen a huge rise in browser-based puzzle games, tasking players with working out a certain kind of answer using limited guesses. Framed is one of the newest, following in ...