Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
The last couple of years have seen a huge rise in browser-based puzzle games, tasking players with working out a certain kind of answer using limited guesses. Framed is one of the newest, following in ...