Abstract: The quality of modern software relies heavily on the effective use of static code analysis tools. To improve their usefulness, these tools should be evaluated using a framework that ...
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Abstract: Static type inference is an effective way to maintain the safety of programs written in a dynamically typed language. However, foreign functions implemented in another programming language ...
Google’s Chrome team previews WebMCP, a proposed web standard that lets websites expose structured tools for AI agents instead of relying on screen scraping.