Abstract: Software regression testing is a crucial phase for maintaining software quality. To optimize the cost and efficiency of regression testing, selective test case generation and selection have ...
In this tutorial, we show how we treat prompts as first-class, versioned artifacts and apply rigorous regression testing to large language model behavior using MLflow. We design an evaluation pipeline ...
Abstract: Massive, multi-language, monolithic repositories form the backbone of many modern, complex software systems. To ensure consistent code quality while still allowing fast development cycles, ...