Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
├── problems/ # Problem solutions organized by name │ ├── two_sum/ # Individual problem directories │ │ ├── config.yml # Problem metadata and configuration │ │ ├── two_sum.py # Python solution │ │ ├── ...
A comprehensive toolkit for fine-tuning Qwen2.5 models on Japanese language data using NVIDIA NeMo 2.0. This project provides modular, production-ready implementations for PEFT (LoRA), SFT, and ...