Skills are folders of instructions, scripts, and resources that Claude loads dynamically to improve performance on specialized tasks. Skills teach Claude how to complete specific tasks in a repeatable ...
The official evaluation toolkit for Very Big Video Reasoning (VBVR). Unified inference and evaluation across 37 video generation models. VBVR-Bench matches each task to a rule-based evaluator by the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results