FindSkills

evals

Write and analyze evaluations for AI agents and LLM applications. Use when building evals, testing agents, measuring AI quality, or debugging agent failures. Use this skill when you need to test the performance of an LLM or Agent, or if the user mentions EZVals.

by camronh community Source: github
Quality: medium Safety: community Category: Coding Updated: 2026-02-21
View on GitHub JSON API

Related Skills in Coding

notebooklm
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic...
gs-quant
Python toolkit for quantitative finance
node-minify
Light Node.js module that compress javascript, css and html files
vera-language
Vera: a programming language designed for LLMs to write
mck-ppt-design
McKinsey-style PowerPoint design system for creating professional presentations ...
draw-io
Native draw.io skill with export helpers, SVG linting, and public docs for Codex...

View all Coding skills