Llm Evaluator

Name: Llm Evaluator
Author: clawhub

LLM-as-a-Judge evaluation system using Langfuse. Score AI outputs on relevance, accuracy, hallucination, and helpfulness. Backfill scoring on historical trac...

by clawhub community Source: clawhub

Quality: medium Safety: community Category: AI & ML Updated: 2026-03-05

View on ClawHub JSON API

Related Skills in AI & ML

"stock_analyzer"

LLM驱动的 A/H/美股智能分析器，多数据源行情 + 实时新闻 + Gemini 决策仪表盘 + 多渠道推送，零成本，纯白嫖，定时运行

claudeception

A Claude Code skill for autonomous skill extraction and continuous learning. Hav...

claw-compactor

🦞 Claw Compactor — The 98% Crusher. Cut your AI agent token spend in half with ...

pensieve

tore your decisions and principles. Claude reads them to make better choices.

vuln-analysis-expert

wooyun-legacy skill for claude code

frontend-slides

Create beautiful slides on the web using Claude's frontend skills

View all AI & ML skills