Minimax Image Understanding

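Uses multimodal large models to understand image content and describe its business meaning. Supports multiple models: (1) MiniMax VLM, (2) OpenAI GPT-4V, (3) Claude Vision. Intended for screenshots, charts, photos of documents, and similar images, producing accurate text descriptions.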

Author: clawhub (community); Source: clawhub

Quality: Medium; Security: Community; Category: Media; Updated: 2026-03-08


Related skills in the Media category

A CLI for Bilibili — browse videos, users, favorites from the terminal 📺

ppt-svg-generator

ppt-svg-generator is a Skill that helps you quickly convert Markdown drafts into PPT or PDF, with a choice of preset styles for attractive, controllable results. ...

video-podcast-maker

Automated video podcast creation skill

Claude Code skill that translates entire books (PDF/DOCX/EPUB) into any language...

Skill to talk to Claude about your projects over the phone

A CLI for Bilibili — browse videos, users, search, and feeds from the terminal
