FindSkills

Minimax Image Understanding

使用多模态大模型理解图片内容,生成业务含义描述。支持多种模型:(1) MiniMax VLM (2) OpenAI GPT-4V (3) Claude Vision。用于理解截图、图表、文档照片等,生成精准的文字描述。

by clawhub community Source: clawhub
Quality: medium Safety: community Category: Media Updated: 2026-03-08
View on ClawHub JSON API

Related Skills in Media

bilibili-cli
A CLI for Bilibili — browse videos, users, favorites from the terminal 📺
ppt-svg-generator
ppt-svg-generator 是一个 Skill,帮助你将 Markdown 文稿快速转化PPT 或 PDF,并支持多种预设风格选择,效果美观且可控。 使...
video-podcast-maker
Automated video podcast creation skill
translate-book
Claude Code skill that translates entire books (PDF/DOCX/EPUB) into any language...
call
Skill to talk to Claude about your projects over the phone
bilibili-cli
A CLI for Bilibili — browse videos, users, search, and feeds from the terminal

View all Media skills