Minimax Image Understanding

使用多模态大模型理解图片内容，生成业务含义描述。支持多种模型：(1) MiniMax VLM (2) OpenAI GPT-4V (3) Claude Vision。用于理解截图、图表、文档照片等，生成精准的文字描述。

by clawhub community Source: clawhub

Quality: medium Safety: community Category: Media Updated: 2026-03-08

View on ClawHub JSON API

Related Skills in Media

A CLI for Bilibili — browse videos, users, favorites from the terminal 📺

ppt-svg-generator

ppt-svg-generator 是一个 Skill，帮助你将 Markdown 文稿快速转化PPT 或 PDF，并支持多种预设风格选择，效果美观且可控。使...

video-podcast-maker

Automated video podcast creation skill

Claude Code skill that translates entire books (PDF/DOCX/EPUB) into any language...

Skill to talk to Claude about your projects over the phone

A CLI for Bilibili — browse videos, users, search, and feeds from the terminal

View all Media skills