メディアタスク向けの 3,611 件のツールとスキル
local search jellyfin and remote search byrpt
PDF skill for AI coding agents — generate polished PDFs from topics or Markdown files
Claude Code skill — transcribe & summarize Apple Voice Memos on-device using Apple Silicon GPU (MPS). M1/M2/M3/M4 on
Download YouTube video transcripts with automatic translation to Vietnamese and AI summarization. Perfect for learning E
A Claude skill that generates visual proof pages from PDF documents — find values, crop screenshots with highlights, and
YouTube動画の文字起こしをダウンロードするCLIツール (yt-dlp)
Given a DOI, returns the PDF download URL from Sci-Hub. That's it.
Claude Code skill for Gemini PRO UI automation - image/video generation and deep research
Real-time X/Twitter search powered by Grok-4. Find tweets, trends, and discussions with citations. Grok-4.20 also return
Give your AI agent X/Twitter and LinkedIn access for $0/month. Saves $2,400/year vs APIs.
Generate Images with any AI model on ImageRouter (requires API key).
自动从 Anna's Archive 下载书籍并上传到 Google NotebookLM。支持 PDF/EPUB 格式,自动转换,一键创建知识库。
Ultimate personalization engine for Apple Music. Analyzes listening history, Apple Music Replay stats, library data, and
Control Android cloud phones via ADB - tap, swipe, type, screenshot, read UI elements. Send commands to DuoPlus cloud ph
Control Music Assistant (Home Assistant music server) - playback, volume, queue management, and library search. Use when
Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (
个性化资讯电台生成服务。使用场景:(1) 生成特定主题的电台,(2) 设置每日定时推送,(3) 配置TTS音色,(4) 收听历史电台。不适用:音乐播放、实时广播、视频内容。
Publish tweets to X (Twitter) using the official Tweepy library. Supports text-only tweets, tweets with images or videos
WeChat Official Account Draft Box management tool. Create and manage graphic draft articles via WeChat API, supporting t
使用 MinerU API 解析 PDF 文件。当用户要求解析 PDF 文件时触发此技能。
Extract sales data from report images using OCR with cnocr, parse JSON via MiniMax API, and convert results to Excel spr
Automates video rough editing by detecting silence, scoring segments, removing duplicates, and generating a best-segment
Generate AI videos with ByteDance Seedance (豆包/火山方舟) via Ark API. Supports text-to-video and image-to-video using model
AI-powered YouTube comment moderation. Fetches comments, classifies them (spam, question, praise, hate, neutral, constru
图片理解技能,使用 Minimax Coding Plan VLM API 分析图片
Distill OpenClaw daily memory, session transcripts, and newly generated report files into new knowledge points and deepe
飞书图片发送工具,支持系统截屏、区域截图和本地图片文件发送到飞书工作区,方便快速分享屏幕内容。
老师作业批改助手,用于自动批改数学作业、统计错题、生成Excel统计表和PDF报告。当老师需要:(1) 上传正确答案并让AI识别 (2) 批量上传学生作业照片进行批改 (3) 统计全班错误率并生成错题分析报告 (4) 生成重点错题PDF供讲
Fetch follower counts and social media metrics from 11+ platforms using profile URLs or nicknames, including Bilibili, Y
管理 QQ空间相册。支持扫码登录、列出相册、浏览照片、上传照片、下载照片/相册、创建相册。当用户需要备份、整理或管理 QQ空间中的照片时使用此技能。
Integrates Bilibili hot trending monitoring, video downloading/playback, subtitle handling, and video publishing into on
Image recognition and understanding tool. Uses a multimodal model (e.g. doubao-seed-2.0-pro, kimi-k2.5) to analyze image
Enterprise-grade agentic document processing API. Accurately extracts key fields and line items from invoices, receipts,
Real time video translation / dubbing skill. Translate user-provided video (file or URL) and return preview_url. 适用于视频直译
Convert PDF and image documents to clean Markdown via the PDF2Markdown CLI. Use when the user wants to extract text from
TopMediai text-to-speech skill. Supports key entitlement info, voices listing (official + cloned), and text-to-speech ge
AI image and video generation CLI. Generate images, videos, and movie posters using 50+ models including Flux, Kling, Ve
Search, browse, and download high-quality free photos from Unsplash with filtering, random selection, and detailed photo
Make real phone calls through your OpenClaw agent via OpenAI's Realtime API. ~200-300ms latency, natural voice, IVR navi
Local speech-to-text using whisper-cli (whisper.cpp).
Write like the user, not like AI. ToneClone trains on the user's actual writing to generate content in their authentic v
One-step full-stack installer for OpenClaw WebChat voice input and local speech-to-text. Deploys faster-whisper backend
Enables the agent to create, manage, and publish a full-featured blog autonomously. The agent can write posts, upload me
Create, adapt, schedule, publish, and analyze AI-generated social media content across 10 platforms in 13 languages usin
Generates high-fidelity 1080p videos with synced audio using Google Veo 3.1. Use for creating cinematic clips from text
【技术文章配图大师】一个为技术媒体人设计的 Claude Skill,让 AI 配图告别"塑料感"。
Automate Twitter/X with posting, engagement, and user management via inference.sh CLI. Apps: x/post-tweet, x/post-create
Generate images with FLUX models (Black Forest Labs) via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA with
Create AI marketing videos for ads, promos, product launches, and brand content. Models: Veo, Seedance, Wan, FLUX for vi
Create AI-powered social media content for TikTok, Instagram, YouTube, Twitter/X. Generate: images, videos, reels, short