3,611 tools and skills for media tasks
Capture, summarize, and organize knowledge from URLs, YouTube videos, documents, and files. Proactively recall stored kn
Evaluate hi-fi and audio gear options, build system recommendations, guide installation and tuning, and analyze used-mar
Audit Amazon product listing images for non-square dimensions, auto-pad them to 2000×2000 white background, and push cor
Audit Amazon product listing images for non-square dimensions, auto-pad them to 2000×2000 white background, and push cor
Audit Amazon product listing images for non-square dimensions, auto-pad them to 2000×2000 white background, and push cor
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles&quo
Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image&q
Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word
Convert Bilibili (B站) videos into a searchable text knowledge base. Supports single videos and batch processing of entir
Manage personal receipts by extracting data from images, storing records, and providing expense searches and monthly fin
即梦AI视频生成工具(带声音版本),通过火山引擎API自动生成带音频的高质量视频。支持文生视频、图生视频,适用于短视频内容创作。
Generate images with Google's Nano Banana Gemini 2.5 Flash Image model via OpenRouter API using text prompts.
科技行业投资日报生成与推送。当用户要求生成科技投资日报、发送每日投资报告、或cron定时触发日报任务时使用。自动抓取财联社实时新闻、获取涉及上市公司股价、生成深度分析报告并通过飞书一条消息发送完整Markdown报告,同时生成PDF附件。
Query CJ Dropshipping API v2.0 to source products and fetch details for catalog building. Use for CJ keyword search, pul
Modularized Naver Search. Features separate scripts for Web, News, Shopping, and Images. Integrates SerpAPI specialized
case.dev — a legal AI platform with encrypted document vaults, OCR, audio transcription, and legal search. This skill in
Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe
Generate AI-optimized Alt Text, file names, captions, and Schema markup for images, videos, and audio assets. Improves A
Use when the user wants to trim, cut, or extract a specific segment from a video by time range — e.g. "cut from 1:3
Use when the user wants to change a video's aspect ratio or reformat it for a specific platform — e.g. "convert to
Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini), and more via f
Estimate API token costs from OpenClaw session transcripts. Analyzes all agent sessions to calculate what you'd pay on p
Use for AI image generation via IMA Open API. Supports text-to-image and image-to-image. IMPORTANT — Default model selec
Use for AI video generation via IMA Open API. Supports 4 modes: text-to-video (14 models), image-to-video (14 models), f
OctoFlow converts English task descriptions into GPU-accelerated Vulkan programs for data analysis, image processing, ML
Recommend suitable prompts from 10,000+ Nano Banana Pro image generation prompts based on user needs. Use this skill whe
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Perform audio editing tasks including trimming, volume adjustment, format conversion, and extracting audio from video fi
Create high-end cinematic scene prompts and production-ready scene briefs in a Hollywood producer voice. Use when the us
Japanese OCR via NDLOCR-Lite (National Diet Library). Trigger on 'OCR this image', '日文OCR', 'recognize Japanese text', o
Generate images using APIYI API.
Scrapes top viral LinkedIn posts in your niche, analyzes engagement drivers, and generates a personalized 30-day content
Convert Markdown files to styled PDFs using pandoc and wkhtmltopdf with built-in or custom CSS style options.
Automated community intelligence gathering for any open-source project or product. Searches Reddit, Hacker News, Twitter
OCR text recognition using DeepSeek-OCR model. Use when user asks for OCR, text recognition, image text extraction, scre
Pure intelligence gathering for social media platforms (TikTok & Instagram). Use when an agent needs to discover tre
高密度信息大图生成技能。根据文章/内容生成高信息密度的视觉大图,适合知识干货、笔记分享、课程内容可视化。支持8种风格:坐标蓝图波普实验室、复古波普网格、文件夹风、色块热敏纸、复古手帐、档案混合媒介、色块酸性风、票据剧场戏票风。使用 Imag
插画图片生成技能,支持多种手绘风格。使用 Imagen 3(ZenMux API)生成插图,适合流程图、功能说明、PPT配图、教程配图等场景。支持三种风格:A) Sketch 极简手绘风(Notion/Linear 风格,简笔人物,冷淡低饱
Search and summarize papers from ArXiv. Use when the user asks for the latest research, specific topics on ArXiv, or a d
Investigate a username across 1000+ social media platforms and websites using social-analyzer. Use this skill whenever t
Manage Facebook Pages via Meta Graph API. Post content (text, photos, links), list posts, manage comments (list/reply/hi
Virtual Imam that leads the five daily Islamic prayers via voice, delivers Friday Jumu'ah khutbahs, and interacts with m
Audit and rewrite content to remove AI-generated feel by stripping markdown artifacts, eliminating AI vocabulary pattern
Generate beautiful bento grid layouts for social media posts. Create Instagram/Twitter cards with statistics, calendars,
Set up, troubleshoot, and optimize HomePod and HomeKit audio workflows with reliable Siri control and room-aware playbac
CLI tool to clean up stopped Docker containers, unused images, volumes, and networks to free up disk space.
从URL提取图片并生成PDF
Generate, track, and download AI-created 3D models from text or images using Tripo AI, with free credits and optional AP
Generate images using Alibaba DashScope wan2.6-t2i model, download to Desktop, and upload to catbox.moe image hosting. U
A collection of PDF manipulation tools from the poppler-utils package. Use when working with PDF files - extracting text