3,611 tools and skills for media tasks
AI video production workflow using Remotion. Use when creating videos, short films, commercials, or motion graphics. Tri
Split image datasets into train, validation, and test sets with options for random or stratified splits, custom ratios,
Visualize bounding boxes and class labels on images with support for COCO, YOLO, VOC, and LabelMe annotation formats.
飞书云盘文件管理技能。用于读取、下载和管理飞书云盘中的文件。 当用户需要:访问飞书文件、下载文档、读取PDF/Word/PPT文件、分析飞书云盘内容时使用。 核心方法:使用 tenant_access_token 调用 Drive API
Generate and monitor CyberBara Public API v1 image and video tasks end-to-end. Use when work involves CyberBara `/api/v1
[Aibrary] Generate a book Idea Twin podcast script — an intellectually stimulating debate between the user's AI twin and
[Aibrary] Generate a book summary podcast script in a single-narrator storytelling style. Use when the user wants to tur
智能唤醒“小龙虾”,启用百度高准确度语音识别,持续监听并自动输入语音内容,支持“停止”暂停输入。
[Aibrary] Generate a book dialogue podcast script with two speakers — a host and a guest expert — discussing the book's
Crop objects from images using bounding box annotations in COCO, YOLO, VOC, or LabelMe formats with optional padding and
For developers who want to become SRS contributors or maintainers — learn SRS or any of its modules (ST, protocols, medi
Rename image datasets and annotations with custom patterns, prefixes, suffixes, and sequential numbering, including prev
Build Docker images from a Dockerfile with support for build args, tags, multi-stage builds, and efficient layer caching
HaS (Hide and Seek) on-device text and image anonymization. Text: 8 languages (zh/en/fr/de/es/pt/ja/ko), open-set entity
Query and manage Timeless meetings, rooms, transcripts, and AI documents. Capture podcast episodes and YouTube videos in
中国计算机软件著作权登记申请材料生成器。用于生成软著申请所需的全部材料,包括软件基本信息表单填写、程序鉴别材料PDF(源代码语法高亮)、文档鉴别材料PDF(用户手册)。当用户提到"软著"、"软件著作权"
Transcribe audio using a deployed Cloudflare Worker Whisper endpoint. Use when converting voice/audio files (wav, mp3, m
Generate/edit images via Tuzi API (default), Google Gemini, OpenAI, DashScope, Replicate. Text-to-image + image-to-image
终端截图OCR识别技能。专门处理终端/命令行界面的截图,提取文本内容并进行分析。
Generates article cover images with 5 dimensions (type, palette, rendering, text, mood) combining 9 color palettes and 6
Upload files to Tencent Cloud COS (Cloud Object Storage). Use when the user needs to upload images, documents, or any fi
Control Frameo digital photo frames via cloud API for frame info or ADB for full control including photo upload, brightn
Transform books and documents into professional audiobooks with natural voices. Multiple narrators and styles available.
End-to-end SEO article pipeline for any blog. Research keywords → analyze competition → write article → generate images
สร้างรูปภาพและวิดีโอ AI ฟรีผ่าน Google Gemini และ Google Flow โดยใช้ browser automation ไม่ต้องจ่าย API fee ใช้เมื่อต้อง
Monitor F5-TTS distributed training on the 9-GPU mining rig (Local-LLM) without interfering with the process.
Save and organize links, notes, and timestamps into a searchable Idea Vault. Use when a user drops a YouTube/web link (o
PDF contract redaction tool. Use when the user needs to redact sensitive information from scanned PDF contracts. The too
Fetch and download YouTube video transcripts with timestamps and multi-language support by providing the video URL.
Compress JPG, PNG, WebP, and GIF images to reduce file size significantly while preserving visual quality with bulk comp
Download videos and metadata from TikTok, Instagram Reels, YouTube Shorts, and Xiaohongshu with automatic platform detec
汇总和分析世界主要媒体对重大新闻事件的报道和观点。当用户想要了解不同国家/地区媒体对同一事件的看法、立场差异或详细报道内容时使用,例如:"查看XX事件的国际媒体反应"、"世界媒体怎么看这件事"、&qu
Send bots to Zoom, Google Meet, and Microsoft Teams meetings. Get live transcripts, recordings, and reports. Self-hosted
Play classic MTV music videos from the 80s, 90s, and 2000s. Use when the user wants to watch MTV, music videos, retro TV
Wirework scaffold for Theta EdgeCloud APIs (deployment, inference, video, chain/rpc) with dry-run safety.
Remember and retrieve visual content from conversations. Use when: (1) user sends an image, photo, chart, diagram, or sc
Analyze Polymarket sentiment using trading data, social media trends, and news to gauge positive, negative, or neutral m
Creates TikTok image carousels with text overlays using Pexels API & FFmpeg, then uploads via PostBridge API. Use wh
照片批量整理工具,支持按时间、地点自动分类和打标签。适用于手机照片整理、相册归档等场景,帮助用户快速整理成千上万张照片!
End-to-end dropship product lifecycle pipeline. CJ Dropshipping sourcing → margin check → Flux Kontext AI hero image → W
End-to-end AI UGC video pipeline. Product info → GPT-4o-mini script → ElevenLabs voiceover → Aurora talking head (fal-ai
Plan, schedule, and track content across channels — newsletters, social media, blog posts, and videos. Manages pipeline
Two-layer memory architecture for board meeting decisions. Manages raw transcripts (Layer 1) and approved decisions (Lay
Free basic version that converts YouTube transcript into a structured blog draft. Reserves premium upgrade hooks for SEO
Automate Windows desktop by simulating mouse/keyboard input, managing clipboard, capturing screenshots, running commands
Diagnose and optimize full conversion funnels for paid traffic from Meta (Facebook/Instagram), Google Ads, TikTok Ads, Y
Trade AI capabilities with escrow-secured settlement and graded verification. BUY: delegate deep research, code security
视频文件批量重命名和整理工具,支持按时间、格式、分辨率等方式整理视频,批量重命名,预览模式和撤销功能!
Generate professional four-act AI video scripts, episode breakdowns, asset prompts, and Seedance 2.0 storyboard scripts
使用 Qwen3-TTS 本地语音合成,将文字转为语音文件,并可通过飞书发送语音消息(语音气泡格式)。当用户要求合成语音、TTS、语音生成、发送飞书语音消息时使用。支持 Apple Silicon (MPS) 和 CUDA GPU。