FindSkills

Gemini Audio MCP

The Gemini Audio MCP server brings enterprise-grade generative audio directly to your AI assistant. Built in high-performance Rust, it leverages Google's state-of-the-art models to provide a unified bridge for environmental sound design, expressive narration, and professional music production. ✨ Key Capabilities * 🎙️ Infinite Soundscapes: Generate complex, immersive environmental audio using the Gemini 2.0 Multimodal Live API. * 🎵 Music & SFX: Create high-fidelity rhythmic loops, full songs, and discrete foley cues via Google's Lyria 3 Pro and Clip models. * 🗣️ Expressive Voice: Convert text to speech with natural voice direction and emotional nuances. * 🎲 Seamless Looping: Features a proprietary 100ms micro-crossfade algorithm to ensure click-free, non-repeating background audio. * 🎭 Cinematic Transitions: Smoothly blend and crossfade between two distinct audio prompts for dynamic environment changes. * 🎛️ Universal Encoding: Direct Stdin-to-FFmpeg piping allows for zero-latency transcoding into 10+ formats (MP3, OGG, FLAC, OPUS, WAV, etc.). 🎮 Use Cases * Game Developers (UE5, Godot, Blender): Instantly generate procedural soundscapes and NPC dialogue lines during development. * Content Creators: Automate foley and background texture generation for video projects. * Productivity: Enhance your AI workspace with high-quality narration and focus-oriented ambient audio. --- 🛠️ Requirements * FFmpeg: Must be installed on the system path for audio transcoding. * API Key: A valid Google AI Studio (Gemini) API Key.

by jxoesneon community Source: smithery
mcp
Quality: medium Safety: community Category: Coding Updated: 2026-04-03
View on Smithery JSON API

Related Skills in Coding

notebooklm
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic...
gs-quant
Python toolkit for quantitative finance
node-minify
Light Node.js module that compress javascript, css and html files
vera-language
Vera: a programming language designed for LLMs to write
mck-ppt-design
McKinsey-style PowerPoint design system for creating professional presentations ...
draw-io
Native draw.io skill with export helpers, SVG linting, and public docs for Codex...

View all Coding skills