OpenClaw skill for WhisperX — speech-to-text with word-level timestamps, speaker diarization, and forced alignment
查看全部媒体技能