Category
Media
Spotify, YouTube, audio + video editing, transcription
18 servers in this category
- Anthropic Official 85.6k
EverArt
AI image generation via the EverArt API — text-to-image with multiple model choices.
- mudler Discovered 46.2k
LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
- mastra-ai Discovered 23.9k
mastra
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
- VoltAgent Discovered 8.9k
voltagent
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
- aiming-lab Discovered 3.2k
SimpleMem
SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal
- MiniMax-AI Discovered 1.5k
MiniMax-MCP
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
- joey-zhou Discovered 1.2k
xiaozhi-esp32-server-java
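An enterprise-grade Java management platform for Xiaozhi ESP32, providing an integrated front-end, back-end, and server solution for device monitoring, voice customization, role switching, and conversation history management.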
- mbailey Discovered 1.2k
voicemode
Natural voice conversations with Claude Code
- gyoridavid Discovered 1.1k
short-video-maker
Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.
- controlplaneio-fluxcd Discovered 626
flux-operator
GitOps on Autopilot Mode
- Community 601
Spotify
Control Spotify playback, browse libraries, search tracks, and manage playlists.
- Community 537
YouTube Transcript
Pull the transcript + chapter timestamps of any YouTube video — no API key required.
- youichi-uda Discovered 333
godot-mcp-pro
162 MCP tools for AI-powered Godot 4 development. Scene, animation, 3D, physics, particles, audio, shader, input simulation, runtime analysis, navigation, testing & more. $15 one-time.
- Community 94
Replicate
Run any model hosted on Replicate — image, video, audio, language — via one tool surface.
- Community
OpenAI Images
Generate and edit images with DALL-E 3 and gpt-image-1, piped straight to the OpenAI image API.
- Community
Whisper Transcribe
Transcribe audio files locally via Whisper — runs whisper.cpp under the hood.
- ElevenLabs Vendor
ElevenLabs TTS
Text-to-speech, voice cloning, and audio generation via ElevenLabs.
- Community
Fal.ai
Run any model hosted on Fal.ai — fast inference for FLUX, SD, video, audio.