Category
Media
Spotify, YouTube, audio + video editing, transcription
25 servers in this category · RSS
- Anthropic Official 87.9k
EverArt
AI image generation via the EverArt API — text-to-image with multiple model choices.
- mudler Discovered 47.2k
LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
- mastra-ai Discovered 25.5k
mastra
Mastra is the modern TypeScript framework for AI-powered applications and agents.
- screenpipe Discovered 19.5k
screenpipe
YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure. Connect to OpenClaw, Hermes agent and 100+ apps
- modelscope Discovered 18.7k
FunASR
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
- nukeop Discovered 17.9k
nuclear
Streaming music player that finds free music for you
- VoltAgent Discovered 9.8k
voltagent
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
- joeseesun Discovered 5.4k
qiaomu-anything-to-notebooklm
Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat articles, web pages, YouTube, PDF, Markdown, search queries → Podcast/PPT/MindMap/Quiz etc.
- aiming-lab Discovered 3.6k
SimpleMem
SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal
- MiniMax-AI Discovered 1.5k
MiniMax-MCP
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
- ElevenLabs Vendor 1.4k
ElevenLabs TTS
Text-to-speech, voice cloning, and audio generation via ElevenLabs.
- joey-zhou Discovered 1.3k
xiaozhi-esp32-server-java
小智ESP32的Java企业级管理平台,提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案
- mbailey Discovered 1.2k
voicemode
Natural voice conversations with Claude Code
- gyoridavid Discovered 1.2k
short-video-maker
Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.
- jordanrendric Discovered 839
claude-video-vision
Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis
- controlplaneio-fluxcd Discovered 626
flux-operator
GitOps on Autopilot Mode
- Community 605
Spotify
Control Spotify playback, browse libraries, search tracks, and manage playlists.
- Community 565
YouTube Transcript
Pull the transcript + chapter timestamps of any YouTube video — no API key required.
- unifapi-agent Discovered 482
skills
AI agent skills for UnifAPI MCP and public-data workflows
- youichi-uda Discovered 455
godot-mcp-pro
162 MCP tools for AI-powered Godot 4 development. Scene, animation, 3D, physics, particles, audio, shader, input simulation, runtime analysis, navigation, testing & more. $15 one-time.
- AliAkhtari78 Discovered 267
SpotifyScraper
Extract public Spotify data — tracks, albums, artists, playlists, podcasts & lyrics — without the official API. Sync + async, typed models, one dependency.
- Community 103
OpenAI Images
Generate + edit images via DALL-E 3 + gpt-image-1 — straight pipe to the OpenAI image API.
- Community 95
Replicate
Run any model hosted on Replicate — image, video, audio, language — via one tool surface.
- Community 2
Fal.ai
Run any model hosted on Fal.ai — fast inference for FLUX, SD, video, audio.
- Community
Whisper Transcribe
Transcribe audio files locally via Whisper — runs whisper.cpp under the hood.