Category

Media

Spotify, YouTube, audio + video editing, transcription

25 servers in this category · RSS

Sign in to follow

Anthropic Official 87.9k

EverArt

AI image generation via the EverArt API — text-to-image with multiple model choices.

0 0 Discuss →
mudler Discovered 47.2k

LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

0 0 Discuss →
mastra-ai Discovered 25.5k

mastra

Mastra is the modern TypeScript framework for AI-powered applications and agents.

0 0 Discuss →
screenpipe Discovered 19.5k

screenpipe

YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure. Connect to OpenClaw, Hermes agent and 100+ apps

0 0 Discuss →
modelscope Discovered 18.7k

FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

0 0 Discuss →
nukeop Discovered 17.9k

nuclear

Streaming music player that finds free music for you

0 0 Discuss →
VoltAgent Discovered 9.8k

voltagent

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

0 0 Discuss →
joeseesun Discovered 5.4k

qiaomu-anything-to-notebooklm

Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat articles, web pages, YouTube, PDF, Markdown, search queries → Podcast/PPT/MindMap/Quiz etc.

0 0 Discuss →
aiming-lab Discovered 3.6k

SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal

0 0 Discuss →
MiniMax-AI Discovered 1.5k

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

0 0 Discuss →
ElevenLabs Vendor 1.4k

ElevenLabs TTS

Text-to-speech, voice cloning, and audio generation via ElevenLabs.

0 0 Discuss →
joey-zhou Discovered 1.3k

xiaozhi-esp32-server-java

小智ESP32的Java企业级管理平台，提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案

0 0 Discuss →
mbailey Discovered 1.2k

voicemode

Natural voice conversations with Claude Code

0 0 Discuss →
gyoridavid Discovered 1.2k

short-video-maker

Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.

0 0 Discuss →
jordanrendric Discovered 839

claude-video-vision

Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis

0 0 Discuss →
controlplaneio-fluxcd Discovered 626

flux-operator

GitOps on Autopilot Mode

0 0 Discuss →
Community 605

Spotify

Control Spotify playback, browse libraries, search tracks, and manage playlists.

0 0 Discuss →
Community 565

YouTube Transcript

Pull the transcript + chapter timestamps of any YouTube video — no API key required.

0 0 Discuss →
unifapi-agent Discovered 482

skills

AI agent skills for UnifAPI MCP and public-data workflows

0 0 Discuss →
youichi-uda Discovered 455

godot-mcp-pro

162 MCP tools for AI-powered Godot 4 development. Scene, animation, 3D, physics, particles, audio, shader, input simulation, runtime analysis, navigation, testing & more. $15 one-time.

0 0 Discuss →
AliAkhtari78 Discovered 267

SpotifyScraper

Extract public Spotify data — tracks, albums, artists, playlists, podcasts & lyrics — without the official API. Sync + async, typed models, one dependency.

0 0 Discuss →
Community 103

OpenAI Images

Generate + edit images via DALL-E 3 + gpt-image-1 — straight pipe to the OpenAI image API.

0 0 Discuss →
Community 95

Replicate

Run any model hosted on Replicate — image, video, audio, language — via one tool surface.

0 0 Discuss →
Community 2

Fal.ai

Run any model hosted on Fal.ai — fast inference for FLUX, SD, video, audio.

0 0 Discuss →
Community

Whisper Transcribe

Transcribe audio files locally via Whisper — runs whisper.cpp under the hood.

0 0 Discuss →