Open source : Turning vocal imitations into sound effects. (New UX for sound generation)
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| Hello guys I want to introduce my new project! Have you ever needed a specific sound while making a video or a game? You know exactly what it sounds like in your head, but have no idea how to search for it. That’s why sound design meetings at game studios often turn into people making noises with their mouths. “Not pewpew… more like pew↘︎pew↘︎.” That’s what inspired this project! It’s a model that lets you imitate a sound with your voice, then uses that vocal imitation together with text as input to generate the sound you actually want. repo: https://github.com/thxxx/VTS (You’ll get a better sense of it if you check out the demo in the repo. Would love to hear your feedback in the comments.) [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.