r/LocalLLaMA · · 1 min read

Furiosa AI selling inference chip to consumer market will be a game changer to local llm

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Furiosa AI selling inference chip to consumer market will be a game changer to local llm

This is south Korean start up all-in on inference chip:

https://furiosa.ai/renegade-spec

Tsmc 5nm node

Hynix HBM3 1.5TB/s 48GB VRAM

TDP 180W

Already tested on LG LLM.

If they opened their programming interface the way NVIDIA opens PTX and Intel opens SPIR-V, and team up with llama.cpp for getting a GGML backend working, it would be a game changer.

Rtx pro 5000 48gb (non-hbm) is $5k now.

Amd's r9700 32gb is $1.3k

Intel B70 32gb is $1k

I bet if their RNGD chip is priced right—with that memory BW, VRAM, and TDP—they will get record sales at this rate.

For $2.5k a card I'll certainly buy one in a heartbeat, if they get llama.cpp runs as well as vulkan on AMD. Heck, i'd buy it even it runs like intel B70 SYCL backend and get 40% of theoretical TG speed. That's still better than AMD vulkan TG.

Edit: they are not selling to the consumer market. I'm hoping that they would, bc it will be a game changer to local llm.

submitted by /u/siegevjorn
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA