Furiosa AI selling its inference chip to the consumer market would be a game changer for local LLMs
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
This is a South Korean startup going all-in on inference chips: https://furiosa.ai/renegade-spec

- TSMC 5nm node
- Hynix HBM3, 1.5 TB/s
- 48GB VRAM
- 180W TDP
- Already tested on LG's LLM

If they opened their programming interface the way NVIDIA does with PTX and Intel does with SPIR-V, and teamed up with llama.cpp to get a GGML backend working, it would be a game changer.

For comparison, current prices:

- RTX Pro 5000 48GB (non-HBM): $5k
- AMD R9700 32GB: $1.3k
- Intel B70 32GB: $1k

I bet that if their RNGD chip is priced right, it will get record sales given that memory bandwidth, VRAM, and TDP. At $2.5k a card I'd buy one in a heartbeat, provided they get llama.cpp running as well as Vulkan does on AMD. Heck, I'd buy it even if it ran like the SYCL backend on the Intel B70 and got only 40% of theoretical TG (token generation) speed. That would still be faster than TG on AMD with Vulkan.

Edit: they are not selling to the consumer market. I'm hoping they will, because it would be a game changer for local LLMs.
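For anyone who wants to check the numbers, here is a rough back-of-envelope sketch of the "theoretical TG speed" reasoning. It assumes token generation is purely memory-bandwidth bound, so each generated token streams all model weights from VRAM once, and it ignores KV-cache reads and compute. The 20 GB model size and the `theoretical_tg_tokens_per_s` and `price_per_gb` helpers are made up for illustration. The $2.5k RNGD price is only the figure wished for above, not a real price.

```python
def theoretical_tg_tokens_per_s(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on decode speed, assuming every token reads all weights once."""
    return bandwidth_gb_s / model_size_gb


def price_per_gb(price_usd: float, vram_gb: float) -> float:
    """Dollars per GB of VRAM."""
    return price_usd / vram_gb


RNGD_BANDWIDTH_GB_S = 1500.0  # 1.5 TB/s HBM3, from the spec list in the post
MODEL_SIZE_GB = 20.0          # hypothetical quantized model size (assumption)
EFFICIENCY = 0.40             # the "40% of theoretical" case from the post

peak = theoretical_tg_tokens_per_s(RNGD_BANDWIDTH_GB_S, MODEL_SIZE_GB)
realistic = peak * EFFICIENCY

print(f"theoretical TG: {peak:.1f} tok/s, at 40% efficiency: {realistic:.1f} tok/s")

# (price in USD, VRAM in GB); prices as quoted in the post.
cards = {
    "RTX Pro 5000 48GB": (5000.0, 48.0),
    "AMD R9700 32GB": (1300.0, 32.0),
    "Intel B70 32GB": (1000.0, 32.0),
    "RNGD 48GB (hoped-for price)": (2500.0, 48.0),  # hypothetical price
}

for name, (price, vram) in cards.items():
    print(f"{name}: ${price_per_gb(price, vram):.2f} per GB of VRAM")
```

Swap in the size of whatever GGUF you actually run for `MODEL_SIZE_GB`. The bandwidth-bound estimate is a ceiling, and real backends land somewhere below it.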