Releasing Cohere North Mini Code
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| Hi folks! Jay here from Cohere. we just officially launched North Mini Code after getting some great feedback from you guys this weekend on the unreleased version. I wanted to come here and answer some of the questions you asked and provide some extra detail about the model itself. You can download the weights on Hugging Face (fp8 here) or try it on OpenCode for free. if you want to read more about what I mentioned in the video, feel free to look at our technical blog post on HuggingFace as well as the announcement post! If you're deploying with vllm, please use vLLM main for North Mini Code until a new release is available, and accurate response parsing also requires installing Cohere’s melody library. Then the vllm server can be started with the following command: A couple of PRs were pushed to make this work better based on your feedback. Useful tidbits from the previous post:
if you have any questions or feedback, don't hesitate. We're really interested in seeing your builds and any problems you run into so we can build even better models for devs in the future. Really excited to hear what you think! Thanks again for all your help on this. [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.