r/LocalLLaMA · · 3 min read

[NEW MODEL] SupraLabs just released a new model! - Supra-50M-Reasoning

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

SupraLabs just released a new model! - Supra-50M-Reasoning

Hello again r/LocalLLaMA! Supra-50M-Reasoning (ThinkSupra-50M) is the reasoning version of Supra-50M-Instruct. It produces a full thinking chain before every answer, fine-tuned from Supra-50M-Base using a custom synthetic dataset of 500 samples generated by Qwen3 1.7B, trained for 6 epochs. It's experimental, it hallucinates, and it's fully open. This is part of the Supra-50M collection under Project Chimera.

Model: 🤗 Supra-50M-Reasoning

Dataset: SupraThink-Dataset-500x

What's coming next?

Supra-124M — Base, Chat, Reasoning

Supra-350M — Base, Chat, Reasoning, Coding

🧠 Answer Structure

Every answer follows this format:

<|begin_of_thought|> ... thinking ... <|end_of_thought|> <|begin_of_solution|> ... final answer ... <|end_of_solution|> 

⚙️ Training Setup

Parameter Value
Base model Supra-50M-Instruct
Dataset SupraThink-Dataset-500x (500 samples)
Generated by Qwen3 1.7B
Epochs 6
Type Supervised Fine-Tuning (SFT)
Precision bfloat16

🚀 Inference

import os, warnings os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3" warnings.filterwarnings("ignore", category=UserWarning, module="transformers") import torch from transformers import pipeline, AutoTokenizer, logging logging.set_verbosity_error() MODEL_ID = "SupraLabs/Supra-50M-Reasoning" tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, clean_up_tokenization_spaces=False) pipe = pipeline( "text-generation", model=MODEL_ID, tokenizer=tokenizer, device_map="auto", torch_dtype=torch.bfloat16 if torch.cuda.is_available() else torch.float32 ) def build_prompt(instruction, input_text=""): if input_text.strip(): return ( "Below is an instruction that describes a task, paired with an input " "that provides further context. Write a response that appropriately " "completes the request.\n\n" f"### Instruction:\n{instruction}\n\n" f"### Input:\n{input_text}\n\n### Response:\n" ) return ( "Below is an instruction that describes a task. Write a response that " "appropriately completes the request.\n\n" f"### Instruction:\n{instruction}\n\n### Response:\n" ) def generate(instruction, input_text=""): result = pipe( build_prompt(instruction, input_text), max_new_tokens=512, do_sample=True, temperature=0.3, top_k=50, top_p=0.9, repetition_penalty=1.15, pad_token_id=pipe.tokenizer.pad_token_id, eos_token_id=pipe.tokenizer.eos_token_id, return_full_text=False ) return result[0]['generated_text'].strip() while True: print("\nEnter an instruction (or 'exit' to quit):") user_input = input().strip() if user_input.lower() == "exit": break print("\nEnter additional context (optional, press Enter to skip):") context_input = input().strip() print(f"\nResponse:\n{generate(user_input, context_input)}\n") 

💬 Sample Outputs

Prompt: "What is AI?"

🧠 Thinking

Okay, the user is asking about AI. Let me start by recalling what AI is. AI is a subset of machine learning, specifically neural networks. It's a branch of computer science that focuses on enabling machines to learn from data. The main goal is to create intelligent machines that can perform tasks that would normally require human intelligence. I need to make sure I'm being clear. AI is a broad field with many different applications. For example, AI-powered virtual assistants like Siri or Alexa. Also, AI is used in healthcare, finance and even in the field of robotics. I should mention the key concepts, such as data, algorithms and the potential applications. I should also mention the potential challenges, such as bias and lack of human intervention. Be concise, under 150 words. 

✅ Response

AI is a subset of machine learning that focuses on enabling machines to learn from data. It involves training machines to recognize patterns and make decisions based on data. AI is used in healthcare, finance and even in the field of robotics. Its applications include virtual assistants like Siri or Alexa, and its potential for automation and training. 

Prompt: "What is the meaning of life?"

🧠 Thinking

Okay, the user is asking about the meaning of life. Let me start by recalling what life is. It's about the human experience, like the choices we make, the relationships we have with others and the choices we make. I need to make sure I cover the main points: the human experience, the human mind, the role of our emotions and the experiences we have. Keep concise, under 150 words. 

✅ Response

Life is about the human experience, the human mind and the experiences we have. The human mind is a complex interplay of emotions, thoughts and experiences, shaped by the experiences we have. Final answer: Life is about the human experience, the human mind and the experiences we have. 

⚠️ THE MODEL STILL HALLUCINATES AND ANSWERS WILL SOMETIMES BE INCORRECT ⚠️

First reasoning model from SupraLabs. It's experimental and chaotic, but the thinking chains are real. Feedback welcome!

submitted by /u/Dangerous_Try3619
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA