r/MachineLearning

500 articles archived · Visit source ↗ · RSS

r/MachineLearning community 1mo ago

Are ICML workshops worth attending? [D]

Hi! I missed securing a main conference ticket for ICML 2026, as my workshop paper got accepted two days ago. Do you believe that it is worth attending just workshops at such A*-tier conferences (with all the overseas travel costs etc.)? I was quite looking forward to attending…

31
r/MachineLearning community 1mo ago

Using large language models [R]

Can LLMs be used to come up with a research topic that's worthwhile? Has anyone had good results in coming up with solid research ideas by chatting with an LLM? Maybe using Claude to review existing work and define the research topic. Thanks!   submitted by  …

24
r/MachineLearning community 1mo ago

Call for Papers - Workshop on Unlearning and Model Editing U&ME at ECCV 2026 [R]

I have been seeing a lot of really interesting work lately around unlearning, model editing, controllability, safety, etc. Feels like this space is moving very fast right now, and there are still so many open questions. This year I’m helping organize the U&ME workshop at ECCV…

27
r/MachineLearning community 1mo ago

If you use NVIDIA Isaac Sim for reinforcement learning, do you use Isaac Lab with it? Just want to get a sense of what the status quo is. [D]

The reason for this query is that I am in the process of shifting to Isaac Sim / Isaac Lab since that is what seems to be in use nowadays. However, Isaac Lab is proving to be somewhat difficult to handle. While it handles the logging, and the creation of multi-actor systems for…

5
r/MachineLearning community 1mo ago

Sponsio: Deterministic Contract Layer for LLM Agents [P]

We've been trying to put LangGraph agents into production for a while. The thing that kept biting us was tool-call boundary enforcement: stuff like "must call X before Y", "max N retries", "approval gate before destructive action". Worked fine in demos, broke at the moments that…

31
r/MachineLearning community 1mo ago

Please help with tensor dock [d]

Anyone have any idea what I should do. This is my email to tensor dock. I developed corporate GPU benchmarking software so I need a cloud PC that can benchmark 5090 Consumer cards and 4090 Consumer cards. It worked absolutely amazing for six hours yesterday on the 4090 full…

28
r/MachineLearning community 1mo ago

"AI solved one of math's greatest challenges, but it cannot add two numbers reliably?!" [D]

Suppose your friend, a mathematician, woke up from a 5-year coma. How would you explain this to him? Do we even have an explanation other than "it is what it is"?   submitted by   /u/we_are_mammals [link]   [comments]

26
r/MachineLearning community 1mo ago

MergeNB: An intuitive merge conflict resolver built for Jupyter notebooks in VS Code [P]

I used to work heavily with Jupyter Notebooks + git + VS Code in a collaborative research setting and found nbdime to be somewhat buggy/a hassle to work with in general. So, in typical side project fashion ( relevant xkcd ) I've been working on MergeNB quite a bit over the last…

31
r/MachineLearning community 1mo ago

How do ML practitioners select hyperparameters, architectures, etc for self-supervised representation learning when the loss is non-monotonic? [D]

Non-contrastive SSL methods like BYOL/JEPA/data2vec seem promising, but I have no idea what is being learned, or how well; it’s models all the way down. Maybe I’ve got supervised tasks for which I’d like to see transfer, and I can evaluate linear probe/KNN results during…

26
r/MachineLearning community 1mo ago

Thermocompute constant time inference [P]

I invented thermocompute! It makes machine learning super fast!   submitted by   /u/arcco96 [link]   [comments]

30
r/MachineLearning community 1mo ago

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

At our work we use CUDA in Rust since the company switched to it recently. Rust has pretty good Driver API bindings but it made me wonder why the hell we cant have something decent in Go without cgo. I mostly build ML tools in the last month and Go is my main language for pretty…

30
r/MachineLearning community 1mo ago

PapersWithCode new features - week 1 [P]

Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co , a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…

23
r/MachineLearning community 1mo ago

Expedia ML Scientist II interview experience anyone ? [D]

I have an Initial Technical Screen interview (45 Mins) coming up for ML Scientist II: Agentic AI role, and wanted to know what to expect. Would really appreciate any info. Haven't found much information on this interview experience. Thanks!   submitted by  …

27
r/MachineLearning community 1mo ago

Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA [D]

I benchmarked vision-capable LLMs (the "just attach the PDF and let the model read it" pattern) against OCR-based pipelines on 30 long, image-heavy PDFs from MMLongBench-Doc ( https://github.com/mayubo2333/MMLongBench-Doc ). There were 171 questions in total, using Claude Sonnet…

9
r/MachineLearning community 1mo ago

Per-pixel bounding-box regression + DBSCAN for handwritten word detection - visual walkthrough of WordDetectorNet [P]

Overview of WordDetectorNN architecture. Sharing a visual breakdown of WordDetectorNet, Harald Scheidl's handwritten-word detection model. I think the design choice at its core is unusual enough to be worth a closer look - and I haven't seen it written up in detail anywhere…

26
r/MachineLearning community 1mo ago

I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]

Tested three formats: chat demos, first-person statements ("I am C-3PO..."), and synthetic Wikipedia-style docs. Same model, same LoRA config, 500 examples each. First-person statements won on generalization, which I didn't expect. The synthetic doc model was the weirdest…

6
r/MachineLearning community 1mo ago

pipeline is really slow - consulting [D]

Hi, after a long debugging process and many discussions, I wanted to ask for advice from people who may have encountered similar training bottlenecks. My goal is imitation learning for robotics. Model / Pipeline Observation space: 4 RGB robot cameras image resolution: 128x128x3…

25
r/MachineLearning community 1mo ago

AgentLantern: exposing the hidden graph of AI agent projects [P]

AI agent frameworks make it easy to create agents, tasks, tools, and workflows. But as soon as a project grows beyond a few agents, the real execution graph becomes difficult to understand. The issue : agent projects often hide their structure across code, YAML files, tool…

7
r/MachineLearning community 1mo ago

Hebbian architecture AI model [R]

Hello , for some time now i have been hooked on a side project after work hours, these are the results for a Hebbian architecture AI model. The model does not use backpropagation or gradients, the substrate started as a 1000k neuron and scaled to 100k between versions. The…

31
r/MachineLearning community 1mo ago

Alignment: Higher order prioritizing over constraints [R]

So, I ran across a behavior that I found interesting and may lead to alignment or safety research. I'm going to try to maintain an abstract description of what happened without giving away the details and the keys to jailbreaking. The nature of a transformer is to predict the…

25
r/MachineLearning community 1mo ago

Is personalized AI memory actually a problem worth solving or am I just coping[D]

genuine question for this community every time i use claude or chatgpt i have to re-explain myself. and even their memory feature is shallow it remembers facts about me, not how i actually think. the idea i've been sitting on is different from just "memory across sessions." what…

8
r/MachineLearning community 1mo ago

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice , an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

6
r/MachineLearning community 1mo ago

I built a Mamba1 variant I call SM1 with d_state=1 that runs on Blackwell in pure PyTorch [P]

On windows mamba-ssm is not easily available and doesn't compile on sm_120. SM1 (Scalar Mamba1) replaces the entire selective scan with two native PyTorch ops: L = torch.cumprod(dA, dim=1) h = L * (h0.unsqueeze(1) + torch.cumsum(dBx / L.clamp(min=1e-6), dim=1)) y = h * C This is…

21
r/MachineLearning community 1mo ago

Tested chunking + embeddings data from 3 production websites. [P]

Tiered + page-role-aware RAG retrieval results across 3 corpora with very different content density: Workspace Sources Chunks HIGH MEDIUM LOW REJECTED Intercom 188 941 96 200 541 104 HubSpot 251 1705 40 508 1153 4 KPMG 53 209 3 14 127 65 (HIGH = avg operational score 0.84,…

19
r/MachineLearning community 1mo ago

LLMs are just giant probability machines pretending to think [P]

It’s fascinating that simple mathematics between tokens can eventually become a machine that writes essays, code, poetry, and even reasoning. We usually think probability means uncertainty. But LLMs show something strange: If probability + context + mathematical matching are…

36
r/MachineLearning community 1mo ago

Anthropic posted a profit while xAI burned $4.2B. The AI profitability numbers finally leaked.[D]

This week basically forced everyone to stop guessing about AI margins. Three major financial reality checks hit at once: OpenAI confidentially filing their S-1, xAI’s Q1 numbers leaking via SpaceX, and Anthropic somehow posting an actual operating profit. If you are building an…

4
r/MachineLearning community 1mo ago

LQS v3.1 — an open methodology for rating AI training data (multi-oracle consensus + signed certificates) [P]

Solo author here. I spent the last six months building (and then sunsetting) a marketplace for AI training data. The marketplace failed for an interesting reason: the actual bottleneck isn't supply. There's tons of data. The bottleneck is that buyers can't independently evaluate…

14
r/MachineLearning community 1mo ago

Anonymous Data Upload for Submission [D]

How do you upload data anonymously for a submission (ACL/EMNLP)? I have several models I need to upload for replication and was thinking HuggingFace, but HF offers download tracking on a paid plan. Does this violate the policy since there is the potential of tracking the…

18
r/MachineLearning community 1mo ago

Looking for arXiv endorsement + sharing a preprint on homeostatic cognitive architecture for AI companions [R]

Hey r/ML — I just posted a preprint on SSRN for PHI // DRIFT, a cognitive architecture that gives an AI companion persistent internal state, salience-weighted memory retrieval, and a falsifiable continuity metric (PEDI). Ablation testing confirmed the DMU memory system injects…

13
r/MachineLearning community 1mo ago

Could ML be used to automate C-suite organizational duties? [D]

We often see worry from workers that ML techniques will either fully replace them, or jostle them violently economically such that their earnings and well-being are impacted. Concurrently, many tech companies resist unionization/"guild" efforts to protect the careers of…

11
r/MachineLearning community 1mo ago

Custom image encoder [P]

Hello, I would like to know whether building my own image encoder would be a good idea instead of using models like CLIP, SigLIP/SigLIP2, or DINO. My use case is video frame classification. My pipeline is the following: the client sends me a video stream, sampled at 1 frame per…

5
r/MachineLearning community 1mo ago

COLM 2026 ReviewsDiscussion [D]

Didn't see one so wanted to make one myself. Reviews are actually already out, curious what everyone thinks about the quality of the reviews? I've heard it's a mixed bag and apparently a concerning amount of AI generated reviews for some people.   submitted by  …

33
r/MachineLearning community 1mo ago

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

Disclaimer: I work for Numind, the company behind this open-weight model We just released a 4B model based on Qwen3.5-4B, under Apache-2.0 license. The goal is to make information extraction from complex documents more practical with an open model: PDFs, screenshots, forms,…

12
r/MachineLearning community 1mo ago

One thing that's been bothering me lately: benchmark performance often tells me almost nothing about whether a workflow will survive production usage.[D]

I've seen systems score well internally and then immediately fail under: ambiguous user intent messy real-world context contradictory instructions long-running sessions Feels like evaluation still heavily rewards clean-task optimization instead of behavioral robustness. What are…

26
r/MachineLearning community 1mo ago

Live Human Detector on Outbound Phone Calls [R]

Goal To save humans wasting time sitting in Call Centre queues waiting to be answered To have tool listen in on the audio stream of a live call, post IVR Navigation - to determine whether the call has transitioned out of the queue and to a live person. Requirements The tool must…

20
r/MachineLearning community 1mo ago

Novel Problems in VLA [R]

I'm currently doing a research internship and my supervisor is constantly pushing me to have a novel idea, I've read about 15-20 papers about VLA and I think that most of the things are saturated, I thought about an equivariant VLA based on equivariant CNN which was published in…

21
r/MachineLearning community 1mo ago

Can liveness detection models generalise to synthetic media generation techniques they were never trained on? [D]

Most liveness detection systems in production today were built around a threat model where the attacker is submitting a static image or a basic replay video. The generation quality of current synthetic media is categorically different from what those training datasets captured.…

32
r/MachineLearning community 1mo ago

using .npy dataset with 3D models [R]

Hello guys , i am trying to work on ADNI dataset to get 90% accuracy , but it keeps getting stuck at 55%. any tip to improve results ?   submitted by   /u/LahmeriMohamed [link]   [comments]

4
r/MachineLearning community 1mo ago

Lisbon Machine Learning School (LxMLS 2026) [D]

Hi did anyone apply it, or attended it previously? How was the experience? I got the acceptance but no scholarship, is it worth going self sponsored?   submitted by   /u/Icy-Solid-4159 [link]   [comments]

21
r/MachineLearning community 1mo ago

I created an LLM post-training method called RPS. Preliminary results show that it improved Qwen3-8b's program synthesis reliability. [R]

RPS is inspired by neuroscience. As humans, we learn basic skills as kids with high neuro-plasticity. We then learn advanced skills as teens and adults with low neuro-plasticity. RPS trains a model in 2 stages. In stage 1, the model is trained on easy data with high learning…

26
r/MachineLearning community 1mo ago

Does this idea sound fun? [R]

It's about inference-time learning by inserting some experts specialized for updating sibling expert weights in MoE. All the components needed were already there, but no one tried it inside MoE, so I did a small PoC. It kinda worked. I'd love to hear what you think.…

33
r/MachineLearning community 1mo ago

Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]

The research community has provided (already for some time) seemingly more efficient and effective tokenizations for vision. Do we have any hint on whether non-fixed-patches tokenization is being applied on the big player models? I imagine not, and I'm trying to think why: -…

7
r/MachineLearning community 1mo ago

Looking for real world comparisons between WALL OSS pi0.6 and OpenVLA[D]

I am choosing a baseline for a real manipulation stack and trying not to lose a month on setup that someone here has already done. Shortlist is OpenVLA, pi0.6, and WALL OSS from X Square Robot. OpenVLA is still the easiest reference point with lots of reproductions. pi0.6 looks…

21
r/MachineLearning community 1mo ago

Columbia Machine Learning Summer School (MLSS) 2026 [D]

I got into this CFE MLSS 2026 and would like to connect with people who also got into it or have been in previous cohorts! I am organizing a group chat for people who got into the program :DD https://cfe.columbia.edu/content/mlss   submitted by   /u/elucidativemind…

24
r/MachineLearning community 1mo ago

High E2E latency on fine-tuned Gemma 4 26B despite low TTFT [R]

Recently fine-tuned a Gemma 4 26B model, and I’m seeing surprisingly high end-to-end latency despite the effective inference footprint being much smaller (~4B-ish behavior during serving). Current setup: Model: Gemma 4 26B (fine-tuned) Engine: vLLM Quantization: FP8 Hardware:…

27
r/MachineLearning community 1mo ago

Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R]

Autoregressive LLM world models factorize next-state generation left-to-right, preventing them from conditioning on globally interdependent anchors (tool schemas, trailing status fields, expected outcomes) and yielding prefix-consistent but globally incoherent rollouts. MDLMs'…

28
r/MachineLearning community 1mo ago

l9gpu - open-source GPU observability with workload-level attribution [P]

GPU monitoring tools like DCGM give you hardware-level metrics but no workload context. When a node is saturated, you can't tell which experiment, team, or job is responsible without digging through logs. We built l9gpu to close that gap. It's a node-level agent that exports GPU…

25
r/MachineLearning community 1mo ago

OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound [D]

OpenAI posted a math result today claiming that one of its general-purpose reasoning models found a construction disproving the conjectured n^{1+O(1/log log n)} upper bound in Erdős’s planar unit-distance problem. Announcement:…

31
r/MachineLearning community 1mo ago

LLMs and Emojis [D]

LLMs are trained on human data, so where does the tendency to add emojis come from? For example, when some models generate code explanations or even normal responses, they often add lots of emojis that people don’t really use that way in real life. My current guess (without…

33
r/MachineLearning community 1mo ago

How competitive are PhD admissions currently [D]

Hi, how hard is it currently to get a PhD position in machine Learning? Like what are the requirements to get to a decent mid tier program (= they publish regularly at respected journals and their work gets read my some people)? How is it in different regions e.g US, Europe,…

10

Are ICML workshops worth attending? [D]

Using large language models [R]

Call for Papers - Workshop on Unlearning and Model Editing U&ME at ECCV 2026 [R]

If you use NVIDIA Isaac Sim for reinforcement learning, do you use Isaac Lab with it? Just want to get a sense of what the status quo is. [D]

Sponsio: Deterministic Contract Layer for LLM Agents [P]

Please help with tensor dock [d]

"AI solved one of math's greatest challenges, but it cannot add two numbers reliably?!" [D]

MergeNB: An intuitive merge conflict resolver built for Jupyter notebooks in VS Code [P]

How do ML practitioners select hyperparameters, architectures, etc for self-supervised representation learning when the loss is non-monotonic? [D]

Thermocompute constant time inference [P]

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

PapersWithCode new features - week 1 [P]

Expedia ML Scientist II interview experience anyone ? [D]

Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA [D]

Per-pixel bounding-box regression + DBSCAN for handwritten word detection - visual walkthrough of WordDetectorNet [P]

I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]

pipeline is really slow - consulting [D]

AgentLantern: exposing the hidden graph of AI agent projects [P]

Hebbian architecture AI model [R]

Alignment: Higher order prioritizing over constraints [R]

Is personalized AI memory actually a problem worth solving or am I just coping[D]

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

I built a Mamba1 variant I call SM1 with d_state=1 that runs on Blackwell in pure PyTorch [P]

Tested chunking + embeddings data from 3 production websites. [P]

LLMs are just giant probability machines pretending to think [P]

Anthropic posted a profit while xAI burned $4.2B. The AI profitability numbers finally leaked.[D]

LQS v3.1 — an open methodology for rating AI training data (multi-oracle consensus + signed certificates) [P]

Anonymous Data Upload for Submission [D]

Looking for arXiv endorsement + sharing a preprint on homeostatic cognitive architecture for AI companions [R]

Could ML be used to automate C-suite organizational duties? [D]

Custom image encoder [P]

COLM 2026 ReviewsDiscussion [D]

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

One thing that's been bothering me lately: benchmark performance often tells me almost nothing about whether a workflow will survive production usage.[D]

Live Human Detector on Outbound Phone Calls [R]

Novel Problems in VLA [R]

Can liveness detection models generalise to synthetic media generation techniques they were never trained on? [D]

using .npy dataset with 3D models [R]

Lisbon Machine Learning School (LxMLS 2026) [D]

I created an LLM post-training method called RPS. Preliminary results show that it improved Qwen3-8b's program synthesis reliability. [R]

Does this idea sound fun? [R]

Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]

Looking for real world comparisons between WALL OSS pi0.6 and OpenVLA[D]

Columbia Machine Learning Summer School (MLSS) 2026 [D]

High E2E latency on fine-tuned Gemma 4 26B despite low TTFT [R]

Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R]

l9gpu - open-source GPU observability with workload-level attribution [P]

OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound [D]

LLMs and Emojis [D]

How competitive are PhD admissions currently [D]