r/MachineLearning
500 articles archived · Visit source ↗ · RSS
-
r/MachineLearning community 1mo ago
Are ICML workshops worth attending? [D]
Hi! I missed securing a main conference ticket for ICML 2026, as my workshop paper got accepted two days ago. Do you believe that it is worth attending just workshops at such A*-tier conferences (with all the overseas travel costs etc.)? I was quite looking forward to attending…
31 -
r/MachineLearning community 1mo ago
Using large language models [R]
Can LLMs be used to come up with a research topic that's worthwhile? Has anyone had good results in coming up with solid research ideas by chatting with an LLM? Maybe using Claude to review existing work and define the research topic. Thanks!   submitted by  …
24 -
r/MachineLearning community 1mo ago
Sponsio: Deterministic Contract Layer for LLM Agents [P]
We've been trying to put LangGraph agents into production for a while. The thing that kept biting us was tool-call boundary enforcement: stuff like "must call X before Y", "max N retries", "approval gate before destructive action". Worked fine in demos, broke at the moments that…
31 -
r/MachineLearning community 1mo ago
Please help with tensor dock [d]
Anyone have any idea what I should do. This is my email to tensor dock. I developed corporate GPU benchmarking software so I need a cloud PC that can benchmark 5090 Consumer cards and 4090 Consumer cards. It worked absolutely amazing for six hours yesterday on the 4090 full…
28 -
r/MachineLearning community 1mo ago
"AI solved one of math's greatest challenges, but it cannot add two numbers reliably?!" [D]
Suppose your friend, a mathematician, woke up from a 5-year coma. How would you explain this to him? Do we even have an explanation other than "it is what it is"?   submitted by   /u/we_are_mammals [link]   [comments]
26 -
-
r/MachineLearning community 1mo ago
PapersWithCode new features - week 1 [P]
Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co , a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…
23 -
r/MachineLearning community 1mo ago
Expedia ML Scientist II interview experience anyone ? [D]
I have an Initial Technical Screen interview (45 Mins) coming up for ML Scientist II: Agentic AI role, and wanted to know what to expect. Would really appreciate any info. Haven't found much information on this interview experience. Thanks!   submitted by  …
27 -
-
-
r/MachineLearning community 1mo ago
pipeline is really slow - consulting [D]
Hi, after a long debugging process and many discussions, I wanted to ask for advice from people who may have encountered similar training bottlenecks. My goal is imitation learning for robotics. Model / Pipeline Observation space: 4 RGB robot cameras image resolution: 128x128x3…
25 -
r/MachineLearning community 1mo ago
AgentLantern: exposing the hidden graph of AI agent projects [P]
AI agent frameworks make it easy to create agents, tasks, tools, and workflows. But as soon as a project grows beyond a few agents, the real execution graph becomes difficult to understand. The issue : agent projects often hide their structure across code, YAML files, tool…
7 -
r/MachineLearning community 1mo ago
Hebbian architecture AI model [R]
Hello , for some time now i have been hooked on a side project after work hours, these are the results for a Hebbian architecture AI model. The model does not use backpropagation or gradients, the substrate started as a 1000k neuron and scaled to 100k between versions. The…
31 -
r/MachineLearning community 1mo ago
Alignment: Higher order prioritizing over constraints [R]
So, I ran across a behavior that I found interesting and may lead to alignment or safety research. I'm going to try to maintain an abstract description of what happened without giving away the details and the keys to jailbreaking. The nature of a transformer is to predict the…
25 -
-
r/MachineLearning community 1mo ago
Tested chunking + embeddings data from 3 production websites. [P]
Tiered + page-role-aware RAG retrieval results across 3 corpora with very different content density: Workspace Sources Chunks HIGH MEDIUM LOW REJECTED Intercom 188 941 96 200 541 104 HubSpot 251 1705 40 508 1153 4 KPMG 53 209 3 14 127 65 (HIGH = avg operational score 0.84,…
19 -
r/MachineLearning community 1mo ago
LLMs are just giant probability machines pretending to think [P]
It’s fascinating that simple mathematics between tokens can eventually become a machine that writes essays, code, poetry, and even reasoning. We usually think probability means uncertainty. But LLMs show something strange: If probability + context + mathematical matching are…
36 -
r/MachineLearning community 1mo ago
Anonymous Data Upload for Submission [D]
How do you upload data anonymously for a submission (ACL/EMNLP)? I have several models I need to upload for replication and was thinking HuggingFace, but HF offers download tracking on a paid plan. Does this violate the policy since there is the potential of tracking the…
18 -
r/MachineLearning community 1mo ago
Could ML be used to automate C-suite organizational duties? [D]
We often see worry from workers that ML techniques will either fully replace them, or jostle them violently economically such that their earnings and well-being are impacted. Concurrently, many tech companies resist unionization/"guild" efforts to protect the careers of…
11 -
r/MachineLearning community 1mo ago
Custom image encoder [P]
Hello, I would like to know whether building my own image encoder would be a good idea instead of using models like CLIP, SigLIP/SigLIP2, or DINO. My use case is video frame classification. My pipeline is the following: the client sends me a video stream, sampled at 1 frame per…
5 -
r/MachineLearning community 1mo ago
COLM 2026 ReviewsDiscussion [D]
Didn't see one so wanted to make one myself. Reviews are actually already out, curious what everyone thinks about the quality of the reviews? I've heard it's a mixed bag and apparently a concerning amount of AI generated reviews for some people.   submitted by  …
33 -
-
r/MachineLearning community 1mo ago
Live Human Detector on Outbound Phone Calls [R]
Goal To save humans wasting time sitting in Call Centre queues waiting to be answered To have tool listen in on the audio stream of a live call, post IVR Navigation - to determine whether the call has transitioned out of the queue and to a live person. Requirements The tool must…
20 -
r/MachineLearning community 1mo ago
Novel Problems in VLA [R]
I'm currently doing a research internship and my supervisor is constantly pushing me to have a novel idea, I've read about 15-20 papers about VLA and I think that most of the things are saturated, I thought about an equivariant VLA based on equivariant CNN which was published in…
21 -
r/MachineLearning community 1mo ago
Lisbon Machine Learning School (LxMLS 2026) [D]
Hi did anyone apply it, or attended it previously? How was the experience? I got the acceptance but no scholarship, is it worth going self sponsored?   submitted by   /u/Icy-Solid-4159 [link]   [comments]
21 -
r/MachineLearning community 1mo ago
Does this idea sound fun? [R]
It's about inference-time learning by inserting some experts specialized for updating sibling expert weights in MoE. All the components needed were already there, but no one tried it inside MoE, so I did a small PoC. It kinda worked. I'd love to hear what you think.…
33 -
r/MachineLearning community 1mo ago
Columbia Machine Learning Summer School (MLSS) 2026 [D]
I got into this CFE MLSS 2026 and would like to connect with people who also got into it or have been in previous cohorts! I am organizing a group chat for people who got into the program :DD https://cfe.columbia.edu/content/mlss   submitted by   /u/elucidativemind…
24 -
r/MachineLearning community 1mo ago
High E2E latency on fine-tuned Gemma 4 26B despite low TTFT [R]
Recently fine-tuned a Gemma 4 26B model, and I’m seeing surprisingly high end-to-end latency despite the effective inference footprint being much smaller (~4B-ish behavior during serving). Current setup: Model: Gemma 4 26B (fine-tuned) Engine: vLLM Quantization: FP8 Hardware:…
27 -
r/MachineLearning community 1mo ago
OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound [D]
OpenAI posted a math result today claiming that one of its general-purpose reasoning models found a construction disproving the conjectured n^{1+O(1/log log n)} upper bound in Erdős’s planar unit-distance problem. Announcement:…
31 -
r/MachineLearning community 1mo ago
LLMs and Emojis [D]
LLMs are trained on human data, so where does the tendency to add emojis come from? For example, when some models generate code explanations or even normal responses, they often add lots of emojis that people don’t really use that way in real life. My current guess (without…
33 -
r/MachineLearning community 1mo ago
How competitive are PhD admissions currently [D]
Hi, how hard is it currently to get a PhD position in machine Learning? Like what are the requirements to get to a decent mid tier program (= they publish regularly at respected journals and their work gets read my some people)? How is it in different regions e.g US, Europe,…
10