r/MachineLearning
500 articles archived · Visit source ↗ · RSS
-
r/MachineLearning community 5d ago
Any ideas for unconventional ML projects? [D]
Hey everyone, I'm a stats student and I'm struggling to come up with a personal machine learning project. I just can't seem to find an idea that genuinely sparks my curiosity, and that's usually how I learn best. For example, back when I was learning SQL, I got so obsessed with…
6 -
r/MachineLearning community 5d ago
Xperience-10M Download Help [D]
Hi, I really really need access to Xperience-10M for a deadline which is very soon. https://huggingface.co/datasets/ropedia-ai/xperience-10m Unfortunately, it looks like the owners have stopped approving people to download the dataset. I filled out the form a few weeks ago but…
27 -
r/MachineLearning community 5d ago
High Dimensional, Dynamic Rotary Positional Embedding [P]
At the end of my last post , I presented an idea: what if I used the core of my last project, the cumulative matrix product, and repurposed it as a positional embedding? I just finished fleshing out the math behind HDD-RoPE and training a model with this positional embedding…
31 -
-
-
r/MachineLearning community 6d ago
Will I be desk rejected for this[R]
so I submitted a paper to a conference, and literally went one line on a 2 column submission so literally half a line over the page limit. im really paranoid that this will be a desk rejection.. has anyone ever had this happen before? will it be desk rejected?   submitted by…
18 -
r/MachineLearning community 6d ago
WACV supp. mat. video [R]
Hello, WACV conference submission deadline is by the end of this week, good luck everyone! Does anyone know what the expected format/duration of the video for the supp. mat. is? The guidelines only mention: The supplementary material can be either PDF or ZIP only (maximum…
14 -
r/MachineLearning community 6d ago
Found a potential mistake in an ICLR 2026 blogpost [D]
I think I found a mistake in an ICLR 2026 blog post. I created an issue and have been trying to contact the author and organizers, but I haven't received a response after several weeks. Could anyone please take a look and let me know your thoughts? (I'm just curious and would…
26 -
r/MachineLearning community 7d ago
Recommendations for speech annotation tools [D]
I'm looking for human-in-the-loop platforms that allow you to automatically transcribe audio followed by manually fixing the transcriptions and fine tuning the model. Is there a local (not an online service) installable platform for doing this?   submitted by  …
11 -
r/MachineLearning community 7d ago
About ML research collab group post [D]
Hi, I'm thinking of building a small community of 10-15 people where we can help each other to learn something new. The primary focus will be on ML research and open-source projects. If you're interested, DM me. knowledge of machine learning is a plus, as want to keep this a…
16 -
r/MachineLearning community 7d ago
Some new updates to Papers with Code [P]
Hi folks, Niels here from the open-source team at Hugging Face. I continue working on a revival of paperswithcode.co as we're back to the "age of research" per Ilya Sutskever! Hence, it's important to discover each other's research and build on each other's work, so we can…
38 -
r/MachineLearning community 8d ago
[ECCV 2026] Paper Decision Appeals Discussion [D]
With the release of meta-reviews, ECCV sent out a google form for dissatisfied authors to submit an appeal for the following reasons: Policy errors, e.g., reviewers or Area Chairs applied a policy that does not exist, or reviewers or Area Chairs applied policies that are not…
18 -
r/MachineLearning community 8d ago
An Update on Matrix Recurrent Units, an Attention Alternative [R]
I recently revisited my matrix recurrent units algorithm (the MRU), a novel linear-time sequence architecture I created as an alternative to attention. I explain it in depth at the repo , but the gist is the MRU works by transforming the embedding into an input state matrix,…
29 -
r/MachineLearning community 8d ago
Data-centric debugging for teams training neural nets [P]
We just did a big revamp of WeightsLab and wanted to share it here. If you’ve ever spent hours debugging a training run only to discover it was a data problem all along, this is for you. WeightsLab lets you pause training mid-run, inspect your live loss signals, and catch…
29 -
r/MachineLearning community 8d ago
EMA on LoRA ? [R]
Hi guys Does anyone know of papers where EMA on LoRA adapters has been used successfully? Im interested in cases where the EMA adapter acts as a self-teacher generating soft labels for the trainable adapter. On-policy self-distillation [1] uses ema for the teacher. However, they…
20 -
r/MachineLearning community 8d ago
A slightly improved DVD-JEPA demo [P]
Hey! I came across this post , which I found quite neat as a minimal demonstration of JEPA. However, as the comments pointed out, there was some room for improvement. So I added a few things such as environment noise and a fair* comparison to a pixel-space baseline. I think the…
19 -
r/MachineLearning community 8d ago
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
  submitted by   /u/NonGameCatharsis [link]   [comments]
29 -
-
r/MachineLearning community 9d ago
TSAuditor: A time-series auditing framework [P]
This happened a few months ago when I was working on an analysis project that dealt with time-series data. The dataset was large (10 years of data). I was using a standard profiling tool to check the pipeline. Everything looked fine because the tool reported 3% missing data rate…
29 -
-
-
-
r/MachineLearning community 9d ago
Time Series Modeling Needs a Dynamical Systems Perspective [R]
In our #ICML2026 position paper we argue a dynamical systems perspective is needed to drive time series (TS) modeling forward: https://arxiv.org/abs/2602.16864 Essentially all time series in nature and engineering come from some underlying dynamical system (DS), mostly chaotic…
31 -
r/MachineLearning community 9d ago
Built a Global AQ (PM2.5) Forecaster ML Model [P]
Hey everyone, I’ve been building an end-to-end Air Quality (PM2.5) forecasting pipeline for 4 countries (US, UK, India, Australia) using 1.6M+ rows of OpenAQ and NASA weather data. The problem i hit (the variance trap): My V7 model was a standard stateless Gradient Boosting…
23 -
r/MachineLearning community 10d ago
Best library for releasing my research optimization algorithm? [D]
Hi All! I have developed a research optimizer (QQN Quadratic Quasi-Newton) and published a paper on it where I am able to, but I would really like to make the algorithm itself easily available to the community for evaluation. I have a Rust, Java, and Javascript implementations,…
36 -
r/MachineLearning community 11d ago
Neuron Populations Exhibit Divergent Selectivity with Scale [R]
Hi! We just released a paper where we study “Rosetta Neurons”: universal neurons across different neural networks, and their relationship to scaling laws, specialization, and monosemanticity. Would love to kick off a discussion and get the community's thoughts. Main Findings: We…
11