r/MachineLearning

17 articles archived · Visit source ↗ · RSS

r/MachineLearning community 2h ago

What kinds of models are people training with document data? [P]

We've helped some folks with synthetic data for a number of different projects and some of them for "document data". Like annotated PDFs, PNGs. Tax forms, health forms. Especially things with PII that are hard to get because of obvious privacy concerns. So, we came up with an…

30
r/MachineLearning community 2h ago

Have the "on-hold" durations been getting longer for arXiv submissions? [D]

I have a paper that has been "on-hold" for about 2 weeks now. I understand that it might take a little longer now because of inundation of AI generated low-effort papers but my papers have gone from "on-hold" to "submitted" within a couple of days in the past. Wondering if…

13
r/MachineLearning community 2h ago

Image generation models running locally on limited resources [P]

I have a project consisting of generating high quality free ebook covers out of its content. On my 16GB of ram machine with no gpu, i have tested the opensourced stable diffusion models without any success. All return bad quality covers with blurred faces and scenes that do not…

6
r/MachineLearning community 4h ago

EEML Summer School (Eastern European ML) - Anyone here got accepted? [D]

Has anyone got into EEML Summer School in Montenegro? I did and please feel free to DM to manage stay or other plans after the summer school. I see that it's tricky to get there and find a stay.   submitted by   /u/ade17_in [link]   [comments]

29
r/MachineLearning community 6h ago

Best examples of ML projects with good dataset/task code abstractions? [D]

I am working on a benchmark and need to manage several interlocking components: datasets and metadata, diverse ML tasks (varying inputs and outputs), and baseline experiments covering models, training, and evaluations. Any pointers to projects that handle these through…

4
r/MachineLearning community 6h ago

Human-level performance via ML was *not* proven impossible with complexity theory [D]

Van Rooij, Guest, Adolfi, Kolokolova, and Rich claimed to have proven that AGI via ML is impossible in Computational Brain & Behavior in 2024. The basic idea was to try to reduce a known NP-hard problem to the problem of learning a human-level classifier from data. The purported…

17
r/MachineLearning community 6h ago

Built Support Vector Machine(SVM) from scratch in Rust [P]

Built my own SVM classifier from scratch in Rust. It uses SMO optimization, have linear and rbf kernel, uses grid search to tune the hyperparameters. I tested it on two datasets one using Linear dataset and other using RBF, these were the results: Dataset Kernel Accuracy Recall…

10
r/MachineLearning community 7h ago

ML for UFC predictions: logistic regression vs random forest? [P]

Hello everyone, I am pretty new to anything ML related so bear with me. I’ve been working on a UFC fight prediction project in Python using pandas + scikit-learn. Right now I’m using logistic regression since the output is binary (fighter A wins or fighter B wins). I’m currently…

37
r/MachineLearning community 9h ago

Training a number-aware embedding model + Text JEPA doesn't work too well + Text auto-encoders have a strange frequency bias [R][P]

Hi guys! I've spent 1y trying to predict company growth from the full text of their 10-k filings. It completely failed. But I've had a lot of fun playing with encoder transformers and making them good at numbers (bypassing the tokenizer/prediction head for numbers). I've…

22
r/MachineLearning community 9h ago

Elastic Attention Cores for Scalable Vision Transformers [R]

Wanted to share our latest paper on an alternative building block for Vision Transformers. Illustration of our model's accuracy and dense features Traditional ViTs utilize dense ( N 2 ) self-attention, which can become pretty costly at higher resolutions. In this work, we…

35
r/MachineLearning community 10h ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually [R]

Large language models (LLMs) are trained for downstream tasks by updating their parameters (e.g., via RL). However, updating parameters forces them to absorb task-specific information, which can result in catastrophic forgetting and loss of plasticity. In contrast, in-context…

11
r/MachineLearning community 13h ago

Sharing all KGC 2026 decks. More production-grade KG systems than I've seen at any conference. [D]

Didn't make it to New York for the Knowledge Graph Conference this year, but caught some talks virtually and managed to download all the decks. Sharing them below because some of what was shown is worth knowing about. Majority of the presentations described live production…

6
r/MachineLearning community 21h ago

How do you create memorable poster for top tier conferences ( ICML/ICLR/NEURips ect…) [D]

Hello everyone, Presenting at a top-tier conference for the first time and having a very hard time coming up with an appropriate design for my poster. Everything I do seems basic and banal. My paper is more theory-oriented, and apart from putting math formulas in bold in the…

5
r/MachineLearning community 22h ago

I created a minimal one-file implementations (160loc) of JEPA family (ijepa, vjepa, vjepa2, cjepa) for educational purposes [P]

Hi all, I made my own minimal implementation of JEPA algorithms. Making things minimal and removing all the things needed for scaling the algorithm always helped me understand the essence. So I stripped everything but the algorithm parts. What's left is 160-200 lines of code…

26
r/MachineLearning community 1d ago

Steam Recommender using similarity! (Undergraduate Student Project) [P]

(DISCLAIMER: I accidentally deleted the last post on this subreddit my apologies if this is your second time seeing it) Last year I made a post about my steam recommender The last one was great and served its purpose of showing many people new games, But this new version is much…

15
r/MachineLearning community 1d ago

TabPFN-3 just released: a pre-trained tabular foundation model for up to 1M rows [R][N]

TabPFN-3 was released today, the next iteration of the tabular foundation model, originally published in Nature. Quick recap for anyone new to TabPFN: TabPFN predicts on tabular data in a single forward pass - no training, no hyperparameter search, no tuning. Built on TabPFN-2.5…

31
r/MachineLearning community 1d ago

I Found a Hidden Ratio in Transformers That Predicts Geometric Stability [R]

I have analyzed some decoder transformer models using Lyapunov spectral analysis and found that the ratio of the MLP and attention spectral norms strongly indicates whether a model will eventually collapse to rank-1 or not by the final layers. I found that the spectral ratio is…

36

What kinds of models are people training with document data? [P]

Have the "on-hold" durations been getting longer for arXiv submissions? [D]

Image generation models running locally on limited resources [P]

EEML Summer School (Eastern European ML) - Anyone here got accepted? [D]

Best examples of ML projects with good dataset/task code abstractions? [D]

Human-level performance via ML was *not* proven impossible with complexity theory [D]

Built Support Vector Machine(SVM) from scratch in Rust [P]

ML for UFC predictions: logistic regression vs random forest? [P]

Training a number-aware embedding model + Text JEPA doesn't work too well + Text auto-encoders have a strange frequency bias [R][P]

Elastic Attention Cores for Scalable Vision Transformers [R]

Learning, Fast and Slow: Towards LLMs That Adapt Continually [R]

Sharing all KGC 2026 decks. More production-grade KG systems than I've seen at any conference. [D]

How do you create memorable poster for top tier conferences ( ICML/ICLR/NEURips ect…) [D]

I created a minimal one-file implementations (160loc) of JEPA family (ijepa, vjepa, vjepa2, cjepa) for educational purposes [P]

Steam Recommender using similarity! (Undergraduate Student Project) [P]

TabPFN-3 just released: a pre-trained tabular foundation model for up to 1M rows [R][N]

I Found a Hidden Ratio in Transformers That Predicts Geometric Stability [R]

Human-level performance via ML was not proven impossible with complexity theory [D]