r/MachineLearning
17 articles archived · Visit source ↗ · RSS
-
r/MachineLearning community 2h ago
What kinds of models are people training with document data? [P]
We've helped some folks with synthetic data for a number of different projects and some of them for "document data". Like annotated PDFs, PNGs. Tax forms, health forms. Especially things with PII that are hard to get because of obvious privacy concerns. So, we came up with an…
30 -
r/MachineLearning community 4h ago
EEML Summer School (Eastern European ML) - Anyone here got accepted? [D]
Has anyone got into EEML Summer School in Montenegro? I did and please feel free to DM to manage stay or other plans after the summer school. I see that it's tricky to get there and find a stay.   submitted by   /u/ade17_in [link]   [comments]
29 -
r/MachineLearning community 6h ago
Built Support Vector Machine(SVM) from scratch in Rust [P]
Built my own SVM classifier from scratch in Rust. It uses SMO optimization, have linear and rbf kernel, uses grid search to tune the hyperparameters. I tested it on two datasets one using Linear dataset and other using RBF, these were the results: Dataset Kernel Accuracy Recall…
10 -
r/MachineLearning community 9h ago
Elastic Attention Cores for Scalable Vision Transformers [R]
Wanted to share our latest paper on an alternative building block for Vision Transformers. Illustration of our model's accuracy and dense features Traditional ViTs utilize dense ( N 2 ) self-attention, which can become pretty costly at higher resolutions. In this work, we…
35 -
r/MachineLearning community 10h ago
Learning, Fast and Slow: Towards LLMs That Adapt Continually [R]
Large language models (LLMs) are trained for downstream tasks by updating their parameters (e.g., via RL). However, updating parameters forces them to absorb task-specific information, which can result in catastrophic forgetting and loss of plasticity. In contrast, in-context…
11 -