News / #hardware Tag Hardware 29 articles archived under #hardware · RSS Sign in to follow r/MachineLearning community 4h ago Human-level performance via ML was *not* proven impossible with complexity theory [D] Van Rooij, Guest, Adolfi, Kolokolova, and Rich claimed to have proven that AGI via ML is impossible in Computational Brain & Behavior in 2024. The basic idea was to try to reduce a known NP-hard problem to the problem of learning a human-level classifier from data. The purported… 17 arXiv — Machine Learning research 15h ago Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models arXiv:2605.11142v1 Announce Type: new Abstract: Graph representation learning has become a standard approach for analyzing networked data, with latent embeddings widely used for link prediction, community detection, and related tasks. Yet a basic design choice, the latent… 36 arXiv — Machine Learning research 15h ago COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication arXiv:2605.11165v1 Announce Type: new Abstract: Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client… 36 r/MachineLearning community 19h ago How do you create memorable poster for top tier conferences ( ICML/ICLR/NEURips ect…) [D] Hello everyone, Presenting at a top-tier conference for the first time and having a very hard time coming up with an appropriate design for my poster. Everything I do seems basic and banal. My paper is more theory-oriented, and apart from putting math formulas in bold in the… 5 Ars Technica — AI news-outlet 21h ago The newest AI boom pitch: Host a mini data center at your home The plan aims to speed up AI compute deployment while compensating residents. 15 Ars Technica — AI news-outlet 1d ago Data center guzzled 30 million gallons of water, and nobody noticed for months Can AI save us from the AI industry’s endless thirst for water? Outlook not so good. 4 NVIDIA Developer Blog official-blog 5d ago Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. This design enables... 10 Simon Willison community 6d ago Notes on the xAI/Anthropic data center deal There weren't a lot of big new announcements from Anthropic at yesterday's Code w/ Claude event, but the biggest by far was the deal they've struck with SpaceX/xAI to use "all of the capacity of their Colossus data center". As I mentioned in my live blog of the keynote , that's… 8 OpenAI news 8d ago Unlocking large scale AI training networks with MRC (Multipath Reliable Connection) OpenAI introduces MRC (Multipath Reliable Connection), a new supercomputer networking protocol released via OCP to improve resilience and performance in large-scale AI training clusters. 25 Simon Willison community 8d ago Quoting Andy Masley [...] Between 2000 and 2024, farmers sold in total a Colorado-sized chunk of land all on their own, 77 times all land on data center property in 2028, and grew more food than ever on what was left. None of this caused any problems for US food access. And then, in the middle of… 11 OpenAI news 14d ago Building the compute infrastructure for the Intelligence Age OpenAI scales Stargate to build the compute infrastructure powering AGI, adding new data center capacity to meet growing AI demand. 20 Vercel — AI dev-tools 15d ago 2026 Vercel AI Accelerator recap On April 16th, 39 teams took the stage to pitch investors at Demo Day. During the prior six weeks, founders worked shoulder-to-shoulder with the Vercel team, our partners, and industry leaders to shape their ideas into the next generation of AI applications. Six weeks with the… 25 MIT News — AI research 16d ago A faster way to estimate AI power consumption The “EnergAIzer” method generates reliable results in seconds, enabling data center operators to efficiently allocate resources and reduce wasted energy. 18 NVIDIA Developer Blog official-blog 20d ago Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20 AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineering tools.... 31 NVIDIA Developer Blog official-blog 22d ago Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these... 36 Dwarkesh Podcast news-outlet 28d ago What I learned this week - Pretraining parallelisms, Can distillation be stopped, Mythos and the cybersecurity equilibrium, Pipeline RL, On why pretraining runs fails April 15, 2025 18 NVIDIA Developer Blog official-blog 1mo ago Running Large-Scale GPU Workloads on Kubernetes with Slurm Slurm is an open source cluster management and job scheduling system for Linux. It manages job scheduling for over 65% of TOP500 systems. Most organizations... 33 MIT News — AI research 1mo ago Sixteen new START.nano companies are developing hard-tech solutions with the support of MIT.nano Startup accelerator program grows to over 30 companies, almost half of them with MIT pedigrees. 10 MIT News — AI research 1mo ago Helping data centers deliver higher performance with less hardware Researchers developed a system that intelligently balances workloads to improve the efficiency of flash storage hardware in a data center. 33 NVIDIA Developer Blog official-blog 1mo ago CUDA Tile Programming Now Available for BASIC! Note: CUDA Tile Programming in BASIC is an April Fools’ joke, but it's also real and actually works, demonstrating the flexibility of CUDA. CUDA 13.1... 5 Vercel — AI dev-tools 1mo ago Unified reporting for all AI Gateway usage If you're shipping AI features, you already have usage data. The problem is that it's split across providers, keys, and dashboards, so it's hard to answer basic questions before the bill shows up. You've probably felt the drift into after-the-fact reconciliation. Provider… 31 Smol AI News news-outlet 1mo ago not much happened today **Cursor** launched **Composer 2**, a frontier-class coding model with major cost reductions and strong benchmark scores like **61.3 on CursorBench** and **73.7 on SWE-bench Multilingual**. The model was improved via a **first continued pretraining run** feeding into… 36 MIT News — AI research 1mo ago MIT-IBM Watson AI Lab seed to signal: Amplifying early-career faculty impact Academia-industry relationship is an early-stage accelerator, supporting professional progress and research. 4 NVIDIA Developer Blog official-blog 1mo ago Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform NVIDIA Groq 3 LPX is a new rack-scale inference accelerator for the NVIDIA Vera Rubin platform, designed for the low-latency and large-context demands of... 20 Import AI news-outlet 1mo ago ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text Will AI cause a political interregnum 5 NVIDIA Developer Blog official-blog 2mo ago Validate Kubernetes for GPU Infrastructure with Layered, Reproducible Recipes Every AI cluster running on Kubernetes requires a full software stack that works together, from low-level driver and kernel settings to high-level operator and... 32 NVIDIA Developer Blog official-blog 2mo ago Accelerating Data Processing with NVIDIA Multi-Instance GPU and Locality Domains NVIDIA flagship data center GPUs in the NVIDIA Ampere, NVIDIA Hopper, and NVIDIA Blackwell families all feature non-uniform memory access (NUMA) behaviors, but... 30 Zed Editor dev-tools 24mo ago Text Manipulation Kung Fu for the Aspiring Black Belt Learn the basics of text manipulation in Zed via a series of guided exercises. 24 Lil'Log (Lilian Weng) research 101mo ago Object Detection for Dummies Part 3: R-CNN Family [Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regression and tricks sections for R-CNN.] In the series of “Object Detection for Dummies”, we started with basic… 6