News / #hardware Tag Hardware 283 articles archived under #hardware · RSS Sign in to follow r/MachineLearning community 1mo ago OpenAI's deployment company move says more about the AI gap than any benchmark[D] OpenAI launched a deployment company with $4B initial investment, 19 partner organizations, and acquired Tomoro (UK-based AI consultancy, ~150 engineers). The pitch: embed "Forward Deployed Engineers" into enterprises to help them actually use AI. This is basically the Palantir… 35 arXiv — Machine Learning research 1mo ago scShapeBench: Discovering geometry from high dimensional scRNAseq data arXiv:2605.12662v1 Announce Type: new Abstract: High-dimensional point cloud data arise across many scientific domains, especially single-cell biology. The shapes or topologies of these datasets determine the types of information that can be extracted. For example, clustered… 32 The Information — AI news-outlet 1mo ago Fervo Raises Nearly $2 Billion in IPO Enhanced geothermal energy pioneer Fervo Energy raised nearly $2 billion in an initial public offering, giving it the financial firepower to grow aggressively and compete with natural gas to power AI data centers. Fervo is a leader in one of the hottest sectors in energy. The… 8 TechCrunch — AI news-outlet 1mo ago Musk’s xAI is running nearly 50 gas turbines unchecked at its Mississippi data center Gas turbines at xAI's Colossus 2 data center have drawn a lawsuit over the company's use of "mobile" gas turbines as power plants. 12 r/MachineLearning community 1mo ago Human-level performance via ML was *not* proven impossible with complexity theory [D] Van Rooij, Guest, Adolfi, Kolokolova, and Rich claimed to have proven that AGI via ML is impossible in Computational Brain & Behavior in 2024. The basic idea was to try to reduce a known NP-hard problem to the problem of learning a human-level classifier from data. The purported… 17 arXiv — Machine Learning research 1mo ago Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models arXiv:2605.11142v1 Announce Type: new Abstract: Graph representation learning has become a standard approach for analyzing networked data, with latent embeddings widely used for link prediction, community detection, and related tasks. Yet a basic design choice, the latent… 36 arXiv — Machine Learning research 1mo ago COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication arXiv:2605.11165v1 Announce Type: new Abstract: Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client… 36 r/MachineLearning community 1mo ago How do you create memorable poster for top tier conferences ( ICML/ICLR/NEURips ect…) [D] Hello everyone, Presenting at a top-tier conference for the first time and having a very hard time coming up with an appropriate design for my poster. Everything I do seems basic and banal. My paper is more theory-oriented, and apart from putting math formulas in bold in the… 5 Ars Technica — AI news-outlet 1mo ago The newest AI boom pitch: Host a mini data center at your home The plan aims to speed up AI compute deployment while compensating residents. 15 Ars Technica — AI news-outlet 1mo ago Data center guzzled 30 million gallons of water, and nobody noticed for months Can AI save us from the AI industry’s endless thirst for water? Outlook not so good. 4 NVIDIA Developer Blog official-blog 1mo ago Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. This design enables... 10 Simon Willison community 1mo ago Notes on the xAI/Anthropic data center deal There weren't a lot of big new announcements from Anthropic at yesterday's Code w/ Claude event, but the biggest by far was the deal they've struck with SpaceX/xAI to use "all of the capacity of their Colossus data center". As I mentioned in my live blog of the keynote , that's… 8 OpenAI news 1mo ago Unlocking large scale AI training networks with MRC (Multipath Reliable Connection) OpenAI introduces MRC (Multipath Reliable Connection), a new supercomputer networking protocol released via OCP to improve resilience and performance in large-scale AI training clusters. 25 Simon Willison community 1mo ago Quoting Andy Masley [...] Between 2000 and 2024, farmers sold in total a Colorado-sized chunk of land all on their own, 77 times all land on data center property in 2028, and grew more food than ever on what was left. None of this caused any problems for US food access. And then, in the middle of… 11 OpenAI news 2mo ago Building the compute infrastructure for the Intelligence Age OpenAI scales Stargate to build the compute infrastructure powering AGI, adding new data center capacity to meet growing AI demand. 20 Vercel — AI dev-tools 2mo ago 2026 Vercel AI Accelerator recap On April 16th, 39 teams took the stage to pitch investors at Demo Day. During the prior six weeks, founders worked shoulder-to-shoulder with the Vercel team, our partners, and industry leaders to shape their ideas into the next generation of AI applications. Six weeks with the… 25 MIT News — AI research 2mo ago A faster way to estimate AI power consumption The “EnergAIzer” method generates reliable results in seconds, enabling data center operators to efficiently allocate resources and reduce wasted energy. 18 NVIDIA Developer Blog official-blog 2mo ago Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20 AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineering tools.... 31 NVIDIA Developer Blog official-blog 2mo ago Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these... 36 Dwarkesh Podcast news-outlet 2mo ago What I learned this week - Pretraining parallelisms, Can distillation be stopped, Mythos and the cybersecurity equilibrium, Pipeline RL, On why pretraining runs fails April 15, 2025 18 NVIDIA Developer Blog official-blog 2mo ago Running Large-Scale GPU Workloads on Kubernetes with Slurm Slurm is an open source cluster management and job scheduling system for Linux. It manages job scheduling for over 65% of TOP500 systems. Most organizations... 33 MIT News — AI research 2mo ago Sixteen new START.nano companies are developing hard-tech solutions with the support of MIT.nano Startup accelerator program grows to over 30 companies, almost half of them with MIT pedigrees. 10 MIT News — AI research 2mo ago Helping data centers deliver higher performance with less hardware Researchers developed a system that intelligently balances workloads to improve the efficiency of flash storage hardware in a data center. 33 NVIDIA Developer Blog official-blog 3mo ago CUDA Tile Programming Now Available for BASIC! Note: CUDA Tile Programming in BASIC is an April Fools’ joke, but it's also real and actually works, demonstrating the flexibility of CUDA. CUDA 13.1... 5 Vercel — AI dev-tools 3mo ago Unified reporting for all AI Gateway usage If you're shipping AI features, you already have usage data. The problem is that it's split across providers, keys, and dashboards, so it's hard to answer basic questions before the bill shows up. You've probably felt the drift into after-the-fact reconciliation. Provider… 31 Smol AI News news-outlet 3mo ago not much happened today **Cursor** launched **Composer 2**, a frontier-class coding model with major cost reductions and strong benchmark scores like **61.3 on CursorBench** and **73.7 on SWE-bench Multilingual**. The model was improved via a **first continued pretraining run** feeding into… 36 MIT News — AI research 3mo ago MIT-IBM Watson AI Lab seed to signal: Amplifying early-career faculty impact Academia-industry relationship is an early-stage accelerator, supporting professional progress and research. 4 NVIDIA Developer Blog official-blog 3mo ago Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform NVIDIA Groq 3 LPX is a new rack-scale inference accelerator for the NVIDIA Vera Rubin platform, designed for the low-latency and large-context demands of... 20 Import AI news-outlet 3mo ago ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text Will AI cause a political interregnum 5 NVIDIA Developer Blog official-blog 3mo ago Validate Kubernetes for GPU Infrastructure with Layered, Reproducible Recipes Every AI cluster running on Kubernetes requires a full software stack that works together, from low-level driver and kernel settings to high-level operator and... 32 NVIDIA Developer Blog official-blog 4mo ago Accelerating Data Processing with NVIDIA Multi-Instance GPU and Locality Domains NVIDIA flagship data center GPUs in the NVIDIA Ampere, NVIDIA Hopper, and NVIDIA Blackwell families all feature non-uniform memory access (NUMA) behaviors, but... 30 Zed Editor dev-tools 26mo ago Text Manipulation Kung Fu for the Aspiring Black Belt Learn the basics of text manipulation in Zed via a series of guided exercises. 24 Lil'Log (Lilian Weng) research 103mo ago Object Detection for Dummies Part 3: R-CNN Family [Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regression and tricks sections for R-CNN.] In the series of “Object Detection for Dummies”, we started with basic… 6 Page 6 of 6 · 283 articles ← Newer