Tag

Hardware

283 articles archived under #hardware

r/MachineLearning community 1mo ago

OpenAI's deployment company move says more about the AI gap than any benchmark [D]

OpenAI launched a deployment company with $4B initial investment, 19 partner organizations, and acquired Tomoro (UK-based AI consultancy, ~150 engineers). The pitch: embed "Forward Deployed Engineers" into enterprises to help them actually use AI. This is basically the Palantir…

35
arXiv — Machine Learning research 1mo ago

scShapeBench: Discovering geometry from high dimensional scRNAseq data

arXiv:2605.12662v1 Announce Type: new Abstract: High-dimensional point cloud data arise across many scientific domains, especially single-cell biology. The shapes or topologies of these datasets determine the types of information that can be extracted. For example, clustered…

32
The Information — AI news-outlet 1mo ago

Fervo Raises Nearly $2 Billion in IPO

Enhanced geothermal energy pioneer Fervo Energy raised nearly $2 billion in an initial public offering, giving it the financial firepower to grow aggressively and compete with natural gas to power AI data centers. Fervo is a leader in one of the hottest sectors in energy. The…

8
TechCrunch — AI news-outlet 1mo ago

Musk’s xAI is running nearly 50 gas turbines unchecked at its Mississippi data center

Gas turbines at xAI's Colossus 2 data center have drawn a lawsuit over the company's use of "mobile" gas turbines as power plants.

12
r/MachineLearning community 1mo ago

Human-level performance via ML was *not* proven impossible with complexity theory [D]

Van Rooij, Guest, Adolfi, Kolokolova, and Rich claimed to have proven that AGI via ML is impossible in Computational Brain & Behavior in 2024. The basic idea was to try to reduce a known NP-hard problem to the problem of learning a human-level classifier from data. The purported…

17
arXiv — Machine Learning research 1mo ago

Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models

arXiv:2605.11142v1 Announce Type: new Abstract: Graph representation learning has become a standard approach for analyzing networked data, with latent embeddings widely used for link prediction, community detection, and related tasks. Yet a basic design choice, the latent…

36
arXiv — Machine Learning research 1mo ago

COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication

arXiv:2605.11165v1 Announce Type: new Abstract: Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client…

36
r/MachineLearning community 1mo ago

How do you create a memorable poster for top-tier conferences (ICML/ICLR/NeurIPS etc…) [D]

Hello everyone, Presenting at a top-tier conference for the first time and having a very hard time coming up with an appropriate design for my poster. Everything I do seems basic and banal. My paper is more theory-oriented, and apart from putting math formulas in bold in the…

5
Ars Technica — AI news-outlet 1mo ago

The newest AI boom pitch: Host a mini data center at your home

The plan aims to speed up AI compute deployment while compensating residents.

15
Ars Technica — AI news-outlet 1mo ago

Data center guzzled 30 million gallons of water, and nobody noticed for months

Can AI save us from the AI industry’s endless thirst for water? Outlook not so good.

4
NVIDIA Developer Blog official-blog 1mo ago

Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling

NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. This design enables...

10
Simon Willison community 1mo ago

Notes on the xAI/Anthropic data center deal

There weren't a lot of big new announcements from Anthropic at yesterday's Code w/ Claude event, but the biggest by far was the deal they've struck with SpaceX/xAI to use "all of the capacity of their Colossus data center". As I mentioned in my live blog of the keynote, that's…

8
OpenAI news 1mo ago

Unlocking large scale AI training networks with MRC (Multipath Reliable Connection)

OpenAI introduces MRC (Multipath Reliable Connection), a new supercomputer networking protocol released via OCP to improve resilience and performance in large-scale AI training clusters.

25
Simon Willison community 1mo ago

Quoting Andy Masley

[...] Between 2000 and 2024, farmers sold in total a Colorado-sized chunk of land all on their own, 77 times all land on data center property in 2028, and grew more food than ever on what was left. None of this caused any problems for US food access. And then, in the middle of…

11
OpenAI news 2mo ago

Building the compute infrastructure for the Intelligence Age

OpenAI scales Stargate to build the compute infrastructure powering AGI, adding new data center capacity to meet growing AI demand.

20
Vercel — AI dev-tools 2mo ago

2026 Vercel AI Accelerator recap

On April 16th, 39 teams took the stage to pitch investors at Demo Day. During the prior six weeks, founders worked shoulder-to-shoulder with the Vercel team, our partners, and industry leaders to shape their ideas into the next generation of AI applications. Six weeks with the…

25
MIT News — AI research 2mo ago

A faster way to estimate AI power consumption

The “EnergAIzer” method generates reliable results in seconds, enabling data center operators to efficiently allocate resources and reduce wasted energy.

18
NVIDIA Developer Blog official-blog 2mo ago

Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20

AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineering tools....

31
NVIDIA Developer Blog official-blog 2mo ago

Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson

The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these...

36
Dwarkesh Podcast news-outlet 2mo ago

What I learned this week - Pretraining parallelisms, Can distillation be stopped, Mythos and the cybersecurity equilibrium, Pipeline RL, On why pretraining runs fails

April 15, 2025

18
NVIDIA Developer Blog official-blog 2mo ago

Running Large-Scale GPU Workloads on Kubernetes with Slurm

Slurm is an open source cluster management and job scheduling system for Linux. It manages job scheduling for over 65% of TOP500 systems. Most organizations...

33
MIT News — AI research 2mo ago

Sixteen new START.nano companies are developing hard-tech solutions with the support of MIT.nano

Startup accelerator program grows to over 30 companies, almost half of them with MIT pedigrees.

10
MIT News — AI research 2mo ago

Helping data centers deliver higher performance with less hardware

Researchers developed a system that intelligently balances workloads to improve the efficiency of flash storage hardware in a data center.

33
NVIDIA Developer Blog official-blog 3mo ago

CUDA Tile Programming Now Available for BASIC!

Note: CUDA Tile Programming in BASIC is an April Fools’ joke, but it's also real and actually works, demonstrating the flexibility of CUDA. CUDA 13.1...

5
Vercel — AI dev-tools 3mo ago

Unified reporting for all AI Gateway usage

If you're shipping AI features, you already have usage data. The problem is that it's split across providers, keys, and dashboards, so it's hard to answer basic questions before the bill shows up. You've probably felt the drift into after-the-fact reconciliation. Provider…

31
Smol AI News news-outlet 3mo ago

not much happened today

**Cursor** launched **Composer 2**, a frontier-class coding model with major cost reductions and strong benchmark scores like **61.3 on CursorBench** and **73.7 on SWE-bench Multilingual**. The model was improved via a **first continued pretraining run** feeding into…

36
MIT News — AI research 3mo ago

MIT-IBM Watson AI Lab seed to signal: Amplifying early-career faculty impact

Academia-industry relationship is an early-stage accelerator, supporting professional progress and research.

4
NVIDIA Developer Blog official-blog 3mo ago

Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform

NVIDIA Groq 3 LPX is a new rack-scale inference accelerator for the NVIDIA Vera Rubin platform, designed for the low-latency and large-context demands of...

20
Import AI news-outlet 3mo ago

ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text

Will AI cause a political interregnum

5
NVIDIA Developer Blog official-blog 3mo ago

Validate Kubernetes for GPU Infrastructure with Layered, Reproducible Recipes

Every AI cluster running on Kubernetes requires a full software stack that works together, from low-level driver and kernel settings to high-level operator and...

32
NVIDIA Developer Blog official-blog 4mo ago

Accelerating Data Processing with NVIDIA Multi-Instance GPU and Locality Domains

NVIDIA flagship data center GPUs in the NVIDIA Ampere, NVIDIA Hopper, and NVIDIA Blackwell families all feature non-uniform memory access (NUMA) behaviors, but...

30
Zed Editor dev-tools 26mo ago

Text Manipulation Kung Fu for the Aspiring Black Belt

Learn the basics of text manipulation in Zed via a series of guided exercises.

24
Lil'Log (Lilian Weng) research 103mo ago

Object Detection for Dummies Part 3: R-CNN Family

[Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regression and tricks sections for R-CNN.] In the series of “Object Detection for Dummies”, we started with basic…

6
