r/MachineLearning


r/MachineLearning community 1mo ago

What’s the actual focus in World Models right now? [R]

Hey everyone, I'm trying to get back into the loop on world models. The last time I followed SSL closely, the buzz was all about Barlow Twins and DINO, but now everything just looks like scaled-up video generation from big industry labs. What is the actual academic research…

36
r/MachineLearning community 1mo ago

UAI Results are out [R]

You can’t see AC comments yet, but you can see the Accept/Reject consoles. My paper (with scores of 8, 6, 3) got rejected. Submitted by /u/GeeseChen

6
r/MachineLearning community 1mo ago

Workshop on Unlearning and Model Editing U&ME at ECCV 2026 [R]

Submitted by /u/Mushroom-Severe

21
r/MachineLearning community 1mo ago

Arabic ASR model struggling to converge during training [D]

I'm trying to train an ASR model using the LibriSpeech recipe from SpeechBrain (without the language model) on a 100-hour dataset of dialectal Arabic speech. The model architecture uses a Conformer-small encoder and a Transformer decoder, with a total of around 13M parameters.…

23
r/MachineLearning community 1mo ago

What type of models are the most used by you?? [R]

XGBoost, CatBoost, LightGBM, linear regression, decision tree classifier, random forest, SVM, KNN? Or another one that I didn't list? Submitted by /u/Particular_Dog3811

22
r/MachineLearning community 1mo ago

I built a tool to browse and plan CVPR workshop/tutorial days [P]

Hi everyone, as someone attending CVPR, one thing that always frustrated me was managing the workshop and tutorial days. The information is technically all there, but in practice it is scattered across dozens of workshop websites, PDFs, schedules, and program pages. I often…

22
r/MachineLearning community 1mo ago

Built an AI Accelerator and opensourced it. [P]

There is a huge gap in open source AI accelerators, so I implemented my own. Popular, well-known ones are already legacy and don't support contemporary operations like attention. Here is what makes mine special: attention mechanism smelted directly into silicon; prototyped…

25
r/MachineLearning community 1mo ago

When are ICML openreviews made public? [R]

First time, so no idea. Submitted by /u/camelCasedUser

28
r/MachineLearning community 1mo ago

How would you model this "strand" clustering problem? [P]

https://preview.redd.it/llqlupnwng4h1.png?width=2188&format=png&auto=webp&s=7fae5860babaffa1c8bfdcb1468b374eb38ac55d I'm currently building a computer vision application. I've managed to successfully train a YOLO model to detect the object I'm interested in for my videos. The…

33
r/MachineLearning community 1mo ago

I built mlx-Chronos — a community benchmark leaderboard for local LLM engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama) [P]

Hey! I'm a CS student and I got tired of not being able to compare MLX inference engines properly — every benchmark out there is either made by the engine's own developers, runs on an M3 Ultra nobody has, or just shows tok/s with zero context. So I built mlx-Chronos — a small…

11
r/MachineLearning community 1mo ago

[D] Monthly Who's Hiring and Who wants to be Hired?

For Job Postings please use this template Hiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for] For Those looking for jobs please use this template Want to be Hired: [Location], Salary…

5
r/MachineLearning community 1mo ago

Bayesian Opt. GPs vs Linear models and Neural Networks for parameter optimizations [R]

Hi, Relatively new to deep learning. I wanted some opinions on which of these approaches might be best for time series data and spectral analysis. I currently use a GP and it works pretty well, but I’m wondering what the computational tradeoffs and so forth might be. Any ideas?…

4
r/MachineLearning community 1mo ago

Workshop submission for main conference paper under review [D]

I have a paper under review at the ECCV main conference. Can I submit the same paper to a workshop at another venue that takes place before ECCV? The other workshop (non-archival) will be held after the final ECCV decisions come out. Whatever the ECCV result, acceptance or rejection, how will this affect it? I'm not the…

5
r/MachineLearning community 1mo ago

How to fine-tune an LLM for open-ended problems? [P]

I want to develop an LLM that can solve open-ended math problems (such as proof-only problems). This means that RLVR where we use the final answer alone as reward signal is not enough. Since SFT is useless here and GRPO/PPO methods will not have an appropriate reward function,…

34
r/MachineLearning community 1mo ago

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

P.S. Not pitching anything; just trying to understand where reality differs from the narrative. We're a couple of ML students who have mostly worked on ML/software before, but over the last few months we've been playing with VLAs, robot datasets, and trying to understand where the field…

27
r/MachineLearning community 1mo ago

Query about non-archival workshop at CVPR-2026 [R]

My paper was recently accepted to a workshop at CVPR-2026 as a non-archival paper. Is it mandatory for me to register for the conference? I won't be able to attend (visa issues), but my friend will be at the conference and can present on my behalf. I have a few questions…

9
r/MachineLearning community 1mo ago

Why do the output layer weights become word vectors in Word2Vec? [D]

I'm trying to understand the intuition behind Word2Vec training using a neural network. In Word2Vec (CBOW or Skip-gram), we often hear that the weight matrices learned during training contain the vector representations (embeddings) of words. However, I don't understand why the…

31
r/MachineLearning community 1mo ago

What I learned building a debugger for PyTorch training loops and how it changed how I think about failure diagnosis [D]

Hey r/ML, I spent the last few months building a tool that hooks into PyTorch training loops to automatically detect and localize failures (vanishing gradients, exploding gradients, data anomalies). Along the way, I learned some things about training failure diagnosis that…

27
r/MachineLearning community 1mo ago

Requesting reduction in reviewer load for NeurIPS? [D]

I didn't submit any papers but did place bids on some. I was assigned four papers. I have a bit of travel coming up and I don't think I will be able to do justice to that many papers, especially in the rebuttal period. Is this the standard reviewing load? In other communities…

25
r/MachineLearning community 1mo ago

Event like spiking neuron lib that fits into the CPU cache [P]

I benchmarked it against PyTorch with a Wikipedia dataset. I heavily used Gemini Flash 3.5 to build out my vision: https://huggingface.co/etoxin/neuronguard-wikipedia-classifier Submitted by /u/Logical_Prompt_3543

21
r/MachineLearning community 1mo ago

Graduating Without a PhD Internship [D]

In early 2022, I was deciding between PhD offers. The deal maker was a prospective supervisor telling me that through their connections with big tech, I would be able to do a PhD internship each summer, which was one of my main goals for the PhD. During my first and second…

37
r/MachineLearning community 1mo ago

How long does it realistically take for you to produce an ICML/NeurIPS/ICLR-level paper? [D]

Hey everyone, Since there are many researchers here who regularly publish at top-tier ML conferences like ICML, NeurIPS, and ICLR, I wanted to ask about realistic paper timelines. In your lab or research setting, how long does it usually take to develop a paper from the initial…

12
r/MachineLearning community 1mo ago

Does anyone have a copy of the ICDAR2013 Chinese Handwriting Competition Dataset? [R]

I understand that this is a little unorthodox, but I'm desperately trying to download a copy of the ICDAR2013 Chinese Handwriting Recognition Competition Dataset. Unfortunately, the linked page in the Conference Archive: https://nlpr.ia.ac.cn/databases/handwriting/Download.html…

16
r/MachineLearning community 1mo ago

How Much of a Shortcut Are Connections in Top AI Lab Hiring for PhD grads? [D]

Hi everyone. I'm trying to calibrate my expectations and would appreciate honest perspectives from people involved in, or with experience of, hiring at places like Anthropic, OpenAI, Google DeepMind, Meta, etc. (I haven't started interviewing yet). I'm at a top ML university, but my…

35
r/MachineLearning community 1mo ago

What's the theoretical basis for using llm consensus as a probability estimator for real world events [R]

This is a genuine technical question. I've been looking at systems that use an ensemble of AI models to generate probability estimates for open-ended real-world events. The claim is that consensus across multiple models produces more calibrated estimates than any single…

27
r/MachineLearning community 1mo ago

ICML paper checker is down? [D]

I was getting ready to upload my camera-ready paper to ICML (a few minutes before the deadline... no comments), but the paper checker site seemingly went down before I could finish... I emailed the publication chairs already, but I just wanted to know if anyone else was in the same…

6
r/MachineLearning community 1mo ago

Hopfield Memory in VLA [R]

I am currently doing a two-month research internship on VLA, and I have come across the Hopfield network through the paper "Hopfield Networks is All You Need". I see potential advantages in using it as a memory module over the transformer-based HAMLET…

18
r/MachineLearning community 1mo ago

Building a monokernel for LLM inference on AMD MI300X - up to 3,300 output tokens/s per request [P]

We built a monokernel that runs the full decode sequence as one GPU-resident program on AMD MI300X, with some neat optimizations. The die topology is central to the result: we map memory access patterns to the physical layout, compute units group by their associated IOD, and the…

30
r/MachineLearning community 1mo ago

Making LLMs tell you how confident they really are through probe-targeted fine tuning. [R]

Just wanted to share my research on probe-targeted fine-tuning (LoRA) for verbal confidence calibration. If you probe the hidden states of an instruct-tuned LLM, the probe can tell correct from incorrect answers at 0.76–0.88 AUROC. But when you ask it directly it tends to…

16
r/MachineLearning community 1mo ago

Social Simulation with LLMs - Fidelity in Applications (CFP @ COLM'26) [R]

🌟 Announcing the 2nd Workshop on Social Simulation with LLMs (Social Sim'26) @ COLM 📣 Welcoming Submissions! Submission here:. 🗓️ Deadline: June 23, 2026 (AoE) This year's theme is “Fidelity in Applications”, moving beyond compelling demos toward evaluation, robustness,…

11
r/MachineLearning community 1mo ago

I built a knowledge graph + policy engine for AI agents, explainable reasoning [D]

Hey, I've been building VeritasReason — an open-source Python framework that adds a structured reasoning and provenance layer on top of LLMs and AI agents. The problem it solves: AI agents today make decisions but record nothing. When something breaks in prod, you have zero…

38
r/MachineLearning community 1mo ago

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R]

Are agents aging after deployment? https://arxiv.org/abs/2605.26302 On a new longitudinal deployment benchmark, switching the Claude Code CLI agent from Sonnet 4.6 to Opus 4.7 dropped PyTest pass rate by ~15%. This (to me) is a counterintuitive-enough result to pay attention…

6
r/MachineLearning community 1mo ago

Wall-OSS-0.5: 4B VLA with open training code and zero-shot real-robot evaluation [D]

Wall-OSS-0.5 is a new 4B VLA release from X Square Robot, built on a 3B VLM backbone with action experts in a Mixture-of-Transformers layout. What caught my eye is that the report evaluates the pretrained checkpoint on real robots before task-specific fine tuning, instead of…

25
r/MachineLearning community 1mo ago

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both…

15
r/MachineLearning community 1mo ago

Built a richer reading layer for arxiv (Chrome extension + web): OpenReview reviews, GitHub/HuggingFace links, citation graph, SPECTER2 neighbors, TLDRs. 3M papers, free, looking for feedback [P]

Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both…

10
r/MachineLearning community 1mo ago

A new dataset with more than 100M high-quality, curated images, with captions and metadata! [P]

Hello everyone. The new dataset is named MONET; it is Apache 2.0-licensed and available on HF: https://huggingface.co/datasets/jasperai/monet MONET is an open, Apache 2.0-licensed image–text dataset. It was built from 2.9 billion images and refined to 104.9 million high-quality samples. We are…

5
r/MachineLearning community 1mo ago

ACM MM 2026 review discussion [D]

The AC email says the rebuttal runs from the 28th to the 4th. June 4th, as listed on the website, is the deadline. So I created this post for discussion. I know it's an MM conference and less about ML, but I think many people here are still submitting there. Submitted by …

32
r/MachineLearning community 1mo ago

Training GPT-like model on non-language series [R]

I am responsible for a research project that is supposed to train a GPT-like model (Transformer decoder) in 100M, 250M and 500M parameter variants. Training dataset: 750M tokens; vocabulary of ~15k to ~100k tokens (depends on tokenizer settings); ~3% of the…

29
r/MachineLearning community 1mo ago

Diffusion models for sketch-guided trajectory simulation [R]

Blog post: https://wezteoh.github.io/posts/diffusion-for-sketch-guided-trajectory-simulation/ During NBA games, coaches often sketch attacking plays on a whiteboard and mentally simulate how teammates and defenders might react. In this project, I explored using diffusion models…

30
r/MachineLearning community 1mo ago

STEM PhDs transitioning to MLE/Data [R]

I'm hoping for some advice from any former PhDs outside of machine learning. If you made it into machine learning engineering and/or data science, what was the key for you? Any tips for this job market? It seems like non-computer-science PhDs are especially in trouble at the…

38
r/MachineLearning community 1mo ago

BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]

I’m looking for feedback on a local agent-memory benchmark comparison, especially from people who care about evaluation methodology. I built an open-source R&D memory system called Context Swarm Memory…

31
r/MachineLearning community 1mo ago

Cross-Platform Fused MoE Dispatch in Triton: Portable Expert Routing Without CUDA [R]

New preprint. A Mixture-of-Experts inference kernel (TritonMoE) written entirely in OpenAI Triton, targeting portability across NVIDIA and AMD without vendor-specific code. Highlights: A fused gate+up GEMM computes both SwiGLU projections from shared tile loads, eliminating 35%…

38
r/MachineLearning community 1mo ago

UK GDPR Small Business Q&A — 5,000 synthetic pairs with article-level citations [D]

Dataset for fine-tuning compliance assistants. Each pair includes: - A practical SME-facing question ("Can I use pre-ticked consent boxes?") - An answer with specific UK GDPR article references, ICO guidance by name, and actionable steps - Source metadata: which GDPR concepts…

23
r/MachineLearning community 1mo ago

Should I attend ICML as a junior? [D]

I am a junior in college, and have two accepted workshop papers at ICML 2026. Some background: I had an accepted workshop paper last year at ICLR, but couldn't attend due to a rejected visa, which led to all the more disappointment. So this year I was VERY eager to attend, and…

4
r/MachineLearning community 1mo ago

I used the N.E.A.T algorithm to teach AI how to control a worm in my game in making! It uses evolution to improve. [P]

Each brain is unique, and from the best generations that I save, a worm can pick random brain files to use, letting each worm be completely unique and feel alive. This is for Bonk Universe. Submitted by /u/Lanse012

38
r/MachineLearning community 1mo ago

"Unified Neural Scaling Laws" paper release [R]

https://x.com/ethanCaballero/status/2059686905105563907 Submitted by /u/Glittering_Author_81

13
r/MachineLearning community 1mo ago

What 1000+ Harness Experiments Taught Me About Self-Improving Agents [R]

I recently wanted to see whether an AI agent could self-improve a harness to solve terminal bench tasks. It’s possible for an AI agent to propose a meaningful one-time change to the harness, but after experimenting with this for a couple of weeks, I think the continuous…

35
r/MachineLearning community 1mo ago

AI-generated CUDA kernels silently break training and inference [R]

Last month NVIDIA released SOL-ExecBench, a new benchmark of 235 production CUDA kernels lifted from DeepSeek, Qwen, Gemma, and Kimi. We took several top-ranked AI-generated submissions and tried using them in production workloads. Many of them broke, sometimes in surprising…

14
r/MachineLearning community 1mo ago

Best Text to Text Translation Model? [D]

I'm working on a project that translates any language into English. So far, I've tried NMT models like NLLB, MADLAD, and SeamlessM4T v2. The main issue is that they struggle with proper nouns such as names, places, dates, and organizations. I also tried LLMs like Gemma 4, Qwen…

22
r/MachineLearning community 1mo ago

Physics-Informed Neural Networks for damped harmonic oscillator and Burgers' Equation (with extrapolation analysis) [P]

I built a PINN implementation in Python to solve two problems as part of a physics exam project: the damped harmonic oscillator (2nd-order ODE) and the 1D viscous Burgers' equation (nonlinear PDE). Both forward and inverse problems (to estimate unknown equation parameters from…

37
