r/MachineLearning
r/MachineLearning community 27d ago
[R] Measuring the Symmetry-Data Exchange Rate
The prediction that equivariance reduces sample complexity by a factor of |G| appears in roughly every paper on geometric deep learning and is measured as an actual scaling law in roughly none of them. This paper does the measurement. The methodology is the interesting part.…
9 -
r/MachineLearning community 28d ago
How do ML researchers actually use AI tools to improve their writing? [D]
As an ML researcher, how do you use AI tools in your daily work? Do you mostly use them to clean up grammar and wording, or also to rewrite, structure, or draft technical text? submitted by /u/Hope999991
5 -
r/MachineLearning community 28d ago
KVarN: Variance-Normalized KV-Cache Quantization [R]
Excited to share some of my own work here :) KVarN is our new KV-Cache quantization method. Very briefly, we combine Hadamard rotations with variance-normalization on both axes of the K and V matrices, then round to nearest. Simple, but works very well, especially for…
21 -
r/MachineLearning community 28d ago
On-policy distillation: one of the hottest terms on PapersWithCode [R]
Hi, Niels here from the open-source team at Hugging Face. At paperswithcode.co I am trying to make it easier for people to learn about the newest techniques used across AI papers. One of the hottest terms in AI research that I've recently added is On-policy distillation, also…
27 -
r/MachineLearning community 28d ago
ICML financial aid [D]
Hello, I am curious about the selection criteria for ICML financial aid. If anyone has been granted financial aid, would you mind sharing your profile? Somehow being a black woman (2 underrepresented groups) with one paper accepted at the main conference and two papers accepted…
7 -
r/MachineLearning community 28d ago
Embedding space [D]
Hello everyone, I’m relatively new to this area of machine learning and currently experimenting with Variational Autoencoders (VAEs) to build an embedding space for an image dataset. The images have different spatial dimensions, and I cannot easily standardize them to a fixed size.…
11 -
r/MachineLearning community 28d ago
Repo for implementations of various Transformer Attn mechanisms [P]
Initially, I developed this so I can easily switch between different Attention mechanisms for my Small Language Model (SLM) experiments and benchmarking. However, I also realized that these implementations can be applicable in Computer Vision, modernize Vision Encoders, RL, and…
14 -
r/MachineLearning community 28d ago
Research in Image/Video Gen AI models [D]
I've been going down a rabbit hole with image/video generation/editing models for a few months now, started with playing around with Stable Diffusion and ComfyUI, then got genuinely hooked on understanding why things work, not just that they do. I have an Engineering background…
20 -
r/MachineLearning community 28d ago
Best Visual Reasoning Model in 2026 (Including APIs) [D]
For example, suppose I have a one-hour video and I provide it to ChatGPT or another AI model. If I ask complex reasoning questions about the video, which models are best suited for long-horizon video understanding and reasoning? Which models can produce the most reliable answers…
38 -
r/MachineLearning community 28d ago
I have done a ML Project as a Novice [P]
Hi there! I am about to complete my MSc in Business Analytics and am planning to do some real-life projects to attract recruiters. I am sharing one such project here: FIFA World Cup 2026 Prediction: https://amit-world-cup-2026-simulator.streamlit.app/ Project Overview Large…
5 -
r/MachineLearning community 28d ago
Has anyone heard back from the Citadel ICML travel grant? [D]
It’s confusing because they said applicants would be notified on 3rd June, but also that you’ll be notified 2-4 weeks after the deadline (29th May). submitted by /u/Smol_pp001
6 -
r/MachineLearning community 28d ago
First paper acceptance (ICML Workshop), should I attend? [D]
I just finished my first year of undergrad, and I got my first first-author paper accepted to an ICML workshop! Super stoked, especially since I was lowk a crashout in high school. I wanted to know if it is worth it for me to go? It's quite expensive, and I will be the only one…
30 -
r/MachineLearning community 28d ago
NeurIPS Reciprocal Reviewers be careful in reviewing with LLMs [D]
As the title says. I am not a reciprocal reviewer but I just noticed a clever prompt injection like they did in ICML for our submission. submitted by /u/Massive-Bobcat-5363
18 -
r/MachineLearning community 29d ago
NeurIPS used uncalibrated AI detector for desk rejections [D]
I recently had a submission desk-rejected from the NeurIPS 2026 Position Paper Track for an alleged AI-policy violation. After corresponding with the track leadership and reading their public blog post, I think the broader methodological issue is worth discussing here. The track…
13 -
r/MachineLearning community 29d ago
Analysis of AlphaZero training data [D]
I am trying to train an AlphaZero model for Othello on a 6x6 board. Having been warned that too little exploration during data generation can lead to models being overconfident and trapped in some tight region of the search tree, I started with the value c_puct = 4.0, and then…
35 -
r/MachineLearning community 29d ago
MiniMax dropped a new attention architecture. [N]
It contains something interesting about context windows. They’re natively scaling to 1M tokens with MiniMax Sparse Attention (MSA), bypassing standard quadratic complexity by completely restructuring the memory access patterns at the operator level. Instead of relying on…
26 -
r/MachineLearning community 1mo ago
Thoughts on Logical Intelligence’s Kona [D]
Sometime late last year, a company called Logical Intelligence developed an EBM called Kona. What do people make of the company’s claim that they have a close-to-functioning EBM? And if true, what impact would this have on existing AI? submitted by /u/Treey1234…
24 -
r/MachineLearning community 1mo ago
TPAMI Survey Paper Length at submission time? [D]
My paper is around 33 pages, everything included, but the TPAMI guideline says it should be 20 pages. Does anyone know which is correct? submitted by /u/Alternative_Art2984
30 -
r/MachineLearning community 1mo ago
Is the hallucination problem solved for document search? [D]
I was wondering if anyone knows of state-of-the-art research on the hallucination problem for document search with LLMs. I know, for example, that in math you can use a verifier to check a proof. What about document search with LLMs, when I feed them documents? submitted by…
23 -
r/MachineLearning community 1mo ago
Browse CVPR 2026 papers on PapersWithCode [P]
Hi, Niels here from the open-source team at Hugging Face. It's been 2 weeks since I launched paperswithcode.co, a revival of the website we all loved. It allows…
11 -
r/MachineLearning community 1mo ago
[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link shorteners, link aggregator websites, or auto-subscribe links. -- Any abuse…
22 -
r/MachineLearning community 1mo ago
ICML Conference Ticket (looking to purchase) [D]
Hi everyone, I missed the ICML conference tickets because I was waiting for some travel funding confirmation and now they are sold out. Do you know any other ways I could still purchase one? There seems to be no waiting list… or if you know anyone who needs to cancel theirs,…
37 -
r/MachineLearning community 1mo ago
Full duplex vs half duplex - the spectrum of AI voice models [D]
It seems that there are two ways to build voice AI: Half-duplex: strict turn-taking. You speak, the other side waits until you’re done, one direction of speech at a time. ← This is how almost every voice assistant works today. Full-duplex: two channels, both sides can talk at…
32 -
r/MachineLearning community 1mo ago
Feedback on my EU AI Act Risk Tier Assessor [P]
Hey everyone, hope this is ok to post here. I built a free EU AI Act risk assessment tool and would love some feedback from people who actually know this space. You fill out a 10-question form describing your AI system, it classifies your EU AI Act risk tier, and emails you a…
35 -
r/MachineLearning community 1mo ago
ICML Financial Aid [D]
Financial aid results for ICML are out and unfortunately I wasn't selected. I was wondering, does this mean I wasn't selected for volunteering as well? Or should I expect a separate email? submitted by /u/RussB3ar
34 -
r/MachineLearning community 1mo ago
[D] Simple Questions Thread
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the…
32 -
r/MachineLearning community 1mo ago
Do you see GNNs playing a meaningful role in astrophysics research? [D]
A bit of background about myself: I have been accepted to RWTH Aachen's Computer Science program starting this fall, and one of the things that I am genuinely excited about is exploring the intersection of astrophysics and machine learning. The tricky part is that RWTH's CS…
13 -
r/MachineLearning community 1mo ago
Free AI Agent Security Assessment [P]
Hey everyone, We’re building Antitech, a security layer for AI agents and LLM-powered workflows. We’re opening a small number of free early-access assessments for teams/builders working on AI agents. If you give us access to an endpoint of a Dockerized / sandboxed environment…
8