r/MachineLearning
500 articles archived · Visit source ↗ · RSS
-
r/MachineLearning community 1mo ago
EMA-Gated Temporal Sequence Compression in Vision Transformers [P]
Vision Transformers waste 90% of their compute recalculating stationary asphalt. NeuroFlow tracks semantic surprise in embedding space, physically eliminating background tokens before the encoder. NeuroFlow is a dynamic routing framework for Vision Transformer video inference.…
34 -
r/MachineLearning community 1mo ago
Profiling PyTorch training without accidentally stalling the GPU [D]
Profiling PyTorch training has an interesting measurement problem: the more you measure, the more you can change the behavior of the run itself. A simple example is torch.cuda.synchronize() . It gives cleaner timing boundaries, but it also inserts synchronization points into an…
13 -
r/MachineLearning community 1mo ago
A Tiny Open-Source Self-Driving AI That Runs on a Phone [P]
https://preview.redd.it/ww14mzr2fm3h1.png?width=1890&format=png&auto=webp&s=79873d47ae79c7815ca3e7e91fd43141632174f5 https://www.youtube.com/watch?v=rr_uS4bf0B4&feature=youtu.be trained a 7MB open-source L4 self-driving AI that learns navigation, lane following, and drift…
11 -
r/MachineLearning community 1mo ago
What to use for Sign Language Recognition [R]
Hi everyone, I'm finishing up my proposal for my undergraduate thesis for computer science on sign language recognition, specifically Filipino Sign Language and i want to ask what architecture to use for my methodology that is best, rn im considering Mediapipe Holistic +…
32 -
r/MachineLearning community 1mo ago
[R]GNN Model For Fraud Detection Isn't Performing Well[R]
We're writing a research paper on explainable fraud detection GNN model and in the first step we're creating a basic Graph Neural Network for that. We're using the most famous dataset available on this topic i.e IEEE CIS Fraud Detection Dataset and implemented all necessary…
7 -
r/MachineLearning community 1mo ago
Trouble exploring in ai/ml,idk where to being with [D]
So as the title says Context:I am a sophomore in computer science Have prior knowledge in maths(especially the relevant topics in ml) Good enough with numpy,pandas I don't really know where to start Ok internet every second guy is trying to make me earn 100k/year in 3 months…
24 -
r/MachineLearning community 1mo ago
[P] have a couple technical questions for my LLM router. [P]
I am a CS undergrad and I think token economics is the next big problem for companies. I am building a LLM router specifically for code and codebases. The Routing is not actually done by a heavily fine tuned llm(already existing solutions do this). Using a bit of a different…
11 -
r/MachineLearning community 1mo ago
[D] Dlib or pytorch to CNN? [D]
I’m currently studying ML, more specifically convolutional neural networks (CNNs) for finding patterns in images. Right now, I’m trying to develop a model that can solve the “Where’s Waldo?” challenge. However, I currently have a question: what would be the best option for…
31 -
r/MachineLearning community 1mo ago
[D] Where do you go for serious AI research discussion online? [D]
Looking for communities where people actually dig into ML/AI research, not hype, not "look what I built with an LLM API," but discussions about papers, training dynamics, debugging real models, infra problems, that kind of thing. I'm specifically interested in places where you…
15 -
r/MachineLearning community 1mo ago
Aiki my local Wikipedia Retrieval-Augmented Generation system [R]
Hey i built Aiki a lightweight tool that let's you chat with Wikipedia locally. what it does: - Downloads and chunks wikipedia articles (u can choose those articles by their name or articles and also the option of downloading the similar topics) - Uses a custom TF-IDF + cosine…
23 -
r/MachineLearning community 1mo ago
Is AI inference platform really that saturated now? [D]
I’m thinking of expanding an on-device inference SDk into a full blown AI inference platform and seeing more and more inference platform popping out. Been talking with a VC from Seattle/NY. Is this space really that saturated?   submitted by   /u/kampak212 [link]  …
35 -
r/MachineLearning community 1mo ago
𝐃𝐞𝐥𝐭𝐚 𝐀𝐭𝐭𝐞𝐧𝐭𝐢𝐨𝐧 𝐑𝐞𝐬𝐢𝐝𝐮𝐚𝐥𝐬 [R]
We're excited to release 𝐃𝐞𝐥𝐭𝐚 𝐀𝐭𝐭𝐞𝐧𝐭𝐢𝐨𝐧 𝐑𝐞𝐬𝐢𝐝𝐮𝐚𝐥𝐬, a drop-in upgrade to residual connections that learns which past layers to route from — without the routing collapse that breaks prior cross-layer attention at scale. 🚀 Attention Residuals route over…
9 -
r/MachineLearning community 1mo ago
Anyone heard from ICML about Oral decisions yet? [D]
hi all, my paper received a spotlight from ICML. they told us that we would receive decisions as to whether our paper would get an oral by the end of the month with the implication that we wouldn’t receive a notification if we didn’t get it; I was just wondering if anyone has…
30 -
r/MachineLearning community 1mo ago
I’m building an open-source decision layer above AI agents [P]
Hi everyone, I’m Jia, the creator of Spice. I’ve been working on an open-source project called Spice. The simplest way to describe it is: Spice is a decision layer above agents. Most agent systems today are very focused on execution, They are getting better at doing tasks after…
30 -
r/MachineLearning community 1mo ago
Call for Papers - Workshop on Efficient Reasoning at COLM 2026 [R]
🌟 Announcing the 2nd Workshop on Efficient Reasoning (ER) at @colm2026 — Oct 9! 📣 We welcome submissions! Submit your work here: https://openreview.net/group?id=colmweb.org/COLM/2026/Workshop/Efficient_Reasoning 🗓️ Deadline: July 12, 2026 (AoE) 🔗 Website:…
11 -
r/MachineLearning community 1mo ago
Are ICML workshops worth attending? [D]
Hi! I missed securing a main conference ticket for ICML 2026, as my workshop paper got accepted two days ago. Do you believe that it is worth attending just workshops at such A*-tier conferences (with all the overseas travel costs etc.)? I was quite looking forward to attending…
31 -
r/MachineLearning community 1mo ago
Using large language models [R]
Can LLMs be used to come up with a research topic that's worthwhile? Has anyone had good results in coming up with solid research ideas by chatting with an LLM? Maybe using Claude to review existing work and define the research topic. Thanks!   submitted by  …
24 -
r/MachineLearning community 1mo ago
Sponsio: Deterministic Contract Layer for LLM Agents [P]
We've been trying to put LangGraph agents into production for a while. The thing that kept biting us was tool-call boundary enforcement: stuff like "must call X before Y", "max N retries", "approval gate before destructive action". Worked fine in demos, broke at the moments that…
31 -
r/MachineLearning community 1mo ago
Please help with tensor dock [d]
Anyone have any idea what I should do. This is my email to tensor dock. I developed corporate GPU benchmarking software so I need a cloud PC that can benchmark 5090 Consumer cards and 4090 Consumer cards. It worked absolutely amazing for six hours yesterday on the 4090 full…
28 -
r/MachineLearning community 1mo ago
"AI solved one of math's greatest challenges, but it cannot add two numbers reliably?!" [D]
Suppose your friend, a mathematician, woke up from a 5-year coma. How would you explain this to him? Do we even have an explanation other than "it is what it is"?   submitted by   /u/we_are_mammals [link]   [comments]
26 -
-
r/MachineLearning community 1mo ago
PapersWithCode new features - week 1 [P]
Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co , a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…
23 -
r/MachineLearning community 1mo ago
Expedia ML Scientist II interview experience anyone ? [D]
I have an Initial Technical Screen interview (45 Mins) coming up for ML Scientist II: Agentic AI role, and wanted to know what to expect. Would really appreciate any info. Haven't found much information on this interview experience. Thanks!   submitted by  …
27 -
-
-
r/MachineLearning community 1mo ago
pipeline is really slow - consulting [D]
Hi, after a long debugging process and many discussions, I wanted to ask for advice from people who may have encountered similar training bottlenecks. My goal is imitation learning for robotics. Model / Pipeline Observation space: 4 RGB robot cameras image resolution: 128x128x3…
25 -
r/MachineLearning community 1mo ago
AgentLantern: exposing the hidden graph of AI agent projects [P]
AI agent frameworks make it easy to create agents, tasks, tools, and workflows. But as soon as a project grows beyond a few agents, the real execution graph becomes difficult to understand. The issue : agent projects often hide their structure across code, YAML files, tool…
7 -
r/MachineLearning community 1mo ago
Hebbian architecture AI model [R]
Hello , for some time now i have been hooked on a side project after work hours, these are the results for a Hebbian architecture AI model. The model does not use backpropagation or gradients, the substrate started as a 1000k neuron and scaled to 100k between versions. The…
31 -
r/MachineLearning community 1mo ago
Alignment: Higher order prioritizing over constraints [R]
So, I ran across a behavior that I found interesting and may lead to alignment or safety research. I'm going to try to maintain an abstract description of what happened without giving away the details and the keys to jailbreaking. The nature of a transformer is to predict the…
25