r/MachineLearning

500 articles archived · Visit source ↗ · RSS

r/MachineLearning community 4h ago

SentryCode: Real-time Auditor + Honeytokens for AI Coding Agents [P]

In light of recent privacy concerns arising from local AI coding agents performing telemetry, environmental scanning, and hidden cue fingerprinting, I've open-sourced SentryCode—a kernel-level behavior auditing tool. It logs file/network/cue activity, uses honeypot tokens for…

12
r/MachineLearning community 5h ago

[D] Self-Promotion Thread

Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link shorteners, link aggregator websites , or auto-subscribe links. -- Any abuse…

17
r/MachineLearning community 7h ago

Making Optimization Work When Labels Are Scarce [R]

https://www.gnosyslabs.com/case-studies/safety-classifier-sparse-labels Gnosys is an autonomous model engineer: it improves prompts and classifiers when ground truth is too sparse for conventional optimization. On ToxicChat, a public safety benchmark, under realistic label…

23
r/MachineLearning community 10h ago

Hamiltonian Neural Networks from a Differential Geometry Perspective [D]

This is a write-up on our company blog that I wrote, sharing our perspective into Hamiltonian Neural Networks (Greydanus et al., 2019) from a differential-geometry angle rather than the usual "here's the loss function" treatment. I've been working on HNN and LNN adjacent topics…

17
r/MachineLearning community 10h ago

New PyMuPDF release, supports Markdown [N]

https://pymupdf.io/blog/markdown-in-pymupdf-1-28 PyMuPDF 1.28 release, introduces Markdown as a first class document in PyMuPDF. Seems useful for a variety of workflows. You can create PDFs from Markdown text with control over appearance using CSS   submitted by  …

9
r/MachineLearning community 12h ago

How to describe a model that has higher accuracy with fewer #param and FLOPs? [D]

Hello, My supervisor is nowhere to be found so I am turning to the internet for my naive questions.   submitted by   /u/obliviousphoenix2003 [link]   [comments]

25
r/MachineLearning community 13h ago

ACL ARR May 2026[D]

Hi everyone. Do the ACL arr may 2026 reviews come out of July 2nd or do they come out on July 7 th?? How much does one need to get into Main or Findings? I am a bit new to this. Thanks a lot folks.   submitted by   /u/Anshuman3480 [link]   [comments]

12
r/MachineLearning community 13h ago

Spot/interruptible H100 and A100 pricing across RunPod, Vast.ai, and AWS - June 2026 data [D]

Following up on the on-demand comparison from a couple weeks back - pulled spot/ interruptible pricing this time since that's where the real savings conversation actually lives for anyone running checkpointed training or batch jobs. Checked: June 2026. Spot/interruptible tier,…

35
r/MachineLearning community 15h ago

How to "actually" network for jobs at ML conferences? [D]

Attending ICML for the first time (virtually) next week as a 3rd year PhD student in the US. I want to get into industry after finishing and have heard a lot about the benefits of networking at conferences to build industry connections. How do you actually go about doing this?…

30
r/MachineLearning community 16h ago

P Moth-Retrieval: Graph-Free Multi-Hop Retrieval via Query-Time Orchestration (Beating Graph-Based Systems on HotpotQA) [P]

We just open-sourced MOTHRAG, a multi-hop RAG framework that skips the knowledge graph entirely. We kept hitting the same wall building multi-hop RAG: the systems with the best accuracy (GraphRAG, HippoRAG, RAPTOR) all lean on a knowledge graph built offline, and that’s great…

27
r/MachineLearning community 17h ago

[D] Simple Questions Thread

Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the…

36
r/MachineLearning community 19h ago

On July 1, 2026, arXiv will spin out from Cornell University, its home for the past 25 years, to become an independent nonprofit organization. Major funding support from Simons Foundation and Schmidt Sciences. Ditching the red for their website. [N]

arXiv’s next chapter: Updates on our spin out from Cornell University: https://blog.arxiv.org/2026/06/30/arxivs-next-chapter/   submitted by   /u/Nunki08 [link]   [comments]

12
r/MachineLearning community 20h ago

ICML qr code visible [D]

Hi everyone, The check in QR code is visible at my profile despite that my card isn’t accepting the payment transaction. What does that even mean? Thanks!   submitted by   /u/misplacedlion [link]   [comments]

8
r/MachineLearning community 22h ago

A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]

Prompt injection has emerged as one of the most persistent failure modes in tool-using LLM systems, particularly in agentic workflows where models interact with external data sources. Most mitigation strategies focus on input filtering or model-side alignment, but these…

9
r/MachineLearning community 1d ago

Anyone looking into the new MARS2 Workshop/Competition @ ECCV 2026? I saw Tec-do posting it. [D]

I recently came across the announcement for the MARS2 Workshop (Multimodal Reasoning Competition) at ECCV 2026. From what I understand, it focuses on multimodal reasoning and test-time reasoning (“slow thinking”), especially applied to video and real-world scenarios like…

30
r/MachineLearning community 1d ago

[D] Monthly Who's Hiring and Who wants to be Hired?

For Job Postings please use this template Hiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for] For Those looking for jobs please use this template Want to be Hired: [Location], Salary…

16
r/MachineLearning community 1d ago

80TB+ of astronomy for the HDD-poor: crossmatch the Universe from your laptop [R]

Today is the day you (🫵!) get access to 80TB plus of data from over 30 astronomical surveys in one place. 4GB of RAM is enough even at Gaia Scale. Check out our writeup here: https://huggingface.co/blog/hugging-science/multimodal-universe-hats And a tutorial here…

6
r/MachineLearning community 1d ago

REAP: Automatic Curation of Coding Agent Benchmarks from Interactive Production Usage [R]

  submitted by   /u/julian88888888 [link]   [comments]

13
r/MachineLearning community 1d ago

How to improve a 5-class Diabetic Retinopathy model (APTOS 2019) – Mixed predictions across classes[P]

Hi everyone, I'm a final-year Computer Engineering student building a Flask-based AI Diabetic Retinopathy Detection system. The web application itself is complete with patient management, authentication, dashboard, PDF report generation, prediction history, and AI inference. The…

14
r/MachineLearning community 1d ago

Are all LLM research papers nowadays 100+ pages beasts?[D]

Was reading some research papers put out by Anthropic (and some other organizations/researchers) and one thing I've noticed is that these research papers consistently all share the same quality: Oftentimes over 100 pages of pure words, interspersed with screenshots of very…

25
r/MachineLearning community 1d ago

A map of the latest 11 million papers split by semantic similarity and time slices [P]

I have building alternative ways explore scientifc literature. The goal was to make the large number of papers published daily easier to keep up with by visualising the macro scopic trend. It is free to use at The Global Research Space for any one interested in giving it a try!…

26
r/MachineLearning community 1d ago

Update on CVIL: the free CV interview prep checklist after landing my internship... just added Segmentation, OCR, and VLM sections [D]

Hi everyone, Posted this a while back... a checklist I made while prepping for a CV internship (landed it, hence sharing). It's not a textbook, just a phase-by-phase map of what to actually study for CV/ML interviews: math → CNNs → ViTs → detection → tracking, plus…

15
r/MachineLearning community 1d ago

EACL 2027: Author response and author-reviewer discussion are now two separate stages and allow more time [D]

EACL 2027 just published their CFP which contains an important change to the common ARR process: For this cycle, author response and author-reviewer discussion are two separate stages Looking at the deadlines, they not only split the process but also allow more time: Author…

4
r/MachineLearning community 2d ago

Loss functions in Instance Representation Learning [R]

In Wu et. al , the MLE objective is computationally infeasible due to the high number of images in the dataset. Non-parametric Softmax Negative Log-Likelihood With large n, the denominator in (2) is hard to compute. Therefore, they use NCE (Noise-Contrastive Estimation). The NCE…

35
r/MachineLearning community 2d ago

Price elasticity model [R]

Need to build a ml model to find the price elasticity at the product group level first the given price and discount. What are features I need have and which model used in the industry for these type of use cases . I have used regression and random regression to predict the qty…

30
r/MachineLearning community 2d ago

Rejected MICCAI paper: workshop -> journal/conference or directly journal/conference [R]

Premise: this work is my first year PhD, and I dropped out for personal reasons. I still want to do research but independently. I have tried to submit my explainability paper to MICCAI. Sadly, for doubtful/good reasons, it got rejected. Among the reviewers, one explicitly…

4
r/MachineLearning community 2d ago

I built a demo agricultural planning system with an AI advisor for small-scale farmers in Nicaragua using NASA data [p]

(this was deleted before but i dont know if it was the filters of reddit or the moderators, if is the moderators i will not post it again after you delete it sorry.) (The name will probably change soon because I didn't realize "AgroVision" is already a registered trademark lol.)…

15
r/MachineLearning community 2d ago

I'm trying to implement CALM paper, and I have some questions. [P]

Hello, I'm trying to implement the Pocket TTS by kyutai-labs represented by this paper . Since they have didn't released the training/fine-tuning code. I'm trying to implement it on my own for learning some stuff. I have read the paper, tried to implement it with much more…

34
r/MachineLearning community 2d ago

Adaptive Mixture of Experts Gate (AMG) [R]

[Project] Post-hoc Adaptive MoE Gating on Qwen3.6-35B — empirical benchmarking of an open research gap Adaptive MoE routing — selecting a variable number of experts per token based on routing confidence — has been studied in papers (XMoE 2024, DynMoE ICLR 2025, TopP routing…

5
r/MachineLearning community 2d ago

I do historical swordfighting and noticed AI struggles to track it. I’m building an open dataset to help fix this. Does my schema make sense? [P]

Hi everyone, I’m a historical swordfighter (HEMA practitioner), and while I’m not a computer vision engineer or a roboticist, I’ve been reading a lot about the current bottlenecks in embodied AI, specifically around the Sim2Real gap and thin-object tracking. It occurred to me…

18
r/MachineLearning community 2d ago

Cerebras OpenAI deal capacity has effectively killed the waitlist for everyone else [D]

I’m pretty annoyed. We’re a small AI startup building a real-time coding agent. Our p95 latency requirements are tight (and self imposed, but thats the product). We need sustained high-throughput inference with ~1-2k tokens/second. Been on the Cerebras waitlist for months trying…

25
r/MachineLearning community 2d ago

EML Trees are Universal Approximators [R]

Hey! The EML function made the rounds recently on the internet as a “cool trick” that allows for the representation of all elementary functions through composition. As a mathematical curiosity, we prove a universal approximation theorem for EML(-type) trees. Intuitively, one…

11
r/MachineLearning community 2d ago

What do you think of Recursive Self Improvement ? [D]

There was a workshop in ICLR Recursive Self Improvement. Is this something worth pursing for a Phd topic? Webpage : https://recursive-workshop.github.io/   submitted by   /u/Successful_Bowl2564 [link]   [comments]

34
r/MachineLearning community 2d ago

Google's Agentic Peer-Reviewer Handled ~10K Papers at ICML/STOC — Formal Research Paper Now Out [R]

Google deployed an agentic AI peer-reviewer at two top CS conferences — reviewing ~10,000 papers with 30-minute turnaround — and the new formal research paper shows it catches 34% more mathematical errors than zero-shot prompting; the precedent for AI-automated scientific review…

23
r/MachineLearning community 2d ago

I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]

Link: https://ai-values.com/ There is a small 15 question quiz you can take before taking the full big quiz. The results of the big quiz update in realtime as you go so you dont have to actually go through all the questions (but they do get more fun in the personality section).…

27
r/MachineLearning community 3d ago

RAGless: Q-Q retrieval with score aggregation for closed-domain FAQ [P]

What it does RAGless is a semantic retrieval system based on Question-to-Question matching. At ingestion, an LLM generates multiple question variants per answer (3–5) and each variant gets its own embedding. At query time, the user question is embedded, Top-K nearest question…

23
r/MachineLearning community 3d ago

[D] Looking for people serious about ML, DL & DSA 🚀[D]

I recently started a Telegram community called The Daily Commit. The goal is simple: stay consistent and hold each other accountable. What we do: 🧠 Share what we learned every day. ❓ Discuss ML, DL & DSA doubts. 📚 Share quality resources. 🚀 Build projects together. 💪 Stay…

26
r/MachineLearning community 3d ago

ECCV 2026 Final Decisions after Provisional Acceptance [D]

Has anyone actually received final acceptance following their provisional acceptance email from ECCV 2026? I am very confused. Thank you so much.   submitted by   /u/Land_Heavy [link]   [comments]

15
r/MachineLearning community 3d ago

Double-Blind submission in single-blind tracks [D]

Hi everyone. First-time reviewer for data mining venues here. For the applied tracks in ICDM and KDD, the CFP states submissions should be single-blind, showing the author's name and affiliations. I received some submissions in double-blind (no author names and affiliations).…

9
r/MachineLearning community 3d ago

Evaluating long-term memory limits in stateless LLM chatbots — feedback needed [D]

Hi all, I’m working on a research project exploring how stateless LLM-based chatbots handle long conversations and whether important earlier information is still reliably retained over time. My idea is to: Run a chatbot using an LLM API without any external memory system…

28
r/MachineLearning community 3d ago

I shrank a transformer until every number fitted on the screen and made the weights editable [R]

I've been teaching myself how LLMs actually work, not at the API level, but down to the matrix multiplications. To force myself to really understand the forward pass, I first built a complete transformer by hand in a spreadsheet from embeddings through to the loss. Then I turned…

31
r/MachineLearning community 4d ago

NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]

Hello r/MachineLearning , I wanted to share the architecture and challenges behind a project I’ve been building called NagaTranslate . The goal is to build a translation and speech pipeline for the low-resource languages of Nagaland, India (currently supporting Nagamese, Ao, and…

30
r/MachineLearning community 4d ago

Do we still need to study algorithms now that AI writes most of our code? [D]

I've been thinking about this for a while. AI can now write functions, explain code, refactor projects, generate tests, and even solve many programming problems better than many junior developers. I've also noticed that Stack Overflow seems far less active than it used to be…

4
r/MachineLearning community 4d ago

Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]

When evaluating migrating production LLM workloads off commercial cloud APIs, the conversation usually gets oversimplified into a trade-off between quality and infrastructure cost. To look past clean, isolated averages, I built a repeatable evaluation matrix using a real-world…

29
r/MachineLearning community 4d ago

MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]

Repo link and results - https://github.com/Abhinand20/MathFormer Task: Given a factorized expression like (7-3*z)*(-5*z-9), predict the expanded form -> 15*z\*2-8\*z-63 Key takeaway: A tiny (4M param) seq2seq model trained with no math knowledge reaches ~98.6% accuracy on…

7
r/MachineLearning community 4d ago

Built an LLM training framework that actually runs on older GPUs without crashing [P]

Hey guys, I was playing around with Nanotron recently and got super frustrated by how many heavy, hardware-specific dependencies it imports at the module level ( flash-attn , triton, functorch , etc.). If you try to run it on older or budget GPUs like a T4 or V100, it just…

30
r/MachineLearning community 4d ago

Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]

Hey everyone, I'd like to share my project along with a short explanation of the process and why it came about in the first place. To start off, I'm not exactly the best at cryptography/steganography, in my case it's always been something that sat in the background, as one of…

12
r/MachineLearning community 5d ago

Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]

Hey all, a bit of background - I'm an ex Amateur MMA fighter and BJJ brown belt and am also in the AI/ML space ... weird combo but wanted to know if anyone else was at the intersection of ML/AI and MMA/BJJ. In short, I'm building AI models that "watch" fights and are able to…

20
r/MachineLearning community 5d ago

Kicking off GPU Mode [D]

Hey ! I’m starting a series to document my work on GPU infrastructure, LLMs, and CV. Stop #1 is up: A brief look at why GPUs are the center of the industry, the CPU/GPU divide, and why nvidia-smi is the first place you check when things break. We’ll move past the basics quickly…

27
r/MachineLearning community 5d ago

I silently break training codes or configs so I made pybench [P]

It is like pytest but for statistical tests: it ensures no regression of your metrics at a statistical level. It manages tedious things such that seeds, past benchmark results, ... Simple CLI working like pytest but with benchmarks/ directory instead of tests/: pybench # 1st…

38

SentryCode: Real-time Auditor + Honeytokens for AI Coding Agents [P]

[D] Self-Promotion Thread

Making Optimization Work When Labels Are Scarce [R]

Hamiltonian Neural Networks from a Differential Geometry Perspective [D]

New PyMuPDF release, supports Markdown [N]

How to describe a model that has higher accuracy with fewer #param and FLOPs? [D]

ACL ARR May 2026[D]

Spot/interruptible H100 and A100 pricing across RunPod, Vast.ai, and AWS - June 2026 data [D]

How to "actually" network for jobs at ML conferences? [D]

P Moth-Retrieval: Graph-Free Multi-Hop Retrieval via Query-Time Orchestration (Beating Graph-Based Systems on HotpotQA) [P]

[D] Simple Questions Thread

On July 1, 2026, arXiv will spin out from Cornell University, its home for the past 25 years, to become an independent nonprofit organization. Major funding support from Simons Foundation and Schmidt Sciences. Ditching the red for their website. [N]

ICML qr code visible [D]

A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]

Anyone looking into the new MARS2 Workshop/Competition @ ECCV 2026? I saw Tec-do posting it. [D]

[D] Monthly Who's Hiring and Who wants to be Hired?

80TB+ of astronomy for the HDD-poor: crossmatch the Universe from your laptop [R]

REAP: Automatic Curation of Coding Agent Benchmarks from Interactive Production Usage [R]

How to improve a 5-class Diabetic Retinopathy model (APTOS 2019) – Mixed predictions across classes[P]

Are all LLM research papers nowadays 100+ pages beasts?[D]

A map of the latest 11 million papers split by semantic similarity and time slices [P]

Update on CVIL: the free CV interview prep checklist after landing my internship... just added Segmentation, OCR, and VLM sections [D]

EACL 2027: Author response and author-reviewer discussion are now two separate stages and allow more time [D]

Loss functions in Instance Representation Learning [R]

Price elasticity model [R]

Rejected MICCAI paper: workshop -> journal/conference or directly journal/conference [R]

I built a demo agricultural planning system with an AI advisor for small-scale farmers in Nicaragua using NASA data [p]

I'm trying to implement CALM paper, and I have some questions. [P]

Adaptive Mixture of Experts Gate (AMG) [R]

I do historical swordfighting and noticed AI struggles to track it. I’m building an open dataset to help fix this. Does my schema make sense? [P]

Cerebras OpenAI deal capacity has effectively killed the waitlist for everyone else [D]

EML Trees are Universal Approximators [R]

What do you think of Recursive Self Improvement ? [D]

Google's Agentic Peer-Reviewer Handled ~10K Papers at ICML/STOC — Formal Research Paper Now Out [R]

I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]

RAGless: Q-Q retrieval with score aggregation for closed-domain FAQ [P]

[D] Looking for people serious about ML, DL & DSA 🚀[D]

ECCV 2026 Final Decisions after Provisional Acceptance [D]

Double-Blind submission in single-blind tracks [D]

Evaluating long-term memory limits in stateless LLM chatbots — feedback needed [D]

I shrank a transformer until every number fitted on the screen and made the weights editable [R]

NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]

Do we still need to study algorithms now that AI writes most of our code? [D]

Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]

MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]

Built an LLM training framework that actually runs on older GPUs without crashing [P]

Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]

Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]

Kicking off GPU Mode [D]

I silently break training codes or configs so I made pybench [P]