r/MachineLearning
500 articles archived · Visit source ↗ · RSS
-
r/MachineLearning community 5h ago
[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link shorteners, link aggregator websites , or auto-subscribe links. -- Any abuse…
17 -
r/MachineLearning community 7h ago
Making Optimization Work When Labels Are Scarce [R]
https://www.gnosyslabs.com/case-studies/safety-classifier-sparse-labels Gnosys is an autonomous model engineer: it improves prompts and classifiers when ground truth is too sparse for conventional optimization. On ToxicChat, a public safety benchmark, under realistic label…
23 -
-
r/MachineLearning community 10h ago
New PyMuPDF release, supports Markdown [N]
https://pymupdf.io/blog/markdown-in-pymupdf-1-28 PyMuPDF 1.28 release, introduces Markdown as a first class document in PyMuPDF. Seems useful for a variety of workflows. You can create PDFs from Markdown text with control over appearance using CSS   submitted by  …
9 -
r/MachineLearning community 13h ago
ACL ARR May 2026[D]
Hi everyone. Do the ACL arr may 2026 reviews come out of July 2nd or do they come out on July 7 th?? How much does one need to get into Main or Findings? I am a bit new to this. Thanks a lot folks.   submitted by   /u/Anshuman3480 [link]   [comments]
12 -
r/MachineLearning community 15h ago
How to "actually" network for jobs at ML conferences? [D]
Attending ICML for the first time (virtually) next week as a 3rd year PhD student in the US. I want to get into industry after finishing and have heard a lot about the benefits of networking at conferences to build industry connections. How do you actually go about doing this?…
30 -
-
r/MachineLearning community 17h ago
[D] Simple Questions Thread
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the…
36 -
r/MachineLearning community 1d ago
[D] Monthly Who's Hiring and Who wants to be Hired?
For Job Postings please use this template Hiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for] For Those looking for jobs please use this template Want to be Hired: [Location], Salary…
16 -
r/MachineLearning community 1d ago
Are all LLM research papers nowadays 100+ pages beasts?[D]
Was reading some research papers put out by Anthropic (and some other organizations/researchers) and one thing I've noticed is that these research papers consistently all share the same quality: Oftentimes over 100 pages of pure words, interspersed with screenshots of very…
25 -
-
r/MachineLearning community 2d ago
Loss functions in Instance Representation Learning [R]
In Wu et. al , the MLE objective is computationally infeasible due to the high number of images in the dataset. Non-parametric Softmax Negative Log-Likelihood With large n, the denominator in (2) is hard to compute. Therefore, they use NCE (Noise-Contrastive Estimation). The NCE…
35 -
r/MachineLearning community 2d ago
Price elasticity model [R]
Need to build a ml model to find the price elasticity at the product group level first the given price and discount. What are features I need have and which model used in the industry for these type of use cases . I have used regression and random regression to predict the qty…
30 -
r/MachineLearning community 2d ago
I'm trying to implement CALM paper, and I have some questions. [P]
Hello, I'm trying to implement the Pocket TTS by kyutai-labs represented by this paper . Since they have didn't released the training/fine-tuning code. I'm trying to implement it on my own for learning some stuff. I have read the paper, tried to implement it with much more…
34 -
r/MachineLearning community 2d ago
Adaptive Mixture of Experts Gate (AMG) [R]
[Project] Post-hoc Adaptive MoE Gating on Qwen3.6-35B — empirical benchmarking of an open research gap Adaptive MoE routing — selecting a variable number of experts per token based on routing confidence — has been studied in papers (XMoE 2024, DynMoE ICLR 2025, TopP routing…
5 -
r/MachineLearning community 2d ago
EML Trees are Universal Approximators [R]
Hey! The EML function made the rounds recently on the internet as a “cool trick” that allows for the representation of all elementary functions through composition. As a mathematical curiosity, we prove a universal approximation theorem for EML(-type) trees. Intuitively, one…
11 -
r/MachineLearning community 2d ago
What do you think of Recursive Self Improvement ? [D]
There was a workshop in ICLR Recursive Self Improvement. Is this something worth pursing for a Phd topic? Webpage : https://recursive-workshop.github.io/   submitted by   /u/Successful_Bowl2564 [link]   [comments]
34 -
-
r/MachineLearning community 3d ago
[D] Looking for people serious about ML, DL & DSA 🚀[D]
I recently started a Telegram community called The Daily Commit. The goal is simple: stay consistent and hold each other accountable. What we do: 🧠 Share what we learned every day. ❓ Discuss ML, DL & DSA doubts. 📚 Share quality resources. 🚀 Build projects together. 💪 Stay…
26 -
r/MachineLearning community 3d ago
ECCV 2026 Final Decisions after Provisional Acceptance [D]
Has anyone actually received final acceptance following their provisional acceptance email from ECCV 2026? I am very confused. Thank you so much.   submitted by   /u/Land_Heavy [link]   [comments]
15 -
r/MachineLearning community 3d ago
Double-Blind submission in single-blind tracks [D]
Hi everyone. First-time reviewer for data mining venues here. For the applied tracks in ICDM and KDD, the CFP states submissions should be single-blind, showing the author's name and affiliations. I received some submissions in double-blind (no author names and affiliations).…
9 -
-
-
r/MachineLearning community 5d ago
Kicking off GPU Mode [D]
Hey ! I’m starting a series to document my work on GPU infrastructure, LLMs, and CV. Stop #1 is up: A brief look at why GPUs are the center of the industry, the CPU/GPU divide, and why nvidia-smi is the first place you check when things break. We’ll move past the basics quickly…
27 -
r/MachineLearning community 5d ago
I silently break training codes or configs so I made pybench [P]
It is like pytest but for statistical tests: it ensures no regression of your metrics at a statistical level. It manages tedious things such that seeds, past benchmark results, ... Simple CLI working like pytest but with benchmarks/ directory instead of tests/: pybench # 1st…
38