News / #developer-tool Tag Developer Tool 500 articles archived under #developer-tool · RSS Sign in to follow arXiv — Machine Learning research 1mo ago Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur… 13 arXiv — NLP / Computation & Language research 1mo ago BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking arXiv:2605.27380v1 Announce Type: new Abstract: Cross-lingual biomedical entity linking (BEL) maps mentions in any language to unique identifiers in a biomedical knowledge base (KB), supporting clinical and biomedical NLP applications. However, expert-annotated training data for… 32 arXiv — NLP / Computation & Language research 1mo ago StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation arXiv:2605.27393v1 Announce Type: new Abstract: Large language models (LLMs) can generate fluent dialogue, but prior works lack situational grounding, dynamic strategy control, and evaluation aligned with clinical standards in motivational interviewing (MI). We introduce… 7 arXiv — NLP / Computation & Language research 1mo ago Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. This gap is often explained as a failure to understand… 28 arXiv — NLP / Computation & Language research 1mo ago Challenges in Explaining Pretrained Clinical Text Classifiers arXiv:2605.28060v1 Announce Type: new Abstract: Explaining the predictions of neural models in clinical NLP remains a significant challenge, especially for complex tasks involving long, unstructured medical texts. While post-hoc methods like LIME and SHAP are widely used, they… 19 r/MachineLearning community 1mo ago noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P] If you've ever tried to pick an STT vendor for a phone-based voice agent or call center product, you've probably hit this wall: you have plenty of real production audio, but it's unlabeled, so you can't compute WER on it. And the annotated public datasets (FLEURS, CommonVoice,… 31 TechCrunch — AI news-outlet 1mo ago ClickHouse triples anualized revenue to $250M, charting a path toward an IPO The database provider is eyeing a public debut within the next few years. 8 TechCrunch — AI news-outlet 1mo ago ClickHouse triples annualized revenue to $250M, charting a path toward an IPO The database provider is eyeing a public debut within the next few years. 32 r/LocalLLaMA community 1mo ago AI is not for everyone This may be a controversial take, but AI is not for everyone. I've made a post here before about the vibecoded garbage I see on this subreddit every time I click on it but there seems to be a larger issue. AI isn't just a set and forget karma farm. You actually have to put work… 14 The Information — AI news-outlet 1mo ago Micron Passes $1 Trillion as AI Memory Demand Sends Shares Soaring Micron Technology crossed $1 trillion in market value for the first time Tuesday, as shares climbed 19% on rising demand for memory chips used in AI systems. It was Micron’s largest single-day gain since 2011. The rally came after UBS sharply raised its price target for Micron… 33 arXiv — Machine Learning research 1mo ago GEM: Geometric Entropy Mixing for Optimal LLM Data Curation arXiv:2605.26121v1 Announce Type: new Abstract: LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering… 27 arXiv — Machine Learning research 1mo ago On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series arXiv:2605.26194v1 Announce Type: new Abstract: Clinical time-series learning is routinely constrained by small, heterogeneous cohorts and protocol drift, while its downstream use spans both classification (e.g., pathology diagnosis) and regression (e.g., temporal forecasting).… 30 arXiv — Machine Learning research 1mo ago MuCon: Clipped Muon Updates for LLM Training arXiv:2605.26459v1 Announce Type: new Abstract: Muon-style optimizers take a matrix-valued momentum or preconditioned update $B = U \operatorname{diag}(\sigma_1,\ldots,\sigma_r) V^\top$ and replace it with its canonical partial polar factor $\operatorname{Pol}(B) = U V^\top$.… 31 arXiv — Machine Learning research 1mo ago Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts MoE architecture is highly promising for resource constrained on device deployments yet training these models from scratch incurs prohibitive costs Current methods attempt to alleviate this by upcycling dense… 32 arXiv — Machine Learning research 1mo ago Separate Aggregation of Split Network for Personalized Federated Learning arXiv:2605.26571v1 Announce Type: new Abstract: Federated learning enables collaborative model training without sharing raw data, but its performance can degrade substantially under heterogeneous client data distributions. A single global model often cannot satisfy diverse… 33 arXiv — Machine Learning research 1mo ago Image Feature Fusion-based Federated Client Unlearning (FCU) arXiv:2605.26715v1 Announce Type: new Abstract: Major data protection regulations all mention the "right to be forgotten," and that's what pushed federated unlearning (FU) techniques forward. But one stubborn issue remains: catastrophic forgetting--you erase the target… 9 arXiv — Machine Learning research 1mo ago Adversarial Training for Robust Coverage Network under Worst-case Facility Losses arXiv:2605.26763v1 Announce Type: new Abstract: The Maximal Covering Location-Interdiction Problem (MCLIP) is a classic bi-level optimization problem, which is fundamental to resilient infrastructure planning yet remains computationally intractable. Specifically, the upper level… 5 arXiv — Machine Learning research 1mo ago Ratio-Variance Regularized Policy Optimization arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. We demonstrate that… 29 arXiv — NLP / Computation & Language research 1mo ago The Daily Dose: Workflow-Integrated Large Language Model Automation for Clinical Summarization and Trial Identification in Radiation Oncology arXiv:2605.26346v1 Announce Type: new Abstract: Objective: To describe the design and early clinical evaluation of The Daily Dose (TDD), an LLM-driven, automated clinical summarization and clinical-trial identification system integrated into routine radiation oncology practice.… 7 arXiv — NLP / Computation & Language research 1mo ago Curation and Extraction of Drug-Related Entities from Reddit Platform arXiv:2605.26445v1 Announce Type: new Abstract: Physicians learn primarily about illicit drugs from clinical overdose cases, limiting their understanding of real-world usage. Meanwhile, drug users share first-hand experiences online, offering insights into dosage and effects of… 31 arXiv — NLP / Computation & Language research 1mo ago Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records arXiv:2605.26463v1 Announce Type: new Abstract: Data consistency between unstructured clinical notes and structured tables in Electronic Health Records (EHRs) is essential for patient safety and clinical decision-making. However, existing work on note-table consistency… 7 arXiv — NLP / Computation & Language research 1mo ago Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline arXiv:2605.26560v1 Announce Type: new Abstract: Objective. Outpatient notes carry follow-up instructions pairing actions with future times ("MRI brain in two weeks"). Extracting (action, date) pairs supports scheduling and audit, but generative extractors miss the date because… 19 Vercel — AI dev-tools 1mo ago Experimental native binaries for Vercel CLI The Vercel CLI now ships an optional experimental native binary that starts faster, is even more secure, and requires no Node.js runtime dependency. Binaries are code-signed, allowing your OS to verify that they came from Vercel and haven't been modified. Additionally, on macOS,… 30 r/LocalLLaMA community 1mo ago Turning local agents into self-optimizing agents I was experimenting with a self-optimizing agentic pipeline to climb the benchmark leaderboard (TerminalBench). On a 10-task subset, I got the performance to rise from ~30% → ~90%. That loop worked, so I asked: can the same reflect-and-rewrite step run continuously against… 17 Hugging Face Daily Papers research 1mo ago ECHO: Terminal Agents Learn World Models for Free Abstract Environment cross-entropy hybrid objective combines policy-gradient loss with auxiliary environment observation prediction to provide dense supervision from terminal feedback, improving agent performance and self-improvement capabilities. AI-generated summary CLI agents… 23 r/LocalLLaMA community 1mo ago Llamacpp server : How do the -np and -c flags interact? I've been using lm studio for a few months. I want to try hermes agents with Qwen 3.6 MoE, so I'm switching to llama.cpp and I don't understand well how the server slots -np and the context size -c interact. The context for each parallel client appears to be equally distributed… 10 arXiv — Machine Learning research 1mo ago Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis arXiv:2605.24162v1 Announce Type: new Abstract: Biological systems are governed by structured molecular interactions, where pathways, regulatory circuits, and functional gene relationships shape cellular behavior and disease progression. Much of this knowledge is naturally… 6 arXiv — Machine Learning research 1mo ago PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets arXiv:2605.24249v1 Announce Type: new Abstract: The growing availability of clinical data has increased the use of machine learning, yet centralized data aggregation is often infeasible for sensitive health information. Federated Learning (FL) offers a distributed alternative,… 19 arXiv — Machine Learning research 1mo ago Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence arXiv:2605.24261v1 Announce Type: new Abstract: A critical challenge facing clinicians managing chronic disease interventions is sustaining long-run patient health given limited information and resources. Digital therapeutics (DTs) provide a cost-effective way to manage… 31 arXiv — Machine Learning research 1mo ago Lake Detection and Water Quality Estimation in Sentinel-2 Data arXiv:2605.24515v1 Announce Type: new Abstract: With climate change and increasing human pressure on natural landscapes, inland water resources are becoming progressively scarcer, more vulnerable, and more difficult to manage sustainably. Reliable and automated methods for… 25 arXiv — Machine Learning research 1mo ago ECHO: Terminal Agents Learn World Models for Free arXiv:2605.24517v1 Announce Type: new Abstract: CLI agents are the closest thing language models have to an embodied setting: the model emits commands, the terminal executes them, and the returned stream -- stdout, errors, files, logs, and traces -- records the consequences. We… 25 arXiv — Machine Learning research 1mo ago Hardware-Aware Federated Learning for Speech Emotion Recognition arXiv:2605.24712v1 Announce Type: new Abstract: Federated learning (FL) enables privacy-preserving collaborative training across distributed edge devices, but real deployments involve heterogeneous clients with different processing power, memory capacity, and communication… 16 arXiv — NLP / Computation & Language research 1mo ago A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks arXiv:2605.23977v1 Announce Type: new Abstract: This paper audits benchmark evaluation in clinical-interview depression detection through four complementary probes across DAIC/E-DAIC, CMDC, ANDROIDS, MODMA, and PDCH. First, we re-evaluate E-DAIC under strict subject-disjoint… 27 arXiv — NLP / Computation & Language research 1mo ago When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation arXiv:2605.24902v1 Announce Type: new Abstract: Reasoning-enabled LLMs perform strongly on medical reasoning benchmarks, but it remains unclear whether these gains transfer to structured clinical documentation; we investigate this question using SOAP note generation from… 13 arXiv — NLP / Computation & Language research 1mo ago Overview of the PsyDefDetect Shared Task at BioNLP 2026: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations arXiv:2605.24907v1 Announce Type: new Abstract: We present an overview of PsyDefDetect, the shared task on detecting levels of psychological defense mechanisms in emotional support dialogues, co-located with BioNLP@ACL 2026. Grounded in the clinically validated Defense Mechanism… 20 arXiv — NLP / Computation & Language research 1mo ago TRACE: A taxonomy-grounded synthetic dataset for teaching-program generation and session interpretation in Applied Behavior Analysis arXiv:2605.25038v1 Announce Type: new Abstract: Applied Behavior Analysis (ABA) is a clinical discipline whose documentation, teaching programs and multi-session behavioral logs, is formulaic and high-volume, yet real session data is HIPAA-protected and bound by professional… 28 arXiv — NLP / Computation & Language research 1mo ago Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence arXiv:2605.25120v1 Announce Type: new Abstract: Radiology reports remain the primary mechanism by which imaging findings are communicated to clinical teams. However, much of the structured information behind these reports, including measurements, image evidence, prior… 8 Hugging Face Daily Papers research 1mo ago Geometry-Aware Image Flow Matching Abstract Geometry-aware generative models leveraging spherical manifolds and optimal transport techniques outperform traditional Euclidean approaches for natural image synthesis. AI-generated summary Recent advances in generative models highlight the power of geometry-aware… 29 Simon Willison community 1mo ago Notes on Pope Leo XIV's encyclical on AI Dropped this morning by the Vatican: Magnifica Humanitas of His Holiness Pope Leo XIV on Safeguarding the Human Person in the Time of Artificial Intelligence . This is a very interesting document. It's some of the clearest writing I've seen on the ethics of integrating AI into… 12 r/LocalLLaMA community 1mo ago AI content detector based on Qwen 0.8b fine-tuned on Pangram dataset I've fine-tuned Qwen 3.5 0.8B on the dataset provided by Pangram with their EditLens paper. It's available via a Chrome extension ; you can just click selected text and it's going to give you the probability distribution of how likely it is AI-generated. It takes under 1s on my… 36 r/MachineLearning community 1mo ago Is AI inference platform really that saturated now? [D] I’m thinking of expanding an on-device inference SDk into a full blown AI inference platform and seeing more and more inference platform popping out. Been talking with a VC from Seattle/NY. Is this space really that saturated?   submitted by   /u/kampak212 [link]  … 35 TechCrunch — AI news-outlet 1mo ago What ClickUp’s mass layoff tells us about the future of work The nine-year-old startup is replacing hundreds of employees with thousands of AI agents. 18 TechCrunch — AI news-outlet 1mo ago The pope’s AI encyclical isn’t really about AI Pope Leo XIV's first encyclical uses AI as a lens to diagnose older problems: concentrated power, eroding democracy, and a tech elite that shapes the world to its own advantage. 34 Hacker News — AI on Front Page community 1mo ago Magnifica Humanitas (Encyclical Letter) Article URL: https://www.vatican.va/content/leo-xiv/en/encyclicals/documents/20260515-magnifica-humanitas.html Comments URL: https://news.ycombinator.com/item?id=48265206 Points: 229 # Comments: 63 36 r/LocalLLaMA community 1mo ago We added W8A8 activation quantization to MLX — prefill went from 2.84s to 2.52s on M5 Pro Hey, I work on inference tooling at Mininglamp AI. We needed faster prefill for a 4B VLM running on Apple Silicon. Problem was MLX only does weight-only quant — activations stay FP16 the whole way through. So we wrote Cider, a small SDK that adds W8A8 activation quant on top of… 21 arXiv — Machine Learning research 1mo ago MedExpMem: Adapting Experience Memory for Differential Diagnosis arXiv:2605.22872v1 Announce Type: new Abstract: Experienced physicians develop diagnostic expertise through clinical practice, acquiring not only disease knowledge but also the ability to differentiate confusable conditions. Current medical vision-language models (VLMs) lack… 24 arXiv — Machine Learning research 1mo ago FederatedRSF : Federated Random Survival Forests for Partially Overlapping Medical Data arXiv:2605.22954v1 Announce Type: new Abstract: Multi-center survival prediction can improve robustness and generalizability, yet privacy regulations and institutional governance often prevent pooling patient-level clinical and genomic data across institutions. In practice,… 28 arXiv — Machine Learning research 1mo ago Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance arXiv:2605.23453v1 Announce Type: new Abstract: We conducted a reproducibility-oriented re-evaluation of prior migraine classification studies, correcting for data leakage and metric bias. We then introduced (i) a clinically motivated aggregation of two hemiplegic subtypes… 23 arXiv — NLP / Computation & Language research 1mo ago When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening arXiv:2605.23148v1 Announce Type: new Abstract: As demand for mental health care outpaces clinician-delivered assessment, scalable screening tools are increasingly needed. Large language models (LLMs) may identify psychiatric risk from patient narratives, but their reliability… 9 arXiv — NLP / Computation & Language research 1mo ago ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication arXiv:2605.23326v1 Announce Type: new Abstract: We present ClimateChat-300K, a large-scale dataset of 299,329 public Facebook posts about climate change collected between May 2020 and May 2024 through the CrowdTangle platform. The dataset contains 41 metadata features including… 5 Page 8 of 10 · 500 articles ← Newer Older →