Tag

Developer Tool

500 articles archived under #developer-tool · RSS

arXiv — Machine Learning research 1mo ago

Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee

arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur…

13
arXiv — NLP / Computation & Language research 1mo ago

BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking

arXiv:2605.27380v1 Announce Type: new Abstract: Cross-lingual biomedical entity linking (BEL) maps mentions in any language to unique identifiers in a biomedical knowledge base (KB), supporting clinical and biomedical NLP applications. However, expert-annotated training data for…

32
arXiv — NLP / Computation & Language research 1mo ago

StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation

arXiv:2605.27393v1 Announce Type: new Abstract: Large language models (LLMs) can generate fluent dialogue, but prior works lack situational grounding, dynamic strategy control, and evaluation aligned with clinical standards in motivational interviewing (MI). We introduce…

7
arXiv — NLP / Computation & Language research 1mo ago

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. This gap is often explained as a failure to understand…

28
arXiv — NLP / Computation & Language research 1mo ago

Challenges in Explaining Pretrained Clinical Text Classifiers

arXiv:2605.28060v1 Announce Type: new Abstract: Explaining the predictions of neural models in clinical NLP remains a significant challenge, especially for complex tasks involving long, unstructured medical texts. While post-hoc methods like LIME and SHAP are widely used, they…

19
r/MachineLearning community 1mo ago

noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]

If you've ever tried to pick an STT vendor for a phone-based voice agent or call center product, you've probably hit this wall: you have plenty of real production audio, but it's unlabeled, so you can't compute WER on it. And the annotated public datasets (FLEURS, CommonVoice,…

31
TechCrunch — AI news-outlet 1mo ago

ClickHouse triples anualized revenue to $250M, charting a path toward an IPO

The database provider is eyeing a public debut within the next few years.

8
TechCrunch — AI news-outlet 1mo ago

ClickHouse triples annualized revenue to $250M, charting a path toward an IPO

The database provider is eyeing a public debut within the next few years.

32
r/LocalLLaMA community 1mo ago

AI is not for everyone

This may be a controversial take, but AI is not for everyone. I've made a post here before about the vibecoded garbage I see on this subreddit every time I click on it but there seems to be a larger issue. AI isn't just a set and forget karma farm. You actually have to put work…

14
The Information — AI news-outlet 1mo ago

Micron Passes $1 Trillion as AI Memory Demand Sends Shares Soaring

Micron Technology crossed $1 trillion in market value for the first time Tuesday, as shares climbed 19% on rising demand for memory chips used in AI systems. It was Micron’s largest single-day gain since 2011. The rally came after UBS sharply raised its price target for Micron…

33
arXiv — Machine Learning research 1mo ago

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

arXiv:2605.26121v1 Announce Type: new Abstract: LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering…

27
arXiv — Machine Learning research 1mo ago

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

arXiv:2605.26194v1 Announce Type: new Abstract: Clinical time-series learning is routinely constrained by small, heterogeneous cohorts and protocol drift, while its downstream use spans both classification (e.g., pathology diagnosis) and regression (e.g., temporal forecasting).…

30
arXiv — Machine Learning research 1mo ago

MuCon: Clipped Muon Updates for LLM Training

arXiv:2605.26459v1 Announce Type: new Abstract: Muon-style optimizers take a matrix-valued momentum or preconditioned update $B = U \operatorname{diag}(\sigma_1,\ldots,\sigma_r) V^\top$ and replace it with its canonical partial polar factor $\operatorname{Pol}(B) = U V^\top$.…

31
arXiv — Machine Learning research 1mo ago

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts MoE architecture is highly promising for resource constrained on device deployments yet training these models from scratch incurs prohibitive costs Current methods attempt to alleviate this by upcycling dense…

32
arXiv — Machine Learning research 1mo ago

Separate Aggregation of Split Network for Personalized Federated Learning

arXiv:2605.26571v1 Announce Type: new Abstract: Federated learning enables collaborative model training without sharing raw data, but its performance can degrade substantially under heterogeneous client data distributions. A single global model often cannot satisfy diverse…

33
arXiv — Machine Learning research 1mo ago

Image Feature Fusion-based Federated Client Unlearning (FCU)

arXiv:2605.26715v1 Announce Type: new Abstract: Major data protection regulations all mention the "right to be forgotten," and that's what pushed federated unlearning (FU) techniques forward. But one stubborn issue remains: catastrophic forgetting--you erase the target…

9
arXiv — Machine Learning research 1mo ago

Adversarial Training for Robust Coverage Network under Worst-case Facility Losses

arXiv:2605.26763v1 Announce Type: new Abstract: The Maximal Covering Location-Interdiction Problem (MCLIP) is a classic bi-level optimization problem, which is fundamental to resilient infrastructure planning yet remains computationally intractable. Specifically, the upper level…

5
arXiv — Machine Learning research 1mo ago

Ratio-Variance Regularized Policy Optimization

arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. We demonstrate that…

29
arXiv — NLP / Computation & Language research 1mo ago

The Daily Dose: Workflow-Integrated Large Language Model Automation for Clinical Summarization and Trial Identification in Radiation Oncology

arXiv:2605.26346v1 Announce Type: new Abstract: Objective: To describe the design and early clinical evaluation of The Daily Dose (TDD), an LLM-driven, automated clinical summarization and clinical-trial identification system integrated into routine radiation oncology practice.…

7
arXiv — NLP / Computation & Language research 1mo ago

Curation and Extraction of Drug-Related Entities from Reddit Platform

arXiv:2605.26445v1 Announce Type: new Abstract: Physicians learn primarily about illicit drugs from clinical overdose cases, limiting their understanding of real-world usage. Meanwhile, drug users share first-hand experiences online, offering insights into dosage and effects of…

31
arXiv — NLP / Computation & Language research 1mo ago

Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records

arXiv:2605.26463v1 Announce Type: new Abstract: Data consistency between unstructured clinical notes and structured tables in Electronic Health Records (EHRs) is essential for patient safety and clinical decision-making. However, existing work on note-table consistency…

7
arXiv — NLP / Computation & Language research 1mo ago

Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline

arXiv:2605.26560v1 Announce Type: new Abstract: Objective. Outpatient notes carry follow-up instructions pairing actions with future times ("MRI brain in two weeks"). Extracting (action, date) pairs supports scheduling and audit, but generative extractors miss the date because…

19
Vercel — AI dev-tools 1mo ago

Experimental native binaries for Vercel CLI

The Vercel CLI now ships an optional experimental native binary that starts faster, is even more secure, and requires no Node.js runtime dependency. Binaries are code-signed, allowing your OS to verify that they came from Vercel and haven't been modified. Additionally, on macOS,…

30
r/LocalLLaMA community 1mo ago

Turning local agents into self-optimizing agents

I was experimenting with a self-optimizing agentic pipeline to climb the benchmark leaderboard (TerminalBench). On a 10-task subset, I got the performance to rise from ~30% → ~90%. That loop worked, so I asked: can the same reflect-and-rewrite step run continuously against…

17
Hugging Face Daily Papers research 1mo ago

ECHO: Terminal Agents Learn World Models for Free

Abstract Environment cross-entropy hybrid objective combines policy-gradient loss with auxiliary environment observation prediction to provide dense supervision from terminal feedback, improving agent performance and self-improvement capabilities. AI-generated summary CLI agents…

23
r/LocalLLaMA community 1mo ago

Llamacpp server : How do the -np and -c flags interact?

I've been using lm studio for a few months. I want to try hermes agents with Qwen 3.6 MoE, so I'm switching to llama.cpp and I don't understand well how the server slots -np and the context size -c interact. The context for each parallel client appears to be equally distributed…

10
arXiv — Machine Learning research 1mo ago

Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis

arXiv:2605.24162v1 Announce Type: new Abstract: Biological systems are governed by structured molecular interactions, where pathways, regulatory circuits, and functional gene relationships shape cellular behavior and disease progression. Much of this knowledge is naturally…

6
arXiv — Machine Learning research 1mo ago

PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets

arXiv:2605.24249v1 Announce Type: new Abstract: The growing availability of clinical data has increased the use of machine learning, yet centralized data aggregation is often infeasible for sensitive health information. Federated Learning (FL) offers a distributed alternative,…

19
arXiv — Machine Learning research 1mo ago

Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence

arXiv:2605.24261v1 Announce Type: new Abstract: A critical challenge facing clinicians managing chronic disease interventions is sustaining long-run patient health given limited information and resources. Digital therapeutics (DTs) provide a cost-effective way to manage…

31
arXiv — Machine Learning research 1mo ago

Lake Detection and Water Quality Estimation in Sentinel-2 Data

arXiv:2605.24515v1 Announce Type: new Abstract: With climate change and increasing human pressure on natural landscapes, inland water resources are becoming progressively scarcer, more vulnerable, and more difficult to manage sustainably. Reliable and automated methods for…

25
arXiv — Machine Learning research 1mo ago

ECHO: Terminal Agents Learn World Models for Free

arXiv:2605.24517v1 Announce Type: new Abstract: CLI agents are the closest thing language models have to an embodied setting: the model emits commands, the terminal executes them, and the returned stream -- stdout, errors, files, logs, and traces -- records the consequences. We…

25
arXiv — Machine Learning research 1mo ago

Hardware-Aware Federated Learning for Speech Emotion Recognition

arXiv:2605.24712v1 Announce Type: new Abstract: Federated learning (FL) enables privacy-preserving collaborative training across distributed edge devices, but real deployments involve heterogeneous clients with different processing power, memory capacity, and communication…

16
arXiv — NLP / Computation & Language research 1mo ago

A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks

arXiv:2605.23977v1 Announce Type: new Abstract: This paper audits benchmark evaluation in clinical-interview depression detection through four complementary probes across DAIC/E-DAIC, CMDC, ANDROIDS, MODMA, and PDCH. First, we re-evaluate E-DAIC under strict subject-disjoint…

27
arXiv — NLP / Computation & Language research 1mo ago

When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation

arXiv:2605.24902v1 Announce Type: new Abstract: Reasoning-enabled LLMs perform strongly on medical reasoning benchmarks, but it remains unclear whether these gains transfer to structured clinical documentation; we investigate this question using SOAP note generation from…

13
arXiv — NLP / Computation & Language research 1mo ago

Overview of the PsyDefDetect Shared Task at BioNLP 2026: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations

arXiv:2605.24907v1 Announce Type: new Abstract: We present an overview of PsyDefDetect, the shared task on detecting levels of psychological defense mechanisms in emotional support dialogues, co-located with BioNLP@ACL 2026. Grounded in the clinically validated Defense Mechanism…

20
arXiv — NLP / Computation & Language research 1mo ago

TRACE: A taxonomy-grounded synthetic dataset for teaching-program generation and session interpretation in Applied Behavior Analysis

arXiv:2605.25038v1 Announce Type: new Abstract: Applied Behavior Analysis (ABA) is a clinical discipline whose documentation, teaching programs and multi-session behavioral logs, is formulaic and high-volume, yet real session data is HIPAA-protected and bound by professional…

28
arXiv — NLP / Computation & Language research 1mo ago

Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence

arXiv:2605.25120v1 Announce Type: new Abstract: Radiology reports remain the primary mechanism by which imaging findings are communicated to clinical teams. However, much of the structured information behind these reports, including measurements, image evidence, prior…

8
Hugging Face Daily Papers research 1mo ago

Geometry-Aware Image Flow Matching

Abstract Geometry-aware generative models leveraging spherical manifolds and optimal transport techniques outperform traditional Euclidean approaches for natural image synthesis. AI-generated summary Recent advances in generative models highlight the power of geometry-aware…

29
Simon Willison community 1mo ago

Notes on Pope Leo XIV's encyclical on AI

Dropped this morning by the Vatican: Magnifica Humanitas of His Holiness Pope Leo XIV on Safeguarding the Human Person in the Time of Artificial Intelligence . This is a very interesting document. It's some of the clearest writing I've seen on the ethics of integrating AI into…

12
r/LocalLLaMA community 1mo ago

AI content detector based on Qwen 0.8b fine-tuned on Pangram dataset

I've fine-tuned Qwen 3.5 0.8B on the dataset provided by Pangram with their EditLens paper. It's available via a Chrome extension ; you can just click selected text and it's going to give you the probability distribution of how likely it is AI-generated. It takes under 1s on my…

36
r/MachineLearning community 1mo ago

Is AI inference platform really that saturated now? [D]

I’m thinking of expanding an on-device inference SDk into a full blown AI inference platform and seeing more and more inference platform popping out. Been talking with a VC from Seattle/NY. Is this space really that saturated?   submitted by   /u/kampak212 [link]  …

35
TechCrunch — AI news-outlet 1mo ago

What ClickUp’s mass layoff tells us about the future of work

The nine-year-old startup is replacing hundreds of employees with thousands of AI agents.

18
TechCrunch — AI news-outlet 1mo ago

The pope’s AI encyclical isn’t really about AI

Pope Leo XIV's first encyclical uses AI as a lens to diagnose older problems: concentrated power, eroding democracy, and a tech elite that shapes the world to its own advantage.

34
Hacker News — AI on Front Page community 1mo ago

Magnifica Humanitas (Encyclical Letter)

Article URL: https://www.vatican.va/content/leo-xiv/en/encyclicals/documents/20260515-magnifica-humanitas.html Comments URL: https://news.ycombinator.com/item?id=48265206 Points: 229 # Comments: 63

36
r/LocalLLaMA community 1mo ago

We added W8A8 activation quantization to MLX — prefill went from 2.84s to 2.52s on M5 Pro

Hey, I work on inference tooling at Mininglamp AI. We needed faster prefill for a 4B VLM running on Apple Silicon. Problem was MLX only does weight-only quant — activations stay FP16 the whole way through. So we wrote Cider, a small SDK that adds W8A8 activation quant on top of…

21
arXiv — Machine Learning research 1mo ago

MedExpMem: Adapting Experience Memory for Differential Diagnosis

arXiv:2605.22872v1 Announce Type: new Abstract: Experienced physicians develop diagnostic expertise through clinical practice, acquiring not only disease knowledge but also the ability to differentiate confusable conditions. Current medical vision-language models (VLMs) lack…

24
arXiv — Machine Learning research 1mo ago

FederatedRSF : Federated Random Survival Forests for Partially Overlapping Medical Data

arXiv:2605.22954v1 Announce Type: new Abstract: Multi-center survival prediction can improve robustness and generalizability, yet privacy regulations and institutional governance often prevent pooling patient-level clinical and genomic data across institutions. In practice,…

28
arXiv — Machine Learning research 1mo ago

Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance

arXiv:2605.23453v1 Announce Type: new Abstract: We conducted a reproducibility-oriented re-evaluation of prior migraine classification studies, correcting for data leakage and metric bias. We then introduced (i) a clinically motivated aggregation of two hemiplegic subtypes…

23
arXiv — NLP / Computation & Language research 1mo ago

When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening

arXiv:2605.23148v1 Announce Type: new Abstract: As demand for mental health care outpaces clinician-delivered assessment, scalable screening tools are increasingly needed. Large language models (LLMs) may identify psychiatric risk from patient narratives, but their reliability…

9
arXiv — NLP / Computation & Language research 1mo ago

ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication

arXiv:2605.23326v1 Announce Type: new Abstract: We present ClimateChat-300K, a large-scale dataset of 299,329 public Facebook posts about climate change collected between May 2020 and May 2024 through the CrowdTangle platform. The dataset contains 41 metadata features including…

5

Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee

BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking

StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

Challenges in Explaining Pretrained Clinical Text Classifiers

noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]

ClickHouse triples anualized revenue to $250M, charting a path toward an IPO

ClickHouse triples annualized revenue to $250M, charting a path toward an IPO

AI is not for everyone

Micron Passes $1 Trillion as AI Memory Demand Sends Shares Soaring

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

MuCon: Clipped Muon Updates for LLM Training

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

Separate Aggregation of Split Network for Personalized Federated Learning

Image Feature Fusion-based Federated Client Unlearning (FCU)

Adversarial Training for Robust Coverage Network under Worst-case Facility Losses

Ratio-Variance Regularized Policy Optimization

The Daily Dose: Workflow-Integrated Large Language Model Automation for Clinical Summarization and Trial Identification in Radiation Oncology

Curation and Extraction of Drug-Related Entities from Reddit Platform

Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records

Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline

Experimental native binaries for Vercel CLI

Turning local agents into self-optimizing agents

ECHO: Terminal Agents Learn World Models for Free

Llamacpp server : How do the -np and -c flags interact?

Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis

PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets

Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence

Lake Detection and Water Quality Estimation in Sentinel-2 Data

ECHO: Terminal Agents Learn World Models for Free

Hardware-Aware Federated Learning for Speech Emotion Recognition

A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks

When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation

Overview of the PsyDefDetect Shared Task at BioNLP 2026: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations

TRACE: A taxonomy-grounded synthetic dataset for teaching-program generation and session interpretation in Applied Behavior Analysis

Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence

Geometry-Aware Image Flow Matching

Notes on Pope Leo XIV's encyclical on AI

AI content detector based on Qwen 0.8b fine-tuned on Pangram dataset

Is AI inference platform really that saturated now? [D]

What ClickUp&#8217;s mass layoff tells us about the future of work

The pope’s AI encyclical isn’t really about AI

Magnifica Humanitas (Encyclical Letter)

We added W8A8 activation quantization to MLX — prefill went from 2.84s to 2.52s on M5 Pro

MedExpMem: Adapting Experience Memory for Differential Diagnosis

FederatedRSF : Federated Random Survival Forests for Partially Overlapping Medical Data

Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance

When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening

ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication

What ClickUp’s mass layoff tells us about the future of work