News / #developer-tool Tag Developer Tool 500 articles archived under #developer-tool · RSS Sign in to follow arXiv — Machine Learning research 24d ago The Identity Trap in EEG Foundation Models: A Diagnostic Audit arXiv:2606.06647v1 Announce Type: new Abstract: Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or… 38 arXiv — Machine Learning research 24d ago SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling arXiv:2606.06820v1 Announce Type: new Abstract: Agentic Large Language Model (LLM) systems decompose complex tasks into workflow Directed Acyclic Graphs (DAGs) whose primitives must be scheduled on heterogeneous clusters. Existing deep reinforcement learning (DRL) schedulers are… 26 arXiv — Machine Learning research 24d ago Constructing VAE Latent Spaces with Prescribed Topology arXiv:2606.07058v1 Announce Type: new Abstract: Variational autoencoders (VAEs) learn low-dimensional latent representations of high-dimensional data. When the data lies on a manifold with non-Euclidean topology, the standard Gaussian prior introduces a topological mismatch that… 17 arXiv — Machine Learning research 24d ago The discovery of the effects of women employment participation on the fertility of developing countries: A panel data approach arXiv:2606.07093v1 Announce Type: new Abstract: The fertility trend in developing countries has experienced a significant decline in the last few decades; at the same time, the role of women in the workplace has improved. To have a better insight of the causality of the rate of… 19 arXiv — Machine Learning research 24d ago REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference arXiv:2606.07141v1 Announce Type: new Abstract: Language models trained for clinical disease inference are trained on patient data, which may include sensitive and private information, and data owners may request the removal of their data from a trained model due to privacy or… 12 arXiv — Machine Learning research 24d ago A robust PPG foundation model using multimodal physiological supervision arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining… 8 arXiv — Machine Learning research 24d ago Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction arXiv:2606.06509v1 Announce Type: cross Abstract: Numerous medical imaging problems must be solved under limited labels and constrained compute, yet it remains unclear whether performance gains are driven mainly by more expressive models or by better representation of clinically… 17 arXiv — NLP / Computation & Language research 24d ago When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations arXiv:2606.07237v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in healthcare for tasks such as clinical question answering, diagnosis support, and report summarization. Despite their promise, these models remain highly sensitive to subtle… 17 arXiv — NLP / Computation & Language research 24d ago LLM-Guided Evolution for Medical Decision Pipelines arXiv:2606.07342v1 Announce Type: new Abstract: Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP-Elites evolution as an inference-time alternative for discovering… 9 arXiv — NLP / Computation & Language research 24d ago TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment arXiv:2606.07451v1 Announce Type: cross Abstract: Vision-language models such as CLIP are highly useful for diverse tasks due to their shared image-text embedding space. Despite this, the image and text embeddings are often poorly aligned, affecting downstream performance.… 6 arXiv — NLP / Computation & Language research 24d ago CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction arXiv:2508.03668v3 Announce Type: replace Abstract: Click-Through Rate (CTR) prediction, a core task in recommendation systems, estimates user click likelihood using historical behavioral data. Modeling user behavior sequences as text to leverage Language Models (LMs) for this… 5 r/LocalLLaMA community 25d ago Dockerized Nemotron 3.5 ASR — Switched from Parakeet, better multilingual support + streaming (4.5x realtime speed on cpu) I was originally using Parakeet for my speech recognition pipeline but decided to give Nemotron 3.5 a shot. After testing it on some multilingual audio clips, it's been working great so far. What sold me: - Better language support (40+ locales from one model) - Native streaming… 17 r/LocalLLaMA community 25d ago Another 1-click admin account takeover in pewdiepie's AI tool (language in video nsfw)   submitted by   /u/theonejvo [link]   [comments] 10 Hugging Face official-blog 26d ago Job Searcher Back to Articles Job Searcher Team Article Published June 6, 2026 Upvote 2 Emre emrekuruu build-small-hackathon Job hunting as a new grad is a full-time job by itself. You sift through hundreds of postings every week to find a handful worth applying to. You click "Easy Apply"… 15 Simon Willison community 26d ago micropython-wasm 0.1a2 Release: micropython-wasm 0.1a2 I added a CLI to micropython-wasm ( issue #7 ), inspired by the first draft of the blog entry when I realized it would be a great way to illustrate the Try it yourself section. Tags: python , sandboxing , webassembly , micropython 4 Anthropic SDK (Python) releases dev-tools 26d ago v0.106.0 0.106.0 (2026-06-05) Full Changelog: v0.105.2...v0.106.0 Features api: mark Claude Opus 4.1 as deprecated ( 85068cc ) Bug Fixes client: make Foundry client copy() and with_options() work ( 94146ac ) transform schema: preserve $defs when schema root is a $ref ( #1642 ) ( fc58e06… 19 Hugging Face Daily Papers research 26d ago BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding Abstract BRepCLIP enables multimodal representation learning for CAD models by aligning boundary representation geometry with language and image embeddings through contrastive pretraining, achieving superior retrieval and classification performance compared to point-based… 7 Hugging Face Daily Papers research 26d ago MAOAM: Unified Object and Material Selection with Vision-Language Models Abstract A unified vision-language model framework enables precise object and material selection through text or click interactions, supporting diverse editing workflows with improved robustness. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Selection is a core operation in… 11 llama.cpp releases dev-tools 27d ago b9530 cli: fix model params not propagated ( #23893 ) Fixes #23847 macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu… 21 arXiv — Machine Learning research 27d ago DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum arXiv:2606.05435v1 Announce Type: new Abstract: Differentially private stochastic gradient descent (DP-SGD) has become the standard framework for privacy-preserving machine learning, yet its reliance on a fixed gradient clipping threshold to limit sensitivity remains a… 12 arXiv — Machine Learning research 27d ago Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping arXiv:2606.05731v1 Announce Type: new Abstract: In-season crop type mapping is critical for food security in the face of increasingly extreme climate-related threats to crops. Currently, the USDA Cropland Data Layer provides crop type labels at 30m resolution and is available… 8 arXiv — Machine Learning research 27d ago HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care arXiv:2606.05994v1 Announce Type: new Abstract: Medical knowledge graphs (MKGs) infused with clinical knowledge have been increasingly used to model electronic health records (EHRs) to support interpretable predictions in healthcare domain. However, existing MKG-based approaches… 31 arXiv — Machine Learning research 27d ago Your GFlowNet Secretly Learns an Optimal Transport Plan arXiv:2606.06272v1 Announce Type: new Abstract: Generative Flow Networks (GFlowNets) are a framework for sampling structured objects via stochastic trajectories in a directed graph. In this work, we establish a theoretical connection between non-acyclic GFlowNets and optimal… 5 arXiv — NLP / Computation & Language research 27d ago InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization arXiv:2606.05561v1 Announce Type: new Abstract: Speech-based mental health screening offers scalable depression detection, yet clinical deployment faces a significant barrier: users' privacy concerns about demographic information exposure. Current techniques struggle to resolve… 34 arXiv — NLP / Computation & Language research 27d ago Measuring the sensitivity of LLM-based structured extraction to prompt, model, and schema choices in clinical discharge summaries arXiv:2606.05970v1 Announce Type: new Abstract: Large language models are increasingly used for structured extraction from clinical free-text notes, but the sensitivity of their output to upstream configuration choices is less understood than their accuracy on fixed benchmarks.… 23 Hacker News — AI on Front Page community 27d ago Open Code Review – An AI-powered code review CLI tool Article URL: https://github.com/alibaba/open-code-review Comments URL: https://news.ycombinator.com/item?id=48406358 Points: 233 # Comments: 66 32 Hugging Face Daily Papers research 28d ago Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases Abstract MedSP1000 introduces an interactive benchmark derived from standardized patients to evaluate clinical agents' dynamic performance across encounters, revealing limitations of current large language models in medical applications. Generated by… 18 llama.cpp releases dev-tools 28d ago b9503 fix(mtmd): handle Gemma 4 audio projector embedding size ( #24091 ) mtmd: handle Gemma 4 audio projector embedding size rm projection_dim from clip_n_mmproj_embd Co-authored-by: Xuan Son Nguyen son@huggingface.co macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64,… 28 arXiv — Machine Learning research 28d ago Early Detection of Alzheimer's Disease Using Explainable Machine Learning on Clinical Biomarkers: A Multi-Class Classification Study Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) Dataset arXiv:2606.03995v1 Announce Type: new Abstract: Background: Alzheimer's disease (AD) affects over 55 million people worldwide. Accurate, interpretable detection of normal cognition (NC), mild cognitive impairment (MCI), and AD from routine clinical assessments remains a critical… 14 arXiv — Machine Learning research 28d ago KODA: Contrastive Representation Comparison and Alignment for Vision-Language Foundation Models arXiv:2606.04180v1 Announce Type: new Abstract: Vision-language foundation models such as CLIP and SigLIP provide widely used representations for multimodal learning systems. While these models are typically compared through downstream performance, such evaluations often do not… 8 arXiv — NLP / Computation & Language research 28d ago When Clients Stop Following: A Cognitive Conceptualization Diagram-driven Framework for Strategic Counseling arXiv:2606.04389v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise in psychological counseling, yet existing benchmarks rely heavily on highly cooperative simulated clients. We observe a critical counselor-following phenomenon: these clients often rapidly… 14 arXiv — NLP / Computation & Language research 28d ago Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases arXiv:2606.05112v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly proposed as clinical agents, yet static, single-turn benchmarks cannot capture how a model dynamically delivers care across an encounter: gathering information, planning treatment, and… 32 Hugging Face official-blog 28d ago Designing the hf CLI as an agent-optimized way to work with the Hub Back to Articles Designing the hf CLI as an agent-optimized way to work with the Hub Published June 4, 2026 Update on GitHub Upvote 4 Célina Hanouti celinah Lucain Pouget Wauplin hf is the official command-line entrypoint to the Hugging Face Hub. Anything you can do on the Hub… 12 Ollama releases dev-tools 28d ago v0.30.4-rc1: llama-server: fix gemma4 patch wiring (#16477) This will fix the "clip.cpp:4399: Unknown projector type" crash. 4 Ollama releases dev-tools 28d ago v0.30.4: llama-server: fix gemma4 patch wiring (#16477) This will fix the "clip.cpp:4399: Unknown projector type" crash. 38 r/LocalLLaMA community 28d ago How to use audio and vision modalities in llama.cpp? How to use audio and vision modalities in llama.cpp with Gemma4 12B it? I’m on release b9494, but when I run llama-cli it shows “modalities: text” only, and crashes if I try to add an image.   submitted by   /u/No-Leave-4512 [link]   [comments] 20 Hugging Face Daily Papers research 29d ago KletterMix: Climbing Toward High-Quality German Pretraining Data Abstract A high-quality German-language corpus for language model pretraining is introduced through careful translation of an English corpus while preserving document structure and metadata, demonstrating improved downstream performance in German-language tasks. Generated by… 28 Hacker News — AI on Front Page community 29d ago Mouseless – keyboard-driven control of macOS/Linux/Windows Article URL: https://mouseless.click Comments URL: https://news.ycombinator.com/item?id=48383667 Points: 223 # Comments: 107 38 arXiv — Machine Learning research 29d ago Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation arXiv:2606.02604v1 Announce Type: new Abstract: ESG and climate risk data remain fragmented across heterogeneous Scope 1, Scope 2, and Scope 3 reporting environments, while conventional validation pipelines lack provenance aware auditability, hidden drift detection, and… 9 arXiv — Machine Learning research 29d ago DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data arXiv:2606.03209v1 Announce Type: new Abstract: Fine-tuning large language models (LLMs) in privacy-sensitive and resource-constrained environments remains challenging. Since training data are often distributed across multiple clients, decentralized fine-tuning offers a natural… 15 arXiv — Machine Learning research 29d ago Learning Temporal Causal Structure via Smooth Differentiable Optimization arXiv:2606.03227v1 Announce Type: new Abstract: Causal discovery with instantaneous effects in multivariate time series is challenging, as the instantaneous structure must be acyclic. Prior methods enforce this by either separating instantaneous and lagged estimation into… 4 arXiv — Machine Learning research 29d ago Multi-Modal Graph Neural Network with Transformer-Guided Adaptive Diffusion for Preclinical Alzheimer Classification arXiv:2606.03322v1 Announce Type: new Abstract: The graphical representation of the brain offers critical insights into diagnosing and prognosing neurodegenerative disease via relationships between regions of interest (ROIs). Despite recent emergence of various Graph Neural… 14 arXiv — NLP / Computation & Language research 29d ago AI Rater Discrimination Depends on Scoring Protocol in Complex Clinical Decision-Making arXiv:2606.03198v1 Announce Type: new Abstract: Clinical AI evaluation increasingly delegates scoring to large language models (LLMs) acting as AI raters, yet their scoring behavior across evaluation conditions has not been quantitatively characterized. We address this gap… 17 arXiv — NLP / Computation & Language research 29d ago The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP arXiv:2606.03250v1 Announce Type: new Abstract: Digital healthcare generates vast amounts of clinical text that can support AI-assisted applications, yet German biomedical language models remain limited by older architectures or restricted training data. We present ChristBERT… 33 arXiv — NLP / Computation & Language research 29d ago SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series arXiv:2606.03301v1 Announce Type: new Abstract: We introduce SagaQA, a long-form video benchmark for multi-hop reasoning over full-length TV series. Existing video reasoning benchmarks often emphasize local understanding of adjacent frames or clips. SagaQA addresses this gap by… 33 arXiv — NLP / Computation & Language research 29d ago Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models arXiv:2606.03399v1 Announce Type: new Abstract: While large language models (LLMs) are increasingly used for clinical applications, many existing pipelines require sending raw sensitive health information to remote servers for processing, which heightens the risk of privacy… 4 arXiv — NLP / Computation & Language research 29d ago Does Language Shift Break Medical Vision-Language Models? Indonesian Radiology Visual Question Answering Case Study arXiv:2606.03693v1 Announce Type: new Abstract: Medical Vision-Language Models (VLMs) are typically evaluated on English radiology visual question answering benchmarks, leaving their robustness under non-English clinical language largely unexplored. We introduce IndoRad-VQA, an… 10 arXiv — NLP / Computation & Language research 29d ago KletterMix: Climbing Toward High-Quality German Pretraining Data arXiv:2606.03773v1 Announce Type: new Abstract: High-quality pretraining data is a central ingredient in modern language models, but German-language resources remain far less developed than their English counterparts: they are often smaller, less carefully curated, weakly… 19 Hacker News — AI on Front Page community 29d ago MAI-Code-1-Flash https://microsoft.ai/models/mai-code-1-flash/ https://microsoft.ai/pdf/MAI-Code-1-Flash-Model-Card.PDF Launching seven new MAI models: https://microsoft.ai/news/building-a-hillclimbing-machine-la... Comments URL: https://news.ycombinator.com/item?id=48374466 Points: 228 #… 36 Hacker News — AI on Front Page community 1mo ago 1-Click GitHub Token Stealing via a VSCode Bug Article URL: https://blog.ammaraskar.com/github-token-stealing/ Comments URL: https://news.ycombinator.com/item?id=48371562 Points: 220 # Comments: 30 4 Page 6 of 10 · 500 articles ← Newer Older →