Tag

Developer Tool

500 articles archived under #developer-tool · RSS

arXiv — Machine Learning research 24d ago

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

arXiv:2606.06647v1 Announce Type: new Abstract: Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or…

38
arXiv — Machine Learning research 24d ago

SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling

arXiv:2606.06820v1 Announce Type: new Abstract: Agentic Large Language Model (LLM) systems decompose complex tasks into workflow Directed Acyclic Graphs (DAGs) whose primitives must be scheduled on heterogeneous clusters. Existing deep reinforcement learning (DRL) schedulers are…

26
arXiv — Machine Learning research 24d ago

Constructing VAE Latent Spaces with Prescribed Topology

arXiv:2606.07058v1 Announce Type: new Abstract: Variational autoencoders (VAEs) learn low-dimensional latent representations of high-dimensional data. When the data lies on a manifold with non-Euclidean topology, the standard Gaussian prior introduces a topological mismatch that…

17
arXiv — Machine Learning research 24d ago

The discovery of the effects of women employment participation on the fertility of developing countries: A panel data approach

arXiv:2606.07093v1 Announce Type: new Abstract: The fertility trend in developing countries has experienced a significant decline in the last few decades; at the same time, the role of women in the workplace has improved. To have a better insight of the causality of the rate of…

19
arXiv — Machine Learning research 24d ago

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

arXiv:2606.07141v1 Announce Type: new Abstract: Language models trained for clinical disease inference are trained on patient data, which may include sensitive and private information, and data owners may request the removal of their data from a trained model due to privacy or…

12
arXiv — Machine Learning research 24d ago

A robust PPG foundation model using multimodal physiological supervision

arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining…

8
arXiv — Machine Learning research 24d ago

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

arXiv:2606.06509v1 Announce Type: cross Abstract: Numerous medical imaging problems must be solved under limited labels and constrained compute, yet it remains unclear whether performance gains are driven mainly by more expressive models or by better representation of clinically…

17
arXiv — NLP / Computation & Language research 24d ago

When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

arXiv:2606.07237v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in healthcare for tasks such as clinical question answering, diagnosis support, and report summarization. Despite their promise, these models remain highly sensitive to subtle…

17
arXiv — NLP / Computation & Language research 24d ago

LLM-Guided Evolution for Medical Decision Pipelines

arXiv:2606.07342v1 Announce Type: new Abstract: Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP-Elites evolution as an inference-time alternative for discovering…

9
arXiv — NLP / Computation & Language research 24d ago

TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment

arXiv:2606.07451v1 Announce Type: cross Abstract: Vision-language models such as CLIP are highly useful for diverse tasks due to their shared image-text embedding space. Despite this, the image and text embeddings are often poorly aligned, affecting downstream performance.…

6
arXiv — NLP / Computation & Language research 24d ago

CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction

arXiv:2508.03668v3 Announce Type: replace Abstract: Click-Through Rate (CTR) prediction, a core task in recommendation systems, estimates user click likelihood using historical behavioral data. Modeling user behavior sequences as text to leverage Language Models (LMs) for this…

5
r/LocalLLaMA community 25d ago

Dockerized Nemotron 3.5 ASR — Switched from Parakeet, better multilingual support + streaming (4.5x realtime speed on cpu)

I was originally using Parakeet for my speech recognition pipeline but decided to give Nemotron 3.5 a shot. After testing it on some multilingual audio clips, it's been working great so far. What sold me: - Better language support (40+ locales from one model) - Native streaming…

17
r/LocalLLaMA community 25d ago

Another 1-click admin account takeover in pewdiepie's AI tool (language in video nsfw)

  submitted by   /u/theonejvo [link]   [comments]

10
Hugging Face official-blog 26d ago

Job Searcher

Back to Articles Job Searcher Team Article Published June 6, 2026 Upvote 2 Emre emrekuruu build-small-hackathon Job hunting as a new grad is a full-time job by itself. You sift through hundreds of postings every week to find a handful worth applying to. You click "Easy Apply"…

15
Simon Willison community 26d ago

micropython-wasm 0.1a2

Release: micropython-wasm 0.1a2 I added a CLI to micropython-wasm ( issue #7 ), inspired by the first draft of the blog entry when I realized it would be a great way to illustrate the Try it yourself section. Tags: python , sandboxing , webassembly , micropython

4
Anthropic SDK (Python) releases dev-tools 26d ago

v0.106.0

0.106.0 (2026-06-05) Full Changelog: v0.105.2...v0.106.0 Features api: mark Claude Opus 4.1 as deprecated ( 85068cc ) Bug Fixes client: make Foundry client copy() and with_options() work ( 94146ac ) transform schema: preserve $defs when schema root is a $ref ( #1642 ) ( fc58e06…

19
Hugging Face Daily Papers research 26d ago

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

Abstract BRepCLIP enables multimodal representation learning for CAD models by aligning boundary representation geometry with language and image embeddings through contrastive pretraining, achieving superior retrieval and classification performance compared to point-based…

7
Hugging Face Daily Papers research 26d ago

MAOAM: Unified Object and Material Selection with Vision-Language Models

Abstract A unified vision-language model framework enables precise object and material selection through text or click interactions, supporting diverse editing workflows with improved robustness. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Selection is a core operation in…

11
llama.cpp releases dev-tools 27d ago

b9530

cli: fix model params not propagated ( #23893 ) Fixes #23847 macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu…

21
arXiv — Machine Learning research 27d ago

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

arXiv:2606.05435v1 Announce Type: new Abstract: Differentially private stochastic gradient descent (DP-SGD) has become the standard framework for privacy-preserving machine learning, yet its reliance on a fixed gradient clipping threshold to limit sensitivity remains a…

12
arXiv — Machine Learning research 27d ago

Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping

arXiv:2606.05731v1 Announce Type: new Abstract: In-season crop type mapping is critical for food security in the face of increasingly extreme climate-related threats to crops. Currently, the USDA Cropland Data Layer provides crop type labels at 30m resolution and is available…

8
arXiv — Machine Learning research 27d ago

HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care

arXiv:2606.05994v1 Announce Type: new Abstract: Medical knowledge graphs (MKGs) infused with clinical knowledge have been increasingly used to model electronic health records (EHRs) to support interpretable predictions in healthcare domain. However, existing MKG-based approaches…

31
arXiv — Machine Learning research 27d ago

Your GFlowNet Secretly Learns an Optimal Transport Plan

arXiv:2606.06272v1 Announce Type: new Abstract: Generative Flow Networks (GFlowNets) are a framework for sampling structured objects via stochastic trajectories in a directed graph. In this work, we establish a theoretical connection between non-acyclic GFlowNets and optimal…

5
arXiv — NLP / Computation & Language research 27d ago

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

arXiv:2606.05561v1 Announce Type: new Abstract: Speech-based mental health screening offers scalable depression detection, yet clinical deployment faces a significant barrier: users' privacy concerns about demographic information exposure. Current techniques struggle to resolve…

34
arXiv — NLP / Computation & Language research 27d ago

Measuring the sensitivity of LLM-based structured extraction to prompt, model, and schema choices in clinical discharge summaries

arXiv:2606.05970v1 Announce Type: new Abstract: Large language models are increasingly used for structured extraction from clinical free-text notes, but the sensitivity of their output to upstream configuration choices is less understood than their accuracy on fixed benchmarks.…

23
Hacker News — AI on Front Page community 27d ago

Open Code Review – An AI-powered code review CLI tool

Article URL: https://github.com/alibaba/open-code-review Comments URL: https://news.ycombinator.com/item?id=48406358 Points: 233 # Comments: 66

32
Hugging Face Daily Papers research 28d ago

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

Abstract MedSP1000 introduces an interactive benchmark derived from standardized patients to evaluate clinical agents' dynamic performance across encounters, revealing limitations of current large language models in medical applications. Generated by…

18
llama.cpp releases dev-tools 28d ago

b9503

fix(mtmd): handle Gemma 4 audio projector embedding size ( #24091 ) mtmd: handle Gemma 4 audio projector embedding size rm projection_dim from clip_n_mmproj_embd Co-authored-by: Xuan Son Nguyen son@huggingface.co macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64,…

28
arXiv — Machine Learning research 28d ago

Early Detection of Alzheimer's Disease Using Explainable Machine Learning on Clinical Biomarkers: A Multi-Class Classification Study Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) Dataset

arXiv:2606.03995v1 Announce Type: new Abstract: Background: Alzheimer's disease (AD) affects over 55 million people worldwide. Accurate, interpretable detection of normal cognition (NC), mild cognitive impairment (MCI), and AD from routine clinical assessments remains a critical…

14
arXiv — Machine Learning research 28d ago

KODA: Contrastive Representation Comparison and Alignment for Vision-Language Foundation Models

arXiv:2606.04180v1 Announce Type: new Abstract: Vision-language foundation models such as CLIP and SigLIP provide widely used representations for multimodal learning systems. While these models are typically compared through downstream performance, such evaluations often do not…

8
arXiv — NLP / Computation & Language research 28d ago

When Clients Stop Following: A Cognitive Conceptualization Diagram-driven Framework for Strategic Counseling

arXiv:2606.04389v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise in psychological counseling, yet existing benchmarks rely heavily on highly cooperative simulated clients. We observe a critical counselor-following phenomenon: these clients often rapidly…

14
arXiv — NLP / Computation & Language research 28d ago

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

arXiv:2606.05112v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly proposed as clinical agents, yet static, single-turn benchmarks cannot capture how a model dynamically delivers care across an encounter: gathering information, planning treatment, and…

32
Hugging Face official-blog 28d ago

Designing the hf CLI as an agent-optimized way to work with the Hub

Back to Articles Designing the hf CLI as an agent-optimized way to work with the Hub Published June 4, 2026 Update on GitHub Upvote 4 Célina Hanouti celinah Lucain Pouget Wauplin hf is the official command-line entrypoint to the Hugging Face Hub. Anything you can do on the Hub…

12
Ollama releases dev-tools 28d ago

v0.30.4-rc1: llama-server: fix gemma4 patch wiring (#16477)

This will fix the "clip.cpp:4399: Unknown projector type" crash.

4
Ollama releases dev-tools 28d ago

v0.30.4: llama-server: fix gemma4 patch wiring (#16477)

This will fix the "clip.cpp:4399: Unknown projector type" crash.

38
r/LocalLLaMA community 28d ago

How to use audio and vision modalities in llama.cpp?

How to use audio and vision modalities in llama.cpp with Gemma4 12B it? I’m on release b9494, but when I run llama-cli it shows “modalities: text” only, and crashes if I try to add an image.   submitted by   /u/No-Leave-4512 [link]   [comments]

20
Hugging Face Daily Papers research 29d ago

KletterMix: Climbing Toward High-Quality German Pretraining Data

Abstract A high-quality German-language corpus for language model pretraining is introduced through careful translation of an English corpus while preserving document structure and metadata, demonstrating improved downstream performance in German-language tasks. Generated by…

28
Hacker News — AI on Front Page community 29d ago

Mouseless – keyboard-driven control of macOS/Linux/Windows

Article URL: https://mouseless.click Comments URL: https://news.ycombinator.com/item?id=48383667 Points: 223 # Comments: 107

38
arXiv — Machine Learning research 29d ago

Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation

arXiv:2606.02604v1 Announce Type: new Abstract: ESG and climate risk data remain fragmented across heterogeneous Scope 1, Scope 2, and Scope 3 reporting environments, while conventional validation pipelines lack provenance aware auditability, hidden drift detection, and…

9
arXiv — Machine Learning research 29d ago

DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data

arXiv:2606.03209v1 Announce Type: new Abstract: Fine-tuning large language models (LLMs) in privacy-sensitive and resource-constrained environments remains challenging. Since training data are often distributed across multiple clients, decentralized fine-tuning offers a natural…

15
arXiv — Machine Learning research 29d ago

Learning Temporal Causal Structure via Smooth Differentiable Optimization

arXiv:2606.03227v1 Announce Type: new Abstract: Causal discovery with instantaneous effects in multivariate time series is challenging, as the instantaneous structure must be acyclic. Prior methods enforce this by either separating instantaneous and lagged estimation into…

4
arXiv — Machine Learning research 29d ago

Multi-Modal Graph Neural Network with Transformer-Guided Adaptive Diffusion for Preclinical Alzheimer Classification

arXiv:2606.03322v1 Announce Type: new Abstract: The graphical representation of the brain offers critical insights into diagnosing and prognosing neurodegenerative disease via relationships between regions of interest (ROIs). Despite recent emergence of various Graph Neural…

14
arXiv — NLP / Computation & Language research 29d ago

AI Rater Discrimination Depends on Scoring Protocol in Complex Clinical Decision-Making

arXiv:2606.03198v1 Announce Type: new Abstract: Clinical AI evaluation increasingly delegates scoring to large language models (LLMs) acting as AI raters, yet their scoring behavior across evaluation conditions has not been quantitatively characterized. We address this gap…

17
arXiv — NLP / Computation & Language research 29d ago

The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP

arXiv:2606.03250v1 Announce Type: new Abstract: Digital healthcare generates vast amounts of clinical text that can support AI-assisted applications, yet German biomedical language models remain limited by older architectures or restricted training data. We present ChristBERT…

33
arXiv — NLP / Computation & Language research 29d ago

SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series

arXiv:2606.03301v1 Announce Type: new Abstract: We introduce SagaQA, a long-form video benchmark for multi-hop reasoning over full-length TV series. Existing video reasoning benchmarks often emphasize local understanding of adjacent frames or clips. SagaQA addresses this gap by…

33
arXiv — NLP / Computation & Language research 29d ago

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

arXiv:2606.03399v1 Announce Type: new Abstract: While large language models (LLMs) are increasingly used for clinical applications, many existing pipelines require sending raw sensitive health information to remote servers for processing, which heightens the risk of privacy…

4
arXiv — NLP / Computation & Language research 29d ago

Does Language Shift Break Medical Vision-Language Models? Indonesian Radiology Visual Question Answering Case Study

arXiv:2606.03693v1 Announce Type: new Abstract: Medical Vision-Language Models (VLMs) are typically evaluated on English radiology visual question answering benchmarks, leaving their robustness under non-English clinical language largely unexplored. We introduce IndoRad-VQA, an…

10
arXiv — NLP / Computation & Language research 29d ago

KletterMix: Climbing Toward High-Quality German Pretraining Data

arXiv:2606.03773v1 Announce Type: new Abstract: High-quality pretraining data is a central ingredient in modern language models, but German-language resources remain far less developed than their English counterparts: they are often smaller, less carefully curated, weakly…

19
Hacker News — AI on Front Page community 29d ago

MAI-Code-1-Flash

https://microsoft.ai/models/mai-code-1-flash/ https://microsoft.ai/pdf/MAI-Code-1-Flash-Model-Card.PDF Launching seven new MAI models: https://microsoft.ai/news/building-a-hillclimbing-machine-la... Comments URL: https://news.ycombinator.com/item?id=48374466 Points: 228 #…

36
Hacker News — AI on Front Page community 1mo ago

1-Click GitHub Token Stealing via a VSCode Bug

Article URL: https://blog.ammaraskar.com/github-token-stealing/ Comments URL: https://news.ycombinator.com/item?id=48371562 Points: 220 # Comments: 30

4

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling

Constructing VAE Latent Spaces with Prescribed Topology

The discovery of the effects of women employment participation on the fertility of developing countries: A panel data approach

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

A robust PPG foundation model using multimodal physiological supervision

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

LLM-Guided Evolution for Medical Decision Pipelines

TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment

CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction

Dockerized Nemotron 3.5 ASR — Switched from Parakeet, better multilingual support + streaming (4.5x realtime speed on cpu)

Another 1-click admin account takeover in pewdiepie's AI tool (language in video nsfw)

Job Searcher

micropython-wasm 0.1a2

v0.106.0

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

MAOAM: Unified Object and Material Selection with Vision-Language Models

b9530

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping

HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care

Your GFlowNet Secretly Learns an Optimal Transport Plan

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

Measuring the sensitivity of LLM-based structured extraction to prompt, model, and schema choices in clinical discharge summaries

Open Code Review – An AI-powered code review CLI tool

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

b9503

Early Detection of Alzheimer's Disease Using Explainable Machine Learning on Clinical Biomarkers: A Multi-Class Classification Study Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) Dataset

KODA: Contrastive Representation Comparison and Alignment for Vision-Language Foundation Models

When Clients Stop Following: A Cognitive Conceptualization Diagram-driven Framework for Strategic Counseling

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

Designing the hf CLI as an agent-optimized way to work with the Hub

v0.30.4-rc1: llama-server: fix gemma4 patch wiring (#16477)

v0.30.4: llama-server: fix gemma4 patch wiring (#16477)

How to use audio and vision modalities in llama.cpp?

KletterMix: Climbing Toward High-Quality German Pretraining Data

Mouseless – keyboard-driven control of macOS/Linux/Windows

Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation

DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data

Learning Temporal Causal Structure via Smooth Differentiable Optimization

Multi-Modal Graph Neural Network with Transformer-Guided Adaptive Diffusion for Preclinical Alzheimer Classification

AI Rater Discrimination Depends on Scoring Protocol in Complex Clinical Decision-Making

The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP

SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

Does Language Shift Break Medical Vision-Language Models? Indonesian Radiology Visual Question Answering Case Study

KletterMix: Climbing Toward High-Quality German Pretraining Data

MAI-Code-1-Flash

1-Click GitHub Token Stealing via a VSCode Bug