Tag

Developer Tool

500 articles archived under #developer-tool · RSS

arXiv — NLP / Computation & Language research 20d ago

Cross-Modal Masked Compositional Concept Modeling for Enhancing Visio-Linguistic Compositionality

arXiv:2606.13288v1 Announce Type: cross Abstract: Contrastively trained vision-language models like CLIP, have made remarkable progress in learning joint image-text representations, but still face challenges in compositional understanding. They often exhibit a "bag-of-words"…

38
Vercel — AI dev-tools 20d ago

Program Claude Code, Codex, Pi and other agent harnesses with AI SDK

AI SDK 7 introduces HarnessAgent , a single API for running established agent harnesses, including Claude Code, Codex, and Pi. AI SDK has always let you switch models without rewriting your agent. Now you can switch the harness the same way. Write the agent once. Use the best…

7
NVIDIA Developer Blog official-blog 20d ago

One-Click Multi-Tenant Security with NVIDIA Quantum InfiniBand

NVIDIA Quantum InfiniBand now offers intent-based security profiles in Unified Fabric Manager (UFM) that enable multi-tenant fabric security in a single...

33
r/MachineLearning community 20d ago

What should context compression keep? I looked at how six agents handle it[D]

I use Claude Code, Codex CLI, OpenCode, Cline, Cursor, and Amp enough to notice a pattern in how they handle long context. They are all converging on layered progressive compression, but they disagree on what to protect. Most protect recent user messages as a first-class asset.…

20
arXiv — Machine Learning research 21d ago

Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data

arXiv:2606.11272v1 Announce Type: new Abstract: Federated Learning (FL) enables collaborative and privacy-preserving model training across distributed clients, but most existing FL systems implicitly assume data stationarity. In real-world settings-such as healthcare, industrial…

10
arXiv — Machine Learning research 21d ago

Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity

arXiv:2606.11431v1 Announce Type: new Abstract: Mirror Descent (MD) extends Gradient Descent (GD) beyond Euclidean geometry and has recently reappeared as a lens for KL-regularized policy optimization in reinforcement learning and LLM post-training. This raises a basic…

10
arXiv — Machine Learning research 21d ago

LSTM-Based Detection of Structural Breaks in Property Insurance Loss Reserving: A Climate-Informed Approach

arXiv:2606.11463v1 Announce Type: new Abstract: Accurate loss reserving is foundational to insurer solvency, yet accelerating climate driven catastrophes systematically violate the stability assumptions on which traditional actuarial methods depend. This white paper presents a…

30
arXiv — Machine Learning research 21d ago

AI4Land: Scalable Deep Learning for Global High-Resolution Land Use Reconstruction

arXiv:2606.11793v1 Announce Type: new Abstract: Uncertainty in the terrestrial carbon cycle remains a major constraint in climate projections, partly driven by the uncertainties affecting the land surface representation and variability in Earth system models. To address this…

11
arXiv — Machine Learning research 21d ago

Multimodal Ordinal Modeling of Alzheimer's Disease Severity Using Structural MRI and Clinical Data

arXiv:2606.11794v1 Announce Type: new Abstract: Neurodegenerative diseases such as Alzheimer's disease (AD) require accurate and scalable tools for assessing disease severity, yet current clinical staging remains time-intensive and prone to variability. We propose an…

17
arXiv — Machine Learning research 21d ago

Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation

arXiv:2606.12006v1 Announce Type: new Abstract: Predicting time-to-event outcomes such as mortality is a fundamental task in clinical decision-making, commonly addressed through survival analysis. While classical statistical and deep learning approaches have been widely studied,…

33
arXiv — Machine Learning research 21d ago

PCA-Enhanced Adaptive NVAR Framework for High-Resolution Sea Surface Temperature Forecasting in the East Sea

arXiv:2606.12141v1 Announce Type: new Abstract: Accurate forecasting of sea surface temperature (SST) in regional seas such as the East Sea is crucial for monitoring marine ecosystems, assessing climate risks, managing fisheries, and conducting naval operations. Traditional…

34
arXiv — Machine Learning research 21d ago

Using Explainability as a Training-Time Reliability Signal for Efficient ECG Classification

arXiv:2606.12252v1 Announce Type: new Abstract: Training deep neural networks for clinical time-series analysis is computationally demanding, yet many healthcare settings lack the resources required for repeated model development and deployment. This challenge is particularly…

8
arXiv — NLP / Computation & Language research 21d ago

BioDivergence: A Benchmark and Evaluation Framework for Hidden Contextual Contradictions in Biomedical Abstracts

arXiv:2606.11208v1 Announce Type: new Abstract: Biomedical findings often seem to conflict across studies, but many of these differences are context-dependent rather than true contradictions. Variations in cohort, geography, assay protocol, disease subtype, and clinical setting…

29
arXiv — NLP / Computation & Language research 21d ago

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

arXiv:2606.12250v1 Announce Type: new Abstract: Large language models (LLMs) in medicine are mainly evaluated using multiple-choice question answering (MCQA), which can overestimate real clinical ability due to guessing strategies and answer biases. To address these limitations,…

37
The Information — AI news-outlet 21d ago

Xbox Plans Layoffs as Revenue, Profit Margins Decline

Microsoft’s Xbox gaming unit plans to cut staff in the coming months as its financial picture worsens, according to someone with knowledge of the plans. In a note to staff Wednesday, CEO Asha Sharma said that Xbox’s “accountability margins”—a term Microsoft uses internally to…

30
TechCrunch — AI news-outlet 21d ago

Fresh off bond sale, Amazon borrows $17.5B from banks as AI spending continues

Companies are burning through exorbitant sums of money to keep pace in the AI arms race. Debt is climbing.

14
GitHub Blog — AI & ML official-blog 21d ago

Give GitHub Copilot CLI real code intelligence with language servers

Install and configure LSP servers for GitHub Copilot CLI, replacing brute-force grep/decompile with real code intelligence. The post Give GitHub Copilot CLI real code intelligence with language servers appeared first on The GitHub Blog .

34
Hugging Face Daily Papers research 22d ago

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Abstract BrainSurgery is a tool for robust and reproducible tensor manipulation of neural network checkpoints through declarative YAML plans with built-in validation. Generated by Qwen/Qwen2.5-Coder-32B-Instruct As deep learning models scale, managing, inspecting, and modifying…

12
Hugging Face Daily Papers research 22d ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Abstract Flow-DPPO replaces ratio clipping with divergence proximal constraints in flow matching models, improving training stability and multi-objective optimization through exact KL divergence computation. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Recent work has…

34
arXiv — Machine Learning research 22d ago

TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification

arXiv:2606.09898v1 Announce Type: new Abstract: Cancer treatment planning requires decisions across multiple clinical dimensions at once. Clinicians must determine whether a patient should receive targeted molecular therapy, radiation therapy, and whether they are likely to…

22
arXiv — Machine Learning research 22d ago

LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts

arXiv:2606.09907v1 Announce Type: new Abstract: Multimodal clinical learning is increasingly important for integrating diverse patient data, including imaging, text, and personalised health records. However, it faces two fundamental challenges: i) modality missingness, where…

28
arXiv — Machine Learning research 22d ago

FedSteer: Taming Extreme Gradient Staleness in Federated Learning with Corrective Projections and Caching

arXiv:2606.10124v1 Announce Type: new Abstract: Federated learning (FL) is often subject to aggregation variance if clients do not consistently participate in training rounds. While reusing stale model updates from inactive clients is a common technique to reduce this variance,…

33
arXiv — Machine Learning research 22d ago

MMClima: A Framework for Multimodal Climate Science Data and Evaluation

arXiv:2606.10194v1 Announce Type: new Abstract: Climate change research increasingly requires AI systems that reason across text, dynamic visual content, and scientific figures, yet existing climate QA benchmarks are small, mostly textual, and cover a narrow range of models. We…

20
arXiv — Machine Learning research 22d ago

DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction

arXiv:2606.10243v1 Announce Type: new Abstract: Offsite conversion rate (OCVR) prediction is an important ranking problem in computational recommendation systems. This task presents a modeling challenge: click signals are abundant and exhibit short temporal horizons, whereas…

25
arXiv — NLP / Computation & Language research 22d ago

Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs

arXiv:2606.10722v1 Announce Type: new Abstract: We study dense-to-sparse continual training as a way to construct channel-sparse large language models from dense checkpoints. Starting from a Qwen2.5-8B dense backbone, we continue training at 32K context and introduce a…

24
arXiv — NLP / Computation & Language research 22d ago

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

arXiv:2606.10796v1 Announce Type: new Abstract: Automatic Depression Detection (ADD) from clinical interviews is a pivotal task in computational mental health, yet it remains challenging due to two critical obstacles: 1) difficulty in modeling complex but sparsely distributed…

5
arXiv — NLP / Computation & Language research 22d ago

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

arXiv:2606.10279v1 Announce Type: cross Abstract: Supervised fine-tuning with synthetic rationale data is widely assumed to improve language model performance on clinical prediction tasks by teaching models not just what to predict but why. We test this assumption on five-year…

28
TechCrunch — AI news-outlet 22d ago

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button

Anthropic's Claude Fable 5 is going to be a big hit with the web's vibe coders.

27
llama.cpp releases dev-tools 22d ago

b9586: webui: implement pinned conversations support (#21387)

webui: implement pinned conversations support webui: linter/prettier pass Fix the unused handleMobileSidebarItemClick from the component. the search should find pinned conversations as well Co-authored-by: Pascal admin@serveurperso.com Co-authored-by: Pascal…

24
Anthropic SDK (Python) releases dev-tools 22d ago

v0.108.0

0.108.0 (2026-06-09) Full Changelog: v0.107.1...v0.108.0 Features api: add support for claude-mythos-5 and claude-fable-5, with support for server-side fallbacks on refusal ( 6b76649 ) client: adds client-side fallbacks middleware for API providers that do not support…

12
GitHub Blog — AI & ML official-blog 22d ago

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Custom agents let GitHub Copilot CLI understand your stack and team workflows, turning one-off terminal prompts into repeatable, reviewable processes. The post From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI appeared first on The GitHub Blog .

20
NVIDIA Developer Blog official-blog 22d ago

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodipine,...

9
Hugging Face Daily Papers research 23d ago

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

Abstract SkeMex is a self-evolving framework that enhances medical agents through structured skill memory, improving long-term clinical reasoning by distinguishing useful experiences and governing memory retention based on contextual utility. Generated by…

32
arXiv — Machine Learning research 23d ago

TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation

arXiv:2606.07569v1 Announce Type: new Abstract: Accurate carbon emission monitoring is critical for climate policy and emerging regulatory mechanisms such as the EU Carbon Border Adjustment Mechanism, yet city-level high-frequency monitoring data remain extremely scarce,…

31
arXiv — Machine Learning research 23d ago

HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning

arXiv:2606.07621v1 Announce Type: new Abstract: Edge services increasingly use federated learning to personalize on-device models while keeping sensitive data local. In practice, deployments must handle heterogeneity in both client resources and local data distributions.…

24
arXiv — Machine Learning research 23d ago

BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing

arXiv:2606.07692v1 Announce Type: new Abstract: Foundation models for wearable biosignals have matched or exceeded supervised specialists across a range of clinical tasks, yet all rely on modalities that require deliberate user action--wearing a device or visiting a sleep lab.…

14
arXiv — Machine Learning research 23d ago

EvoCSFL: Surrogate-Assisted Evolutionary Client Selection for Efficient and Robust Federated Learning

arXiv:2606.07702v1 Announce Type: new Abstract: The heterogeneity of client data and systems makes it difficult to achieve satisfactory convergence speed and robustness in federated learning with random client selection. To address this issue, this paper proposes a…

29
arXiv — Machine Learning research 23d ago

Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling

arXiv:2606.07898v1 Announce Type: new Abstract: High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key…

16
arXiv — Machine Learning research 23d ago

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

arXiv:2606.08037v1 Announce Type: new Abstract: Electrocardiogram (ECG) classification models often suffer from severe label scarcity, making semi-supervised learning (SSL) an attractive strategy for reducing annotation costs. In clinical settings, however, unlabeled pools…

26
r/LocalLLaMA community 23d ago

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

I fine-tuned NVIDIA's Parakeet TDT 0.6B v2 for clinical speech and am releasing the weights as Omi Med STT v1 (CC-BY-4.0). Disclosure: I'm the founder of Omi Health and built this. Happy to dig into the training mix, benchmark, failure cases, quantization, or anything else. The…

14
r/LocalLLaMA community 23d ago

Here's a llama.cpp CLI Command builder.

No accounts or sign up. No email requirements. No pop-ups and no cookies. No ads. Info is saved locally in your browser so you dont lose any progress. Its got every single flag and argument that could be found in the documentation. Tool tips are added to everything. Every field…

19
Vercel — AI dev-tools 23d ago

Domain Search is now available through the Vercel CLI

You can now use the Vercel CLI to search domains. Using the vercel domains search command, you can supply a domain name and retrieve availability and price results for all TLDs that Vercel supports. You can also filter by TLD, apply sorting, and filter out unavailable domains.…

7
Vercel — AI dev-tools 23d ago

How Fern runs multi-tenant docs for Webflow and ElevenLabs on Vercel

Fern on Vercel 3x faster time to first byte Page load times reduced by 80% 6 million+ page views per month from 1 million+ unique visitors 65% of the platform migrated from Pages Router to App Router in 7 days Fern helps companies ship developer documentation and SDKs, running…

4
llama.cpp releases dev-tools 23d ago

b9562

mtmd : add video input support ( #24269 ) wip ok: lazy bitmap API remember to free lazy text wip add mtmd_helper_video support video input on server (base64 input) add MTMD_VIDEO config add timestamp update CLI cli: allow auto-completion for video add --video arg fix build…

22
llama.cpp releases dev-tools 23d ago

b9559

cli: fix spinner not show during prompt processing ( #24283 ) macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu…

10
r/LocalLLaMA community 24d ago

llama-launcher Release

Hello everyone, I've been working on a point and click GUI to make tinkering with llama-server flags much quicker and easier, I thought I'd share for anyone else who might be interested. It's also great for anyone new to llama.cpp that is looking to get into it and doesn't want…

7
r/MachineLearning community 24d ago

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

I've been building agents for about a year and recently shipped one for a client running ~140 MCP-exposed tools at peak. Along the way I made the canonical mistake. I used cosine similarity over tool description embeddings to pick which tools the model could see per turn. Worked…

9
r/LocalLLaMA community 24d ago

Meddies PII: An Open Multilingual De-identification Model for Clinical Text

A clinical AI model does not need to know who the patient is to reason clinically. It needs the symptoms, medications, lab results, diagnosis history, and treatment course. The problem is that in real medical records, those facts usually sit next to identifiers: names, record…

38
Ars Technica — AI news-outlet 24d ago

The weather and climate science AI revolution isn’t revolutionary

Machine learning has its limits—how is it being used?

21
arXiv — Machine Learning research 24d ago

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

arXiv:2606.06647v1 Announce Type: new Abstract: Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or…

38

Cross-Modal Masked Compositional Concept Modeling for Enhancing Visio-Linguistic Compositionality

Program Claude Code, Codex, Pi and other agent harnesses with AI SDK

One-Click Multi-Tenant Security with NVIDIA Quantum InfiniBand

What should context compression keep? I looked at how six agents handle it[D]

Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data

Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity

LSTM-Based Detection of Structural Breaks in Property Insurance Loss Reserving: A Climate-Informed Approach

AI4Land: Scalable Deep Learning for Global High-Resolution Land Use Reconstruction

Multimodal Ordinal Modeling of Alzheimer's Disease Severity Using Structural MRI and Clinical Data

Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation

PCA-Enhanced Adaptive NVAR Framework for High-Resolution Sea Surface Temperature Forecasting in the East Sea

Using Explainability as a Training-Time Reliability Signal for Efficient ECG Classification

BioDivergence: A Benchmark and Evaluation Framework for Hidden Contextual Contradictions in Biomedical Abstracts

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

Xbox Plans Layoffs as Revenue, Profit Margins Decline

Fresh off bond sale, Amazon borrows $17.5B from banks as AI spending continues

Give GitHub Copilot CLI real code intelligence with language servers

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification

LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts

FedSteer: Taming Extreme Gradient Staleness in Federated Learning with Corrective Projections and Caching

MMClima: A Framework for Multimodal Climate Science Data and Evaluation

DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction

Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

Anthropic&#8217;s Fable 5 can make weirdly fun video games with the click of a button

b9586: webui: implement pinned conversations support (#21387)

v0.108.0

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation

HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning

BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing

EvoCSFL: Surrogate-Assisted Evolutionary Client Selection for Efficient and Robust Federated Learning

Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

Here's a llama.cpp CLI Command builder.

Domain Search is now available through the Vercel CLI

How Fern runs multi-tenant docs for Webflow and ElevenLabs on Vercel

b9562

b9559

llama-launcher Release

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

Meddies PII: An Open Multilingual De-identification Model for Clinical Text

The weather and climate science AI revolution isn’t revolutionary

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button