Tag

Developer Tool

500 articles archived under #developer-tool · RSS

Vercel — AI dev-tools 1mo ago

Edit Git settings for all projects in a repo

Monorepos that deploy many projects can now configure all of their project's Git settings more conveniently. Previously, if you wanted to consistently configure each project's settings for commit status, repository_dispatch events , etc., you had to click through to every…

16
Hugging Face Daily Papers research 1mo ago

Multi-Agent Computer Use

Abstract Multi-agent computer use systems outperform single-agent approaches on complex tasks by enabling parallel execution and dynamic task decomposition through directed acyclic graphs. AI-generated summary Computer use agents (CUAs) today are primarily deployed as single…

18
arXiv — Machine Learning research 1mo ago

PE-means: Improved Differentially Private $k$-means Clustering through Private Evolution

arXiv:2606.00342v1 Announce Type: new Abstract: We study the problem of differentially private (DP) $k$-means clustering in Euclidean space. Previous solutions rely on summing the private data directly, which induces a sensitivity proportional to the domain. We introduce…

17
arXiv — Machine Learning research 1mo ago

Canonicalized Stable-List Replay for Private Federated Continual Learning over Language-Model Embeddings

arXiv:2606.00426v1 Announce Type: new Abstract: Federated continual learning (FCL) lets distributed clients adapt language-model heads to evolving NLP tasks without sharing raw text. Under user-level differential privacy (DP), replay-based continual learning faces a structural…

17
arXiv — NLP / Computation & Language research 1mo ago

A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models

arXiv:2606.00027v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed across healthcare, yet existing benchmarks fail to capture model behavior under adversarial or ethically complex conditions common in clinical practice. We developed a…

37
arXiv — NLP / Computation & Language research 1mo ago

LLMs for Cardiovascular Risk Prediction from Structured Clinical Data

arXiv:2606.00031v1 Announce Type: new Abstract: Coronary artery disease (CAD) remains one of the leading causes of death globally, highlighting the need for reliable predictive systems to support early diagnosis and risk assessment. While traditional machine learning models…

14
arXiv — NLP / Computation & Language research 1mo ago

LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification

arXiv:2606.00647v1 Announce Type: new Abstract: Detecting psychological defense mechanisms in conversational text remains a challenging clinical NLP problem. For the PsyDefDetect 2026 shared task (nine-class utterance classification evaluated via macro F1), our team LinguIUTics…

5
arXiv — NLP / Computation & Language research 1mo ago

Med-HEAL: Analyzing and Mitigating Hallucinations in Medical LLMs with Hallucination-Aware In-Context Learning

arXiv:2606.01301v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) pose serious risks for clinical decision support, particularly when models must reason over complex electronic health records (EHRs). However, existing benchmarks often lack a…

8
r/MachineLearning community 1mo ago

MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D]

79% of enterprises have adopted AI agents. Only 11% run them in production. We've spent the past year building agent systems for banks, clinical operations teams, and engineering orgs. The problem isn't that agents don't work — they work fine. The problem is that every framework…

12
Vercel — AI dev-tools 1mo ago

Build Chat SDK web UIs in Vue or Svelte

The Chat SDK web adapter now has first-class support for Vue and Svelte, joining the existing React integration. Because the adapter speaks the AI SDK UI message stream protocol , the same server route works. Each framework ships its own useChat , built on the matching AI SDK…

16
Vercel — AI dev-tools 1mo ago

Build custom Slack runtimes

Chat SDK now ships the Slack adapter 's primitives as standalone imports for apps that already handle their own routing, state, or workflow execution. Use only what you need: Request verification and payload parsing ( @chat-adapter/slack/webhook ) Markdown formatting (…

20
OpenAI Python SDK releases dev-tools 1mo ago

v2.40.0

2.40.0 (2026-06-01) Full Changelog: v2.39.0...v2.40.0 Features api: Add Amazon Bedrock Responses support Bug Fixes api: allow setting bedrock api keys on the client directly ( 4d5bfde )

19
Vercel — AI dev-tools 1mo ago

Chat SDK adds Velt support

Chat SDK now supports Velt with the new vendor-official adapter . Build bots that read and reply within Velt comment threads, right where your team already works: documents, text editors, and canvases. Tag the bot, and it will answer in the same thread, grounding its reply with…

24
Vercel — AI dev-tools 1mo ago

Chat SDK adds AgentPhone support

Chat SDK now supports AgentPhone with the new vendor-official adapter . Give your bot its own phone number so it can handle voice calls and text messages using the same handlers you already write. When a call ends, the transcript is delivered as a message, allowing your bot to…

14
Hacker News — AI on Front Page community 1mo ago

NPM packages from RedHat have been compromised

Article URL: https://github.com/RedHatInsights/javascript-clients/issues/492 Comments URL: https://news.ycombinator.com/item?id=48356625 Points: 327 # Comments: 151

37
r/LocalLLaMA community 1mo ago

MTP is nice and all, but what about PP speeds?

I don't know for the rest of you, but with my setup, as soon as i enable MTP, the PP performance and GPU usage drops significantly for some reason. It's not as much a memory issue for me as it is declining performance. My setup is: 2x Radeon VII 16gb on ROCm, 1x Rtx3080 8gb Max…

28
Hugging Face Daily Papers research 1mo ago

One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation

Abstract Group Prompting enables efficient cell instance segmentation by leveraging per-type prompting through a training-free framework that uses multi-scale encoder features and recursive prompt expansion. AI-generated summary Cell instance segmentation models trained on…

32
Hugging Face Daily Papers research 1mo ago

How can embedding models bind concepts?

Abstract Vision-language models like CLIP struggle with concept binding despite recognizing individual concepts, but controlled transformer models can learn low-complexity binding functions that generalize better through multiplicative interactions. AI-generated summary Humans…

11
r/LocalLLaMA community 1mo ago

Just found a 1-click RCE in pewdiepie's Odysseus Chat

PR being submitted to help the project as we speak. Sound on for extra lols.   submitted by   /u/theonejvo [link]   [comments]

7
Vercel — AI dev-tools 1mo ago

Qwen 3.7 Plus now available on AI Gateway

Qwen 3.7 Plus from Alibaba is now available on Vercel AI Gateway . The model unifies vision and language into a single agent foundation, with capabilities spanning GUI and CLI operation, coding and productivity workflows with full-modality input, and visual agent tasks including…

26
arXiv — Machine Learning research 1mo ago

Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics

arXiv:2605.30374v1 Announce Type: new Abstract: Estimating hip muscle forces and joint moments during gait typically relies on musculoskeletal simulation, which is informative but time-consuming and difficult to apply in clinical settings. This study developed a deep learning…

10
arXiv — Machine Learning research 1mo ago

Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents

arXiv:2605.30590v1 Announce Type: new Abstract: Two clinical AI systems can score nearly identically on coverage-based rubrics yet behave radically differently when their patient inputs change: one updates its recommendations to match the new clinical signal, while the other…

23
arXiv — NLP / Computation & Language research 1mo ago

Generalistic or Specific Embeddings, Which is Better? An Empirical Study on Search for Clinical Coding in Non-English Languages

arXiv:2605.30529v1 Announce Type: new Abstract: Sentence-embedding models for semantic search are overwhelmingly developed and evaluated on English corpora. When applied to clinical retrieval in other languages -- particularly retrieval of ICD-10-CM / CIE-10 codes -- recall…

26
arXiv — NLP / Computation & Language research 1mo ago

Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs

arXiv:2605.30646v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in clinical applications. However, their behavior remains highly sensitive to subtle linguistic variations, such as rephrasing or syntactic variation. This sensitivity poses risks…

27
r/LocalLLaMA community 1mo ago

Llama Studio v0.2.0

I have made an update to my llama-server WebUI based on some awesome feedback and interaction with the community. 1) JSON model config replaced by per-model shell scripts. Run from CLI, paste from unsloth, email to your buddy or post to reddit: Using real shell scripts to store…

17
Hacker News — AI on Front Page community 1mo ago

Creatine raise brain energy levels and slow Alzheimer's cognitive decline by 30%

Article URL: https://thesciverse.org/scientists-found-that-the-creatine-supplement-millions-take-for-muscle-gains-is-quietly-raising-brain-energy-levels-and-slowing-early-alzheimers-cognitive-decline-by-30/ Comments URL: https://news.ycombinator.com/item?id=48346947 Points: 230…

15
Vercel — AI dev-tools 1mo ago

Chat SDK adds Lark and Feishu support

Chat SDK now supports Lark and Feishu via a new vendor-official adapter . Build bots that post, edit, and delete messages, stream replies via Lark's native cardkit typewriter API, send interactive cards, and react with emojis across both Lark and Feishu conversations. The…

20
r/LocalLLaMA community 1mo ago

Step-3.7-Flash-NVFP4 thinking for many minutes

Anyone else seeing Step-3.7-Flash-NVFP4 thinking for many minutes? I'm using it with Cline and can see it thinking for in some cases 14 minutes with vLLM reporting generation of 90 tokens/s every 10s.   submitted by   /u/NaiRogers [link]   [comments]

19
llama.cpp releases dev-tools 1mo ago

b9414

mtmd: Add DeepSeekOCR 2 Support ( #20975 ) mtmd: DeepSeek-OCR 2 support, with multi-tile dynamic resolution introduced clip_image_f32::add_viewsep address PR review drop redundant ggml_cpy ops in both deepseekocr versions build drop no-op ggml_cont in build_sam assert…

30
TechCrunch — AI news-outlet 1mo ago

What happens when companies become too AI-pilled?

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
Marcus on AI community 1mo ago

What happens next, after the decline of tokenmaxxing?

Two very different sets of predictions

25
TechCrunch — AI news-outlet 1mo ago

Does your CEO have AI psychosis? Aaron Levie thinks most of them do.

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
MIT Technology Review — AI news-outlet 1mo ago

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

Pope Leo XIV’s new encyclical on artificial intelligence includes a statement that warrants serious attention from technologists and policymakers: “Technology is never neutral.” Magnifica Humanitas (“Magnificent Humanity”) is a clarion call to all people to act with courage and…

8
Hacker News — AI on Front Page community 1mo ago

Volkswagen blocks Home Assistant by requiring client assertion

Article URL: https://github.com/robinostlund/homeassistant-volkswagencarnet/issues/967 Comments URL: https://news.ycombinator.com/item?id=48319509 Points: 221 # Comments: 112

32
arXiv — Machine Learning research 1mo ago

Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data

arXiv:2605.29058v1 Announce Type: new Abstract: Bayesian Networks (BNs) are of interest from an explainable AI viewpoint, offering transparent probabilistic models for decision support. Baymex is a recently introduced multi-objective evolutionary algorithm for learning…

24
arXiv — Machine Learning research 1mo ago

Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions

arXiv:2605.29172v1 Announce Type: new Abstract: Seasonal climate predictions support planning and risk management by offering early information of the most likely-to-occur climate conditions in the coming months, and associated uncertainties. Ensemble forecasts enable this by…

20
arXiv — Machine Learning research 1mo ago

SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction

arXiv:2605.29236v1 Announce Type: new Abstract: Alarm fatigue in intensive care units (ICUs) is a well documented patient safety crisis. Clinical monitors generate 350 or more alarms per patient per day, out of which 72-99% are clinically irrelevant. Staff desensitization to…

29
arXiv — Machine Learning research 1mo ago

Causal Label Recovery in Payment Networks

arXiv:2605.29272v1 Announce Type: new Abstract: Fraud detection models in payment networks train on chargeback labels that are systematically biased. Every label must survive three sequential gates: authorization (declined transactions generate no labels), issuer reporting…

36
arXiv — NLP / Computation & Language research 1mo ago

Specialty-Specific Medical Language Model for Immune-Mediated Diseases

arXiv:2605.28838v1 Announce Type: new Abstract: Extracting detailed clinical information from free-text medical narratives remains a practical challenge for researchers and healthcare systems. Terminology for immune-mediated and infectious diseases is especially inconsistent…

29
arXiv — NLP / Computation & Language research 1mo ago

Hallucination Detection-Guided Preference Optimization for Clinical Summarization

arXiv:2605.28910v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise on summarization tasks, but they often produce hallucinations, which are unsupported or incorrect statements that limit their reliability in specialized healthcare applications. We…

21
llama.cpp releases dev-tools 1mo ago

b9393

mtmd: fix gemma 4 audio rms norm eps ( #23815 ) mtmd: fix gemma 4 audio rms norm eps Update tools/mtmd/clip.cpp Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com macOS/iOS: macOS Apple Silicon (arm64) macOS…

34
The Information — AI news-outlet 1mo ago

Blue Origin New Glenn Rocket Explodes During Test

Jeff Bezos’ space company Blue Origin suffered a serious setback Thursday evening when its New Glenn rocket exploded on a launch pad in Florida during a test. Video clips of the incident show a giant fireball engulfing the rocket and surrounding structures. No one was on board…

18
r/LocalLLaMA community 1mo ago

Claude cli >= 2.1.154 breaks local use with vLLM by introducing "ctx", "msg" and "system" roles for API messages. This 1-line patch to vLLM fixes it.

diff --git a/vllm/entrypoints/anthropic/protocol.py b/vllm/entrypoints/anthropic/protocol.py index 3ebc17117..2d5726d73 100644 --- a/vllm/entrypoints/anthropic/protocol.py +++ b/vllm/entrypoints/anthropic/protocol.py @@ -65,7 +65,7 @@ class AnthropicContentBlock(BaseModel):…

29
r/MachineLearning community 1mo ago

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R]

Are agents aging after deployment? : https://arxiv.org/abs/2605.26302 On a new longitudinal deployment benchmark, switching the Claude Code CLI agent from Sonnet 4.6 to Opus 4.7 dropped PyTest pass rate by ~15%. This (to me) is a counterintuitive-enough result to pay attention…

6
Don't Worry About the Vase community 1mo ago

AI #170: Lack of Executive Order

Last week ended on a cliffhanger of sorts.

28
arXiv — Machine Learning research 1mo ago

Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility

arXiv:2605.27467v1 Announce Type: new Abstract: Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks…

19
arXiv — Machine Learning research 1mo ago

Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals

arXiv:2605.27583v1 Announce Type: new Abstract: Electrocardiograms (ECGs) are widely used non-invasive measurements of cardiac activity and play a central role in clinical diagnosis. Recent multimodal approaches align ECG signals with clinical reports to incorporate diagnostic…

4
arXiv — Machine Learning research 1mo ago

Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?

arXiv:2605.27733v1 Announce Type: new Abstract: Training instabilities such as loss spikes are frequently the result of stochastic gradient noise. Because of rare expressions in language training data, and multiple layer composition, the noise impact is heavy-tailed and survives…

33
arXiv — Machine Learning research 1mo ago

Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning

arXiv:2605.27954v1 Announce Type: new Abstract: Agentic large language models are increasingly used to solve real-world tasks by reasoning over goals, invoking tools, and interacting with external environments. Reinforcement learning provides a natural framework for improving…

38
arXiv — Machine Learning research 1mo ago

Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee

arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur…

13

Edit Git settings for all projects in a repo

Multi-Agent Computer Use

PE-means: Improved Differentially Private $k$-means Clustering through Private Evolution

Canonicalized Stable-List Replay for Private Federated Continual Learning over Language-Model Embeddings

A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models

LLMs for Cardiovascular Risk Prediction from Structured Clinical Data

LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification

Med-HEAL: Analyzing and Mitigating Hallucinations in Medical LLMs with Hallucination-Aware In-Context Learning

MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D]

Build Chat SDK web UIs in Vue or Svelte

Build custom Slack runtimes

v2.40.0

Chat SDK adds Velt support

Chat SDK adds AgentPhone support

NPM packages from RedHat have been compromised

MTP is nice and all, but what about PP speeds?

One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation

How can embedding models bind concepts?

Just found a 1-click RCE in pewdiepie's Odysseus Chat

Qwen 3.7 Plus now available on AI Gateway

Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics

Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents

Generalistic or Specific Embeddings, Which is Better? An Empirical Study on Search for Clinical Coding in Non-English Languages

Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs

Llama Studio v0.2.0

Creatine raise brain energy levels and slow Alzheimer's cognitive decline by 30%

Chat SDK adds Lark and Feishu support

Step-3.7-Flash-NVFP4 thinking for many minutes

b9414

What happens when companies become too AI-pilled?

What happens next, after the decline of tokenmaxxing?

Does your CEO have AI psychosis? Aaron Levie thinks most of them do.

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

Volkswagen blocks Home Assistant by requiring client assertion

Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data

Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions

SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction

Causal Label Recovery in Payment Networks

Specialty-Specific Medical Language Model for Immune-Mediated Diseases

Hallucination Detection-Guided Preference Optimization for Clinical Summarization

b9393

Blue Origin New Glenn Rocket Explodes During Test

Claude cli >= 2.1.154 breaks local use with vLLM by introducing "ctx", "msg" and "system" roles for API messages. This 1-line patch to vLLM fixes it.

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R]

AI #170: Lack of Executive Order

Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility

Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals

Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?

Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning

Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee