News / #developer-tool Tag Developer Tool 500 articles archived under #developer-tool · RSS Sign in to follow Vercel — AI dev-tools 1mo ago Edit Git settings for all projects in a repo Monorepos that deploy many projects can now configure all of their project's Git settings more conveniently. Previously, if you wanted to consistently configure each project's settings for commit status, repository_dispatch events , etc., you had to click through to every… 16 Hugging Face Daily Papers research 1mo ago Multi-Agent Computer Use Abstract Multi-agent computer use systems outperform single-agent approaches on complex tasks by enabling parallel execution and dynamic task decomposition through directed acyclic graphs. AI-generated summary Computer use agents (CUAs) today are primarily deployed as single… 18 arXiv — Machine Learning research 1mo ago PE-means: Improved Differentially Private $k$-means Clustering through Private Evolution arXiv:2606.00342v1 Announce Type: new Abstract: We study the problem of differentially private (DP) $k$-means clustering in Euclidean space. Previous solutions rely on summing the private data directly, which induces a sensitivity proportional to the domain. We introduce… 17 arXiv — Machine Learning research 1mo ago Canonicalized Stable-List Replay for Private Federated Continual Learning over Language-Model Embeddings arXiv:2606.00426v1 Announce Type: new Abstract: Federated continual learning (FCL) lets distributed clients adapt language-model heads to evolving NLP tasks without sharing raw text. Under user-level differential privacy (DP), replay-based continual learning faces a structural… 17 arXiv — NLP / Computation & Language research 1mo ago A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models arXiv:2606.00027v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed across healthcare, yet existing benchmarks fail to capture model behavior under adversarial or ethically complex conditions common in clinical practice. We developed a… 37 arXiv — NLP / Computation & Language research 1mo ago LLMs for Cardiovascular Risk Prediction from Structured Clinical Data arXiv:2606.00031v1 Announce Type: new Abstract: Coronary artery disease (CAD) remains one of the leading causes of death globally, highlighting the need for reliable predictive systems to support early diagnosis and risk assessment. While traditional machine learning models… 14 arXiv — NLP / Computation & Language research 1mo ago LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification arXiv:2606.00647v1 Announce Type: new Abstract: Detecting psychological defense mechanisms in conversational text remains a challenging clinical NLP problem. For the PsyDefDetect 2026 shared task (nine-class utterance classification evaluated via macro F1), our team LinguIUTics… 5 arXiv — NLP / Computation & Language research 1mo ago Med-HEAL: Analyzing and Mitigating Hallucinations in Medical LLMs with Hallucination-Aware In-Context Learning arXiv:2606.01301v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) pose serious risks for clinical decision support, particularly when models must reason over complex electronic health records (EHRs). However, existing benchmarks often lack a… 8 r/MachineLearning community 1mo ago MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D] 79% of enterprises have adopted AI agents. Only 11% run them in production. We've spent the past year building agent systems for banks, clinical operations teams, and engineering orgs. The problem isn't that agents don't work — they work fine. The problem is that every framework… 12 Vercel — AI dev-tools 1mo ago Build Chat SDK web UIs in Vue or Svelte The Chat SDK web adapter now has first-class support for Vue and Svelte, joining the existing React integration. Because the adapter speaks the AI SDK UI message stream protocol , the same server route works. Each framework ships its own useChat , built on the matching AI SDK… 16 Vercel — AI dev-tools 1mo ago Build custom Slack runtimes Chat SDK now ships the Slack adapter 's primitives as standalone imports for apps that already handle their own routing, state, or workflow execution. Use only what you need: Request verification and payload parsing ( @chat-adapter/slack/webhook ) Markdown formatting (… 20 OpenAI Python SDK releases dev-tools 1mo ago v2.40.0 2.40.0 (2026-06-01) Full Changelog: v2.39.0...v2.40.0 Features api: Add Amazon Bedrock Responses support Bug Fixes api: allow setting bedrock api keys on the client directly ( 4d5bfde ) 19 Vercel — AI dev-tools 1mo ago Chat SDK adds Velt support Chat SDK now supports Velt with the new vendor-official adapter . Build bots that read and reply within Velt comment threads, right where your team already works: documents, text editors, and canvases. Tag the bot, and it will answer in the same thread, grounding its reply with… 24 Vercel — AI dev-tools 1mo ago Chat SDK adds AgentPhone support Chat SDK now supports AgentPhone with the new vendor-official adapter . Give your bot its own phone number so it can handle voice calls and text messages using the same handlers you already write. When a call ends, the transcript is delivered as a message, allowing your bot to… 14 Hacker News — AI on Front Page community 1mo ago NPM packages from RedHat have been compromised Article URL: https://github.com/RedHatInsights/javascript-clients/issues/492 Comments URL: https://news.ycombinator.com/item?id=48356625 Points: 327 # Comments: 151 37 r/LocalLLaMA community 1mo ago MTP is nice and all, but what about PP speeds? I don't know for the rest of you, but with my setup, as soon as i enable MTP, the PP performance and GPU usage drops significantly for some reason. It's not as much a memory issue for me as it is declining performance. My setup is: 2x Radeon VII 16gb on ROCm, 1x Rtx3080 8gb Max… 28 Hugging Face Daily Papers research 1mo ago One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation Abstract Group Prompting enables efficient cell instance segmentation by leveraging per-type prompting through a training-free framework that uses multi-scale encoder features and recursive prompt expansion. AI-generated summary Cell instance segmentation models trained on… 32 Hugging Face Daily Papers research 1mo ago How can embedding models bind concepts? Abstract Vision-language models like CLIP struggle with concept binding despite recognizing individual concepts, but controlled transformer models can learn low-complexity binding functions that generalize better through multiplicative interactions. AI-generated summary Humans… 11 r/LocalLLaMA community 1mo ago Just found a 1-click RCE in pewdiepie's Odysseus Chat PR being submitted to help the project as we speak. Sound on for extra lols.   submitted by   /u/theonejvo [link]   [comments] 7 Vercel — AI dev-tools 1mo ago Qwen 3.7 Plus now available on AI Gateway Qwen 3.7 Plus from Alibaba is now available on Vercel AI Gateway . The model unifies vision and language into a single agent foundation, with capabilities spanning GUI and CLI operation, coding and productivity workflows with full-modality input, and visual agent tasks including… 26 arXiv — Machine Learning research 1mo ago Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics arXiv:2605.30374v1 Announce Type: new Abstract: Estimating hip muscle forces and joint moments during gait typically relies on musculoskeletal simulation, which is informative but time-consuming and difficult to apply in clinical settings. This study developed a deep learning… 10 arXiv — Machine Learning research 1mo ago Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents arXiv:2605.30590v1 Announce Type: new Abstract: Two clinical AI systems can score nearly identically on coverage-based rubrics yet behave radically differently when their patient inputs change: one updates its recommendations to match the new clinical signal, while the other… 23 arXiv — NLP / Computation & Language research 1mo ago Generalistic or Specific Embeddings, Which is Better? An Empirical Study on Search for Clinical Coding in Non-English Languages arXiv:2605.30529v1 Announce Type: new Abstract: Sentence-embedding models for semantic search are overwhelmingly developed and evaluated on English corpora. When applied to clinical retrieval in other languages -- particularly retrieval of ICD-10-CM / CIE-10 codes -- recall… 26 arXiv — NLP / Computation & Language research 1mo ago Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs arXiv:2605.30646v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in clinical applications. However, their behavior remains highly sensitive to subtle linguistic variations, such as rephrasing or syntactic variation. This sensitivity poses risks… 27 r/LocalLLaMA community 1mo ago Llama Studio v0.2.0 I have made an update to my llama-server WebUI based on some awesome feedback and interaction with the community. 1) JSON model config replaced by per-model shell scripts. Run from CLI, paste from unsloth, email to your buddy or post to reddit: Using real shell scripts to store… 17 Hacker News — AI on Front Page community 1mo ago Creatine raise brain energy levels and slow Alzheimer's cognitive decline by 30% Article URL: https://thesciverse.org/scientists-found-that-the-creatine-supplement-millions-take-for-muscle-gains-is-quietly-raising-brain-energy-levels-and-slowing-early-alzheimers-cognitive-decline-by-30/ Comments URL: https://news.ycombinator.com/item?id=48346947 Points: 230… 15 Vercel — AI dev-tools 1mo ago Chat SDK adds Lark and Feishu support Chat SDK now supports Lark and Feishu via a new vendor-official adapter . Build bots that post, edit, and delete messages, stream replies via Lark's native cardkit typewriter API, send interactive cards, and react with emojis across both Lark and Feishu conversations. The… 20 r/LocalLLaMA community 1mo ago Step-3.7-Flash-NVFP4 thinking for many minutes Anyone else seeing Step-3.7-Flash-NVFP4 thinking for many minutes? I'm using it with Cline and can see it thinking for in some cases 14 minutes with vLLM reporting generation of 90 tokens/s every 10s.   submitted by   /u/NaiRogers [link]   [comments] 19 llama.cpp releases dev-tools 1mo ago b9414 mtmd: Add DeepSeekOCR 2 Support ( #20975 ) mtmd: DeepSeek-OCR 2 support, with multi-tile dynamic resolution introduced clip_image_f32::add_viewsep address PR review drop redundant ggml_cpy ops in both deepseekocr versions build drop no-op ggml_cont in build_sam assert… 30 TechCrunch — AI news-outlet 1mo ago What happens when companies become too AI-pilled? The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 Marcus on AI community 1mo ago What happens next, after the decline of tokenmaxxing? Two very different sets of predictions 25 TechCrunch — AI news-outlet 1mo ago Does your CEO have AI psychosis? Aaron Levie thinks most of them do. The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 MIT Technology Review — AI news-outlet 1mo ago How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment Pope Leo XIV’s new encyclical on artificial intelligence includes a statement that warrants serious attention from technologists and policymakers: “Technology is never neutral.” Magnifica Humanitas (“Magnificent Humanity”) is a clarion call to all people to act with courage and… 8 Hacker News — AI on Front Page community 1mo ago Volkswagen blocks Home Assistant by requiring client assertion Article URL: https://github.com/robinostlund/homeassistant-volkswagencarnet/issues/967 Comments URL: https://news.ycombinator.com/item?id=48319509 Points: 221 # Comments: 112 32 arXiv — Machine Learning research 1mo ago Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data arXiv:2605.29058v1 Announce Type: new Abstract: Bayesian Networks (BNs) are of interest from an explainable AI viewpoint, offering transparent probabilistic models for decision support. Baymex is a recently introduced multi-objective evolutionary algorithm for learning… 24 arXiv — Machine Learning research 1mo ago Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions arXiv:2605.29172v1 Announce Type: new Abstract: Seasonal climate predictions support planning and risk management by offering early information of the most likely-to-occur climate conditions in the coming months, and associated uncertainties. Ensemble forecasts enable this by… 20 arXiv — Machine Learning research 1mo ago SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction arXiv:2605.29236v1 Announce Type: new Abstract: Alarm fatigue in intensive care units (ICUs) is a well documented patient safety crisis. Clinical monitors generate 350 or more alarms per patient per day, out of which 72-99% are clinically irrelevant. Staff desensitization to… 29 arXiv — Machine Learning research 1mo ago Causal Label Recovery in Payment Networks arXiv:2605.29272v1 Announce Type: new Abstract: Fraud detection models in payment networks train on chargeback labels that are systematically biased. Every label must survive three sequential gates: authorization (declined transactions generate no labels), issuer reporting… 36 arXiv — NLP / Computation & Language research 1mo ago Specialty-Specific Medical Language Model for Immune-Mediated Diseases arXiv:2605.28838v1 Announce Type: new Abstract: Extracting detailed clinical information from free-text medical narratives remains a practical challenge for researchers and healthcare systems. Terminology for immune-mediated and infectious diseases is especially inconsistent… 29 arXiv — NLP / Computation & Language research 1mo ago Hallucination Detection-Guided Preference Optimization for Clinical Summarization arXiv:2605.28910v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise on summarization tasks, but they often produce hallucinations, which are unsupported or incorrect statements that limit their reliability in specialized healthcare applications. We… 21 llama.cpp releases dev-tools 1mo ago b9393 mtmd: fix gemma 4 audio rms norm eps ( #23815 ) mtmd: fix gemma 4 audio rms norm eps Update tools/mtmd/clip.cpp Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com macOS/iOS: macOS Apple Silicon (arm64) macOS… 34 The Information — AI news-outlet 1mo ago Blue Origin New Glenn Rocket Explodes During Test Jeff Bezos’ space company Blue Origin suffered a serious setback Thursday evening when its New Glenn rocket exploded on a launch pad in Florida during a test. Video clips of the incident show a giant fireball engulfing the rocket and surrounding structures. No one was on board… 18 r/LocalLLaMA community 1mo ago Claude cli >= 2.1.154 breaks local use with vLLM by introducing "ctx", "msg" and "system" roles for API messages. This 1-line patch to vLLM fixes it. diff --git a/vllm/entrypoints/anthropic/protocol.py b/vllm/entrypoints/anthropic/protocol.py index 3ebc17117..2d5726d73 100644 --- a/vllm/entrypoints/anthropic/protocol.py +++ b/vllm/entrypoints/anthropic/protocol.py @@ -65,7 +65,7 @@ class AnthropicContentBlock(BaseModel):… 29 r/MachineLearning community 1mo ago Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R] Are agents aging after deployment? : https://arxiv.org/abs/2605.26302 On a new longitudinal deployment benchmark, switching the Claude Code CLI agent from Sonnet 4.6 to Opus 4.7 dropped PyTest pass rate by ~15%. This (to me) is a counterintuitive-enough result to pay attention… 6 Don't Worry About the Vase community 1mo ago AI #170: Lack of Executive Order Last week ended on a cliffhanger of sorts. 28 arXiv — Machine Learning research 1mo ago Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility arXiv:2605.27467v1 Announce Type: new Abstract: Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks… 19 arXiv — Machine Learning research 1mo ago Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals arXiv:2605.27583v1 Announce Type: new Abstract: Electrocardiograms (ECGs) are widely used non-invasive measurements of cardiac activity and play a central role in clinical diagnosis. Recent multimodal approaches align ECG signals with clinical reports to incorporate diagnostic… 4 arXiv — Machine Learning research 1mo ago Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients? arXiv:2605.27733v1 Announce Type: new Abstract: Training instabilities such as loss spikes are frequently the result of stochastic gradient noise. Because of rare expressions in language training data, and multiple layer composition, the noise impact is heavy-tailed and survives… 33 arXiv — Machine Learning research 1mo ago Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning arXiv:2605.27954v1 Announce Type: new Abstract: Agentic large language models are increasingly used to solve real-world tasks by reasoning over goals, invoking tools, and interacting with external environments. Reinforcement learning provides a natural framework for improving… 38 arXiv — Machine Learning research 1mo ago Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur… 13 Page 7 of 10 · 500 articles ← Newer Older →