Tag: Developer Tool · 500 articles archived under #developer-tool

arXiv — NLP / Computation & Language research 20d ago Cross-Modal Masked Compositional Concept Modeling for Enhancing Visio-Linguistic Compositionality arXiv:2606.13288v1 Announce Type: cross Abstract: Contrastively trained vision-language models like CLIP, have made remarkable progress in learning joint image-text representations, but still face challenges in compositional understanding. They often exhibit a "bag-of-words"… 38

Vercel — AI dev-tools 20d ago Program Claude Code, Codex, Pi and other agent harnesses with AI SDK AI SDK 7 introduces HarnessAgent, a single API for running established agent harnesses, including Claude Code, Codex, and Pi. AI SDK has always let you switch models without rewriting your agent. Now you can switch the harness the same way. Write the agent once. Use the best… 7

NVIDIA Developer Blog official-blog 20d ago One-Click Multi-Tenant Security with NVIDIA Quantum InfiniBand NVIDIA Quantum InfiniBand now offers intent-based security profiles in Unified Fabric Manager (UFM) that enable multi-tenant fabric security in a single... 33

r/MachineLearning community 20d ago What should context compression keep? I looked at how six agents handle it [D] I use Claude Code, Codex CLI, OpenCode, Cline, Cursor, and Amp enough to notice a pattern in how they handle long context. They are all converging on layered progressive compression, but they disagree on what to protect. Most protect recent user messages as a first-class asset.… 20

arXiv — Machine Learning research 21d ago Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data arXiv:2606.11272v1 Announce Type: new Abstract: Federated Learning (FL) enables collaborative and privacy-preserving model training across distributed clients, but most existing FL systems implicitly assume data stationarity.
In real-world settings-such as healthcare, industrial… 10

arXiv — Machine Learning research 21d ago Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity arXiv:2606.11431v1 Announce Type: new Abstract: Mirror Descent (MD) extends Gradient Descent (GD) beyond Euclidean geometry and has recently reappeared as a lens for KL-regularized policy optimization in reinforcement learning and LLM post-training. This raises a basic… 10

arXiv — Machine Learning research 21d ago LSTM-Based Detection of Structural Breaks in Property Insurance Loss Reserving: A Climate-Informed Approach arXiv:2606.11463v1 Announce Type: new Abstract: Accurate loss reserving is foundational to insurer solvency, yet accelerating climate driven catastrophes systematically violate the stability assumptions on which traditional actuarial methods depend. This white paper presents a… 30

arXiv — Machine Learning research 21d ago AI4Land: Scalable Deep Learning for Global High-Resolution Land Use Reconstruction arXiv:2606.11793v1 Announce Type: new Abstract: Uncertainty in the terrestrial carbon cycle remains a major constraint in climate projections, partly driven by the uncertainties affecting the land surface representation and variability in Earth system models. To address this… 11

arXiv — Machine Learning research 21d ago Multimodal Ordinal Modeling of Alzheimer's Disease Severity Using Structural MRI and Clinical Data arXiv:2606.11794v1 Announce Type: new Abstract: Neurodegenerative diseases such as Alzheimer's disease (AD) require accurate and scalable tools for assessing disease severity, yet current clinical staging remains time-intensive and prone to variability.
We propose an… 17

arXiv — Machine Learning research 21d ago Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation arXiv:2606.12006v1 Announce Type: new Abstract: Predicting time-to-event outcomes such as mortality is a fundamental task in clinical decision-making, commonly addressed through survival analysis. While classical statistical and deep learning approaches have been widely studied,… 33

arXiv — Machine Learning research 21d ago PCA-Enhanced Adaptive NVAR Framework for High-Resolution Sea Surface Temperature Forecasting in the East Sea arXiv:2606.12141v1 Announce Type: new Abstract: Accurate forecasting of sea surface temperature (SST) in regional seas such as the East Sea is crucial for monitoring marine ecosystems, assessing climate risks, managing fisheries, and conducting naval operations. Traditional… 34

arXiv — Machine Learning research 21d ago Using Explainability as a Training-Time Reliability Signal for Efficient ECG Classification arXiv:2606.12252v1 Announce Type: new Abstract: Training deep neural networks for clinical time-series analysis is computationally demanding, yet many healthcare settings lack the resources required for repeated model development and deployment. This challenge is particularly… 8

arXiv — NLP / Computation & Language research 21d ago BioDivergence: A Benchmark and Evaluation Framework for Hidden Contextual Contradictions in Biomedical Abstracts arXiv:2606.11208v1 Announce Type: new Abstract: Biomedical findings often seem to conflict across studies, but many of these differences are context-dependent rather than true contradictions. Variations in cohort, geography, assay protocol, disease subtype, and clinical setting… 29

arXiv — NLP / Computation & Language research 21d ago Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?
arXiv:2606.12250v1 Announce Type: new Abstract: Large language models (LLMs) in medicine are mainly evaluated using multiple-choice question answering (MCQA), which can overestimate real clinical ability due to guessing strategies and answer biases. To address these limitations,… 37

The Information — AI news-outlet 21d ago Xbox Plans Layoffs as Revenue, Profit Margins Decline Microsoft’s Xbox gaming unit plans to cut staff in the coming months as its financial picture worsens, according to someone with knowledge of the plans. In a note to staff Wednesday, CEO Asha Sharma said that Xbox’s “accountability margins”—a term Microsoft uses internally to… 30

TechCrunch — AI news-outlet 21d ago Fresh off bond sale, Amazon borrows $17.5B from banks as AI spending continues Companies are burning through exorbitant sums of money to keep pace in the AI arms race. Debt is climbing. 14

GitHub Blog — AI & ML official-blog 21d ago Give GitHub Copilot CLI real code intelligence with language servers Install and configure LSP servers for GitHub Copilot CLI, replacing brute-force grep/decompile with real code intelligence. 34

Hugging Face Daily Papers research 22d ago BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling Abstract BrainSurgery is a tool for robust and reproducible tensor manipulation of neural network checkpoints through declarative YAML plans with built-in validation. Generated by Qwen/Qwen2.5-Coder-32B-Instruct As deep learning models scale, managing, inspecting, and modifying… 12

Hugging Face Daily Papers research 22d ago Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Abstract Flow-DPPO replaces ratio clipping with divergence proximal constraints in flow matching models, improving training stability and multi-objective optimization through exact KL divergence computation.
Generated by Qwen/Qwen2.5-Coder-32B-Instruct Recent work has… 34

arXiv — Machine Learning research 22d ago TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification arXiv:2606.09898v1 Announce Type: new Abstract: Cancer treatment planning requires decisions across multiple clinical dimensions at once. Clinicians must determine whether a patient should receive targeted molecular therapy, radiation therapy, and whether they are likely to… 22

arXiv — Machine Learning research 22d ago LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts arXiv:2606.09907v1 Announce Type: new Abstract: Multimodal clinical learning is increasingly important for integrating diverse patient data, including imaging, text, and personalised health records. However, it faces two fundamental challenges: i) modality missingness, where… 28

arXiv — Machine Learning research 22d ago FedSteer: Taming Extreme Gradient Staleness in Federated Learning with Corrective Projections and Caching arXiv:2606.10124v1 Announce Type: new Abstract: Federated learning (FL) is often subject to aggregation variance if clients do not consistently participate in training rounds. While reusing stale model updates from inactive clients is a common technique to reduce this variance,… 33

arXiv — Machine Learning research 22d ago MMClima: A Framework for Multimodal Climate Science Data and Evaluation arXiv:2606.10194v1 Announce Type: new Abstract: Climate change research increasingly requires AI systems that reason across text, dynamic visual content, and scientific figures, yet existing climate QA benchmarks are small, mostly textual, and cover a narrow range of models. We… 20

arXiv — Machine Learning research 22d ago DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction arXiv:2606.10243v1 Announce Type: new Abstract: Offsite conversion rate (OCVR) prediction is an important ranking problem in computational recommendation systems.
This task presents a modeling challenge: click signals are abundant and exhibit short temporal horizons, whereas… 25

arXiv — NLP / Computation & Language research 22d ago Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs arXiv:2606.10722v1 Announce Type: new Abstract: We study dense-to-sparse continual training as a way to construct channel-sparse large language models from dense checkpoints. Starting from a Qwen2.5-8B dense backbone, we continue training at 32K context and introduce a… 24

arXiv — NLP / Computation & Language research 22d ago Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning arXiv:2606.10796v1 Announce Type: new Abstract: Automatic Depression Detection (ADD) from clinical interviews is a pivotal task in computational mental health, yet it remains challenging due to two critical obstacles: 1) difficulty in modeling complex but sparsely distributed… 5

arXiv — NLP / Computation & Language research 22d ago Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction arXiv:2606.10279v1 Announce Type: cross Abstract: Supervised fine-tuning with synthetic rationale data is widely assumed to improve language model performance on clinical prediction tasks by teaching models not just what to predict but why. We test this assumption on five-year… 28

TechCrunch — AI news-outlet 22d ago Anthropic’s Fable 5 can make weirdly fun video games with the click of a button Anthropic's Claude Fable 5 is going to be a big hit with the web's vibe coders. 27

llama.cpp releases dev-tools 22d ago b9586: webui: implement pinned conversations support (#21387) webui: implement pinned conversations support webui: linter/prettier pass Fix the unused handleMobileSidebarItemClick from the component.
the search should find pinned conversations as well Co-authored-by: Pascal admin@serveurperso.com Co-authored-by: Pascal… 24

Anthropic SDK (Python) releases dev-tools 22d ago v0.108.0 0.108.0 (2026-06-09) Full Changelog: v0.107.1...v0.108.0 Features api: add support for claude-mythos-5 and claude-fable-5, with support for server-side fallbacks on refusal (6b76649) client: adds client-side fallbacks middleware for API providers that do not support… 12

GitHub Blog — AI & ML official-blog 22d ago From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI Custom agents let GitHub Copilot CLI understand your stack and team workflows, turning one-off terminal prompts into repeatable, reviewable processes. 20

NVIDIA Developer Blog official-blog 22d ago Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodipine,... 9

Hugging Face Daily Papers research 23d ago Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory Abstract SkeMex is a self-evolving framework that enhances medical agents through structured skill memory, improving long-term clinical reasoning by distinguishing useful experiences and governing memory retention based on contextual utility.
Generated by… 32

arXiv — Machine Learning research 23d ago TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation arXiv:2606.07569v1 Announce Type: new Abstract: Accurate carbon emission monitoring is critical for climate policy and emerging regulatory mechanisms such as the EU Carbon Border Adjustment Mechanism, yet city-level high-frequency monitoring data remain extremely scarce,… 31

arXiv — Machine Learning research 23d ago HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning arXiv:2606.07621v1 Announce Type: new Abstract: Edge services increasingly use federated learning to personalize on-device models while keeping sensitive data local. In practice, deployments must handle heterogeneity in both client resources and local data distributions.… 24

arXiv — Machine Learning research 23d ago BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing arXiv:2606.07692v1 Announce Type: new Abstract: Foundation models for wearable biosignals have matched or exceeded supervised specialists across a range of clinical tasks, yet all rely on modalities that require deliberate user action--wearing a device or visiting a sleep lab.… 14

arXiv — Machine Learning research 23d ago EvoCSFL: Surrogate-Assisted Evolutionary Client Selection for Efficient and Robust Federated Learning arXiv:2606.07702v1 Announce Type: new Abstract: The heterogeneity of client data and systems makes it difficult to achieve satisfactory convergence speed and robustness in federated learning with random client selection.
To address this issue, this paper proposes a… 29

arXiv — Machine Learning research 23d ago Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling arXiv:2606.07898v1 Announce Type: new Abstract: High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key… 16

arXiv — Machine Learning research 23d ago SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification arXiv:2606.08037v1 Announce Type: new Abstract: Electrocardiogram (ECG) classification models often suffer from severe label scarcity, making semi-supervised learning (SSL) an attractive strategy for reducing annotation costs. In clinical settings, however, unlabeled pools… 26

r/LocalLLaMA community 23d ago I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU I fine-tuned NVIDIA's Parakeet TDT 0.6B v2 for clinical speech and am releasing the weights as Omi Med STT v1 (CC-BY-4.0). Disclosure: I'm the founder of Omi Health and built this. Happy to dig into the training mix, benchmark, failure cases, quantization, or anything else. The… 14

r/LocalLLaMA community 23d ago Here's a llama.cpp CLI Command builder. No accounts or sign up. No email requirements. No pop-ups and no cookies. No ads. Info is saved locally in your browser so you dont lose any progress. Its got every single flag and argument that could be found in the documentation. Tool tips are added to everything. Every field… 19

Vercel — AI dev-tools 23d ago Domain Search is now available through the Vercel CLI You can now use the Vercel CLI to search domains. Using the vercel domains search command, you can supply a domain name and retrieve availability and price results for all TLDs that Vercel supports.
You can also filter by TLD, apply sorting, and filter out unavailable domains.… 7

Vercel — AI dev-tools 23d ago How Fern runs multi-tenant docs for Webflow and ElevenLabs on Vercel Fern on Vercel 3x faster time to first byte Page load times reduced by 80% 6 million+ page views per month from 1 million+ unique visitors 65% of the platform migrated from Pages Router to App Router in 7 days Fern helps companies ship developer documentation and SDKs, running… 4

llama.cpp releases dev-tools 23d ago b9562 mtmd : add video input support (#24269) wip ok: lazy bitmap API remember to free lazy text wip add mtmd_helper_video support video input on server (base64 input) add MTMD_VIDEO config add timestamp update CLI cli: allow auto-completion for video add --video arg fix build… 22

llama.cpp releases dev-tools 23d ago b9559 cli: fix spinner not show during prompt processing (#24283) macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu… 10

r/LocalLLaMA community 24d ago llama-launcher Release Hello everyone, I've been working on a point and click GUI to make tinkering with llama-server flags much quicker and easier, I thought I'd share for anyone else who might be interested. It's also great for anyone new to llama.cpp that is looking to get into it and doesn't want… 7

r/MachineLearning community 24d ago Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D] I've been building agents for about a year and recently shipped one for a client running ~140 MCP-exposed tools at peak. Along the way I made the canonical mistake. I used cosine similarity over tool description embeddings to pick which tools the model could see per turn.
Worked… 9

r/LocalLLaMA community 24d ago Meddies PII: An Open Multilingual De-identification Model for Clinical Text A clinical AI model does not need to know who the patient is to reason clinically. It needs the symptoms, medications, lab results, diagnosis history, and treatment course. The problem is that in real medical records, those facts usually sit next to identifiers: names, record… 38

Ars Technica — AI news-outlet 24d ago The weather and climate science AI revolution isn’t revolutionary Machine learning has its limits—how is it being used? 21

arXiv — Machine Learning research 24d ago The Identity Trap in EEG Foundation Models: A Diagnostic Audit arXiv:2606.06647v1 Announce Type: new Abstract: Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or… 38