Hugging Face Daily Papers · June 30, 2026 · 3 min read

LLM Program Optimization via Retrieval Augmented Search

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Like Read original ↗

ACL 2026 Findings</p>\n","updatedAt":"2026-06-30T18:28:27.116Z","author":{"_id":"638d4b18ebda86f24b5f6391","avatarUrl":"/avatars/e8c518457c25c6444084ec56c28b20c1.svg","fullname":"Sagnik Anupam","name":"sagnikanupam","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.4728822112083435},"editors":["sagnikanupam"],"editorAvatarUrls":["/avatars/e8c518457c25c6444084ec56c28b20c1.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2501.18916","authors":[{"_id":"6a440a1741f04ae4d7ad95bf","name":"Sagnik Anupam","hidden":false},{"_id":"6a440a1741f04ae4d7ad95c0","name":"Alexander Shypula","hidden":false},{"_id":"6a440a1741f04ae4d7ad95c1","name":"Osbert Bastani","hidden":false}],"publishedAt":"2026-06-23T00:00:00.000Z","submittedOnDailyAt":"2026-06-30T00:00:00.000Z","title":"LLM Program Optimization via Retrieval Augmented Search","submittedOnDailyBy":{"_id":"638d4b18ebda86f24b5f6391","avatarUrl":"/avatars/e8c518457c25c6444084ec56c28b20c1.svg","isPro":false,"fullname":"Sagnik Anupam","user":"sagnikanupam","type":"user","name":"sagnikanupam"},"summary":"Recent work has demonstrated the potential of large language models (LLMs) for program optimization, a key challenge in programming languages. We propose a blackbox adaptation method called Retrieval Augmented Search (RAS) that performs beam search over candidate optimizations; at each step, it retrieves in-context examples from a given training dataset of slow-fast program pairs to guide the LLM. Critically, we find that performing contextual retrieval based on an LLM-generated natural language description significantly outperforms retrieval based on the source code. We also propose AEGIS, a method for improving interpretability by decomposing training examples into ''atomic edits'' that are significantly more incremental in nature. We show that RAS performs up to 2.06times better than prior state-of-the-art blackbox adaptation strategies on optimizing C++ programs, and that AEGIS performs up to 1.37times better while making significantly smaller edits. We also show that using RAS improves the mean runtime percentile of Python programs by 10.27 compared to baselines.","upvotes":2,"discussionId":"6a440a1741f04ae4d7ad95c2","projectPage":"https://sagnikanupam.com/papers/llmprogramoptimization/index.html","githubRepo":"https://github.com/sagnikanupam/llmprogramoptimization","githubRepoAddedBy":"user","ai_summary":"Blackbox adaptation methods using retrieval-augmented search and atomic edit decomposition improve program optimization performance for both C++ and Python code.","ai_keywords":["large language models","program optimization","blackbox adaptation","retrieval augmented search","beam search","in-context examples","natural language description","source code","AEGIS","atomic edits","C++","Python"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct","githubStars":2,"organization":{"_id":"633cfd005d7b7741bef6aa99","name":"upenn","fullname":"University of Pennsylvania","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/68e396f2b5bb631e9b2fac9a/FFSbROS6R8vPMIyXb7iIU.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"63ac5701c21e60a3e9b58aa7","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63ac5701c21e60a3e9b58aa7/g6EX7diOpuA94R2ab-rZC.png","isPro":true,"fullname":"Dipankar Sarkar","user":"dipankarsarkar","type":"user"},{"_id":"638d4b18ebda86f24b5f6391","avatarUrl":"/avatars/e8c518457c25c6444084ec56c28b20c1.svg","isPro":false,"fullname":"Sagnik Anupam","user":"sagnikanupam","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"organization":{"_id":"633cfd005d7b7741bef6aa99","name":"upenn","fullname":"University of Pennsylvania","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/68e396f2b5bb631e9b2fac9a/FFSbROS6R8vPMIyXb7iIU.png"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2501/2501.18916.md","query":{}}">

Papers

arxiv:2501.18916

LLM Program Optimization via Retrieval Augmented Search

Published on Jun 23

· Submitted by

Sagnik Anupam on Jun 30

University of Pennsylvania

Upvote

Authors:

Abstract

Blackbox adaptation methods using retrieval-augmented search and atomic edit decomposition improve program optimization performance for both C++ and Python code.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

Recent work has demonstrated the potential of large language models (LLMs) for program optimization, a key challenge in programming languages. We propose a blackbox adaptation method called Retrieval Augmented Search (RAS) that performs beam search over candidate optimizations; at each step, it retrieves in-context examples from a given training dataset of slow-fast program pairs to guide the LLM. Critically, we find that performing contextual retrieval based on an LLM-generated natural language description significantly outperforms retrieval based on the source code. We also propose AEGIS, a method for improving interpretability by decomposing training examples into ''atomic edits'' that are significantly more incremental in nature. We show that RAS performs up to 2.06times better than prior state-of-the-art blackbox adaptation strategies on optimizing C++ programs, and that AEGIS performs up to 1.37times better while making significantly smaller edits. We also show that using RAS improves the mean runtime percentile of Python programs by 10.27 compared to baselines.

View arXiv page View PDF Project page GitHub 2 Add to collection

Community

sagnikanupam

Paper submitter about 7 hours ago

•

edited about 7 hours ago

ACL 2026 Findings

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2501.18916

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2501.18916 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2501.18916 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2501.18916 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

No comments yet. Sign in and be the first to say something.

LLM Program Optimization via Retrieval Augmented Search

Abstract

Community

Models citing this paper 0

Datasets citing this paper 0

Spaces citing this paper 0

Collections including this paper 0

Discussion (0)

More from Hugging Face Daily Papers