Multilingual-Multimodal-NLP/LoopCoder-V2 · Hugging Face
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
|
LoopCoder-V2 LoopCoder-v2 is a 7B instruction-tuned code model based on the Parallel Loop Transformer (PLT). The model studies test-time computation scaling through repeated application of shared Transformer blocks while keeping the parameter count fixed. The released checkpoint is the two-loop PLT variant ( Highlights
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation ScalingTL;DR. For Parallel Loop Transformers (PLT), more looping is not better. A 7B coder that loops just once more than usual (two passes total) lifts SWE-bench Verified from 43.0 → 64.4, while three or more loops regress. We explain this with a gain–cost view of looping and provide diagnostics for picking the loop count without brute-force sweeps. OverviewLooped Transformers scale latent computation by repeatedly applying a shared block, but sequential looping increases latency and KV-cache memory with the loop count. Parallel Loop Transformers (PLT) alleviate this with two mechanisms:
Once cost is flattened, loop count becomes a free design knob — and the question becomes: how many loops are actually worth it? We study this through a gain–cost lens: an extra loop may refine representations (gain), but CLP also introduces a roughly constant positional mismatch at each loop boundary (cost), which we quantify with an intrinsic offset cost Ω(r). We instantiate the study with LoopCoder-v2, a family of 7B PLT coders trained from scratch on 18T tokens of mixed text and code (1:1, 100+ programming languages), under matched training, instruction tuning, and evaluation. [link] [comments] |
More from r/LocalLLaMA
-
Palantir CEO rages against closed models
Jul 2
-
A cheap trick for reliable structured output: feed the validation error back into the retry
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
-
[Benchmark] Kimi K2.7 Code Q3 on Mac Studio M3 Ultra + RTX PRO 6000 over llama.cpp RPC: prefill improves, no changes in token generation/decode
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.