Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Qwen just released Qwen-AgentWorld-35B-A3B — a 35B-parameter MoE with only ~3B active parameters per token.
The interesting part: this is not positioned as a standard chat/instruction model or a full autonomous agent. It is a language world model trained to predict what an environment would return after an agent takes an action.
It covers seven agent interaction domains:
MCP / tool calling
Search
Terminal
Software engineering
Android
Web
Operating-system GUI interactions
The intended use seems to be simulating the environment side of an agent loop: given the action history and a new tool/GUI action, predict the next observation/state. That could be useful for agent training, offline evaluation, synthetic trajectories, testing tool-use workflows, or building sandbox-like environments without constantly running the real tools.
[link] [comments]
More from r/LocalLLaMA
-
What's in your RAG?
Jul 2
-
Palantir CEO rages against closed models
Jul 2
-
A cheap trick for reliable structured output: feed the validation error back into the retry
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.