Qwen-AgentWorld: Language World Models for General Agents

(arxiv.org)

31 points | by ilreb 3 hours ago

4 comments

psc007 25 minutes ago
Eli5? What is this compared to a regular llm assistant model like the base qwen?
[-]
- gavmor 2 minutes ago
  A regular LLM acts as a "policy," mapping a current state to a specific action (states → actions). Their new LLM acts as a "world model," mapping a current state and a chosen action to a predicted future state ((states, actions) → subsequent states). Instead of deciding "what to do," its explicit objective is to predict the exact environment observation that will result from the interaction history and the agent's current action.
  I assumed at first that it was trained on synthetic data, but they actually went and deployed real physical hosts and virtual machines (e.g. Ubuntu, macOS, and Android) and browsers. They ran agentic systems on these continuously and recorded the actual, real-world interactions.
  So it's an LLM that infers next state, or outcome,as structured data e.g. literal HTML code, UI view hierarchies, or accessibility trees.
Tepix 51 minutes ago
The labels of the very first chart (figure 1, bottom left) are obviously wrong which casts a doubt on the entire paper.
[-]
- dudisubekti 8 minutes ago
  This label?
  > Figure 1: Overview of Qwen-AgentWorld. Top: Qwen-AgentWorld is a unified native language world model across seven domains. Bottom: We explore two complementary strategies for applying world modeling to enhance language agents (mainly using the 35B-A3B model as agent): Decouple and Unify , where the world model serves as the environment simulator and agent foundation model, respectively.
  Where is the mistake?
verdverm 1 hour ago
35B model from the qwen-3.5 line
https://github.com/QwenLM/Qwen-AgentWorld
https://huggingface.co/Qwen/Qwen-AgentWorld-35B-A3B
[-]
- khimaros 40 minutes ago
  unsloth, activate!
stingraycharles 54 minutes ago
[dead]