Web World Models

Reading time: 6 minute
...

📝 Original Info

  • Title: Web World Models
  • ArXiv ID: 2512.23676
  • Date: 2025-12-29
  • Authors: Researchers from original ArXiv paper

📝 Abstract

Language agents increasingly require persistent worlds in which they can act, remember, and learn. Existing approaches sit at two extremes: conventional web frameworks provide reliable but fixed contexts backed by databases, while fully generative world models aim for unlimited environments, but the world is constructed primarily through generation, making it harder to maintain a fixed, deterministic global framework, reducing controllability. In this work, we introduce the Web World Model (WWM), a middle ground where world state and "physics" are implemented in ordinary web code to ensure logical consistency, while large language models generate context, narratives, and high-level decisions on top of this structured latent state. We build a suite of WWMs on a realistic web stack, including an infinite travel atlas grounded in real geography, fictional galaxy explorers, web-scale encyclopedic and narrative worlds, and simulation-and game-like environments. Across these systems, we identify practical design principles for WWMs: separating code-defined rules from model-driven imagination, representing latent state as typed web interfaces, and utilizing deterministic generation to achieve unlimited but structured exploration. Our results suggest that web stacks themselves can serve as a scalable substrate for world models, enabling controllable yet open-ended environments.

💡 Deep Analysis

Deep Dive into Web World Models.

Language agents increasingly require persistent worlds in which they can act, remember, and learn. Existing approaches sit at two extremes: conventional web frameworks provide reliable but fixed contexts backed by databases, while fully generative world models aim for unlimited environments, but the world is constructed primarily through generation, making it harder to maintain a fixed, deterministic global framework, reducing controllability. In this work, we introduce the Web World Model (WWM), a middle ground where world state and “physics” are implemented in ordinary web code to ensure logical consistency, while large language models generate context, narratives, and high-level decisions on top of this structured latent state. We build a suite of WWMs on a realistic web stack, including an infinite travel atlas grounded in real geography, fictional galaxy explorers, web-scale encyclopedic and narrative worlds, and simulation-and game-like environments. Across these systems, we iden

📄 Full Content

Web World Models Jichen Feng∗1,3 Yifan Zhang∗1 Chenggong Zhang∗2 Yifu Lu∗1 Shilong Liu1† Mengdi Wang1† 1Princeton University 2University of California, Los Angeles 3University of Pennsylvania Project Page: https://princeton-ai2-lab.github.io/Web-World-Models/ Abstract Language agents increasingly require persistent worlds in which they can act, remember, and learn. Existing approaches sit at two extremes: conventional web frameworks provide reliable but fixed contexts backed by databases, while fully generative world models aim for unlimited environments, but the world is constructed primarily through generation, making it harder to maintain a fixed, deterministic global framework, reducing controllability. In this work, we introduce the Web World Model (WWM), a middle ground where world state and “physics” are implemented in ordinary web code to ensure logical consistency, while large language models generate context, narratives, and high-level decisions on top of this structured latent state. We build a suite of WWMs on a realistic web stack, including an infinite travel atlas grounded in real geography, fictional galaxy explorers, web-scale encyclopedic and narrative worlds, and simulation- and game-like environments. Across these systems, we identify practical design principles for WWMs: separating code-defined rules from model-driven imagination, representing latent state as typed web interfaces, and utilizing deterministic generation to achieve unlimited but structured exploration. Our results suggest that web stacks themselves can serve as a scalable substrate for world models, enabling controllable yet open-ended environments. Web Framework Web World Model World Model Context is bounded by DB- backed content. Framework sets rules; LLMs fill in content. Models generate the whole world. Controllability Controllability Unlimited Context Text/code based env Controllability Text/code based env Unlimited Context Unlimited Context Text/code based env Figure 1 Left: Traditional Web Frameworks fix context in databases, limiting scalability. Center: The Web World Model (Ours) decouples logic from content, generating unlimited context via LLMs upon a code-based physics layer without heavy data storage. Right: Fully generative world models can produce unlimited context and rich video/3D content, but when the world is constructed primarily through generation, it is harder to maintain a fixed, deterministic global framework, reducing controllability. ∗Equal contribution; †Corresponding authors. 1 arXiv:2512.23676v1 [cs.AI] 29 Dec 2025 1 Introduction Modern language agents increasingly need persistent environments in which they can act, remember, and grow. Today, most practical systems sit at one of two extremes (Figure 1). On one side, conventional web frameworks operate with a fixed context: application state is stored in databases and exposed through hand-crafted endpoints. This design offers reliability, robust engineering tooling, and clear security boundaries, but the world agents can inhabit is ultimately bounded by the schema developers anticipated in advance. On the other side, general world models attempt to generate environments directly in the latent space of a model, in principle supporting unlimited context and arbitrary environment types. However, these fully generative worlds are difficult to control, hard to debug, and costly to scale, and they often lack the structural guarantees needed for long-running applications. As a result, there is a missing middle ground between fixed-context web applications and unconstrained world models. We propose to fill this gap with the notion of a Web World Model (WWM). A Web World Model is a world whose state and “physics” are defined by ordinary web code (e.g., TypeScript modules, HTTP handlers, and database schemas), while large language models generate context and narratives on top of this structured latent state. In this view, code specifies what kinds of entities exist, how they interact, and which actions are possible; the model is invoked to enrich these entities with descriptions, stories, or task-specific reasoning. WWMs thus inherit the controllability, observability, and tooling of web frameworks, yet they can procedurally expand to an effectively unlimited state space by using language models to synthesize new content on demand. Compared with fixed web systems, WWMs are not bound to a small, static context; compared with fully generative world models, they offer a programmable substrate that can be tested, versioned, and deployed using standard web infrastructure. Figure 2 Illustrations of a series of Web World Models introduced in this work. Details are presented in Section 3. To make this idea concrete, we implement a series of Web World Models on a unified web technology stack. These systems span a wide range of domains. An Infinite Travel Atlas turns the 2 real Earth into an explorable atlas, where any coordinate can be expanded into rich pl

…(Full text truncated)…

📸 Image Gallery

Alchemy.png Alchemy1.png Bookshelf_entry.webp Bookshelf_reader.webp Card.png Card1.png GalaxyTravel.webp Teaser2.jpg Voyager.png Voyager2.png WorldTravel.webp demo1.png demo10.png demo2.png demo3.png demo4.png demo5.png demo6.png demo7.png demo8.png demo9.png pb.webp pc.webp wwm-v5.webp

Reference

This content is AI-processed based on ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut