UXSim: Towards a Hybrid User Search Simulation

Abstract

Simulating nuanced user experiences within complex interactive search systems poses distinct challenge for traditional methodologies, which often rely on static user proxies or, more recently, on standalone large language model (LLM) agents that may lack deep, verifiable grounding. The true dynamism and personalization inherent in human-computer interaction demand a more integrated approach. This work introduces UXSim, a novel framework that integrates both approaches. It leverages grounded data from traditional simulators to inform and constrain the reasoning of an adaptive LLM agent. This synthesis enables more accurate and dynamic simulations of user behavior while also providing a pathway for the explainable validation of the underlying cognitive processes.

Key Features

Unified Architecture: Single simulation runner with pluggable components
Multiple Decision Policies: From simple component-based to sophisticated cognitive loops
Flexible Environments: Mock, web browser, abstract search, and API environments
Rich Agent Memory: Embedding-based memory with importance scoring
Persona-Driven: Realistic user personas guide agent behavior
Extensible Design: Easy to add new components and environments

Architecture

UXSim follows a Unified Modularity design with these core components:

1. Central Simulation Runner

simulation.py: Manages the main agent-environment interaction loop
Simple, consistent interface regardless of underlying complexity

2. Agent System

agent.py: Central agent with persona, memory, and cognitive state
Decision Policies: Pluggable strategies for action selection
- ComponentPolicy: Classic simulation using individual components
- CognitiveLoopPolicy: Full Perceive-Plan-Reflect-Act cognitive cycle

3. Simulator Components

Query Generators: Create search queries from persona intent
Action Selectors: Choose actions from available page elements
Relevance Classifiers: Determine content relevance to user goals

4. Environments

Mock Environment: Fast, deterministic testing environment
Web Browser Environment: High-fidelity browser automation (Playwright/Selenium)
Abstract Search Environment: Internal search indices (Whoosh)
API Environment: External search engine APIs

Quick Start

Installation

# Basic installation
pip install uxsim

# With web browser support
pip install uxsim[web]

# With LLM support
pip install uxsim[llm]

# Full installation
pip install uxsim[all]

Environment Setup

UXSim requires API keys for LLM providers and supports various environment variables:

Quick Setup

uxsim setup-env

# Edit .env file with your actual keys
nano .env

Required API Keys

OpenAI: Get your API key from OpenAI Platform
AWS Bedrock (optional): Configure AWS credentials

Environment Variables

Create a .env file in your project root:

# Required: OpenAI API Key
OPENAI_API_KEY=sk-proj-your-actual-key-here

# Optional: Browser display mode (false = show browser GUI)
HEADLESS=false

# Optional: AWS credentials (for AWS Bedrock)
AWS_ACCESS_KEY_ID=AKIA...
AWS_SECRET_ACCESS_KEY=your-secret-key

Or export them in your shell:

export OPENAI_API_KEY="sk-proj-your-actual-key-here"
export HEADLESS=false

Browser Setup

For web browser simulations, ensure Chrome is installed:

macOS: brew install google-chrome
Ubuntu: sudo apt install google-chrome-stable
Windows: Download from Google Chrome

Basic Usage

Create a persona:

uxsim create-persona --name "Sarah" --intent "buy wireless headphones" --age 28

Run a mock simulation (fast, no API keys needed):

uxsim run -p sarah_persona.json --policy component --environment mock

Run a web browser simulation (requires API key):

# Set up environment variables first
export OPENAI_API_KEY="sk-proj-your-actual-key"
export HEADLESS=false  # Show browser GUI

# Run with cognitive loop policy
uxsim run -p sarah_persona.json --policy cognitive_loop --environment web_browser

View results:

ls output/
# simulation_results.json, agent_memory.json, step_trace.json

Python API

import asyncio
from uxsim import SimulationConfig, run_simulation

# Configure simulation
config = SimulationConfig(
    persona_path="persona.json",
    environment_config={"type": "mock"},
    agent_policy="component",
    max_steps=20,
    output_dir="results"
)

# Run simulation
results = asyncio.run(run_simulation(config))
print(f"Completed in {results['execution']['steps_taken']} steps")

Example Scenarios

Scenario A: Classic, Fast Simulation

# exp_classic_trec.yml
agent_policy: "component"
environment:
  type: "abstract_search"
  index_path: "trec_index"
max_steps: 10

Workflow: Agent uses TrecQueryGenerator → AbstractSearchEnv processes with Whoosh → Fast, reproducible results

Scenario B: High-Fidelity Agent Emulation

# exp_agent_google.yml
agent_policy: "cognitive_loop"
environment:
  type: "web_browser"
  start_url: "https://www.google.com"
  headless: false
max_steps: 50

Workflow: Agent uses full cognitive cycle → WebBrowserEnv controls real browser → Realistic user simulation

Decision Policies

Component Policy

Chains individual components for classic simulation:

Query Generator → Action Selector → Relevance Classifier
Fast, deterministic, scalable
Perfect for batch experiments

Cognitive Loop Policy

Implements full cognitive cycle:

Perceive: Extract meaningful insights from observations
Plan: Generate/update plans based on current situation
Reflect: Analyze past actions and generate insights
Act: Select concrete actions based on plans

Personas

Personas drive agent behavior with rich characteristics:

{
  "persona": "Persona: Sarah\n\nBackground:\nTech-savvy marketing professional...",
  "intent": "buy wireless headphones for work calls",
  "age": 28,
  "gender": "female",
  "demographics": {...},
  "shopping_habits": "Research-oriented, values quality...",
  "professional_life": "Marketing manager at tech startup...",
  "personal_style": "Minimalist, functional preferences..."
}

Configuration

YAML Configuration

name: "My Simulation"
agent_policy: "cognitive_loop"
environment:
  type: "web_browser"
  start_url: "https://example.com"
  headless: true
max_steps: 30
output_dir: "results"

Batch Simulations

{
  "personas": ["persona1.json", "persona2.json"],
  "simulation_config": {
    "agent_policy": "component",
    "environment_config": {"type": "mock"},
    "max_steps": 20
  }
}

Output and Analysis

UXSim generates comprehensive results:

{
  "persona": {
    "name": "Sarah",
    "intent": "buy wireless headphones",
    "demographics": {...}
  },
  "execution": {
    "steps_taken": 15,
    "duration_seconds": 45.2,
    "completed": true
  },
  "agent_stats": {
    "total_memories": 28,
    "by_kind": {"observation": 10, "action": 8, "thought": 10},
    "api_calls": 15
  },
  "final_observation": {
    "url": "https://store.com/headphones",
    "has_content": true
  }
}

Development

Project Structure

uxsim/
├── core/                    # Core types and exceptions
├── agent.py                 # Central agent implementation
├── simulation.py            # Main simulation runner
├── simulators/              # Decision-making components
│   ├── policies/           # High-level decision policies
│   ├── components/         # Individual components
│   └── cognitive_loop/     # Cognitive cycle modules
├── environments/           # Environment implementations
├── personas/               # Persona management
├── configs/                # Example configurations
└── cli.py                  # Command-line interface

Adding New Components

New Query Generator:

from uxsim.simulators.components.query_generators import BaseQueryGenerator

class MyQueryGenerator(BaseQueryGenerator):
    async def generate_query(self, context):
        # Your implementation
        return query

New Environment:

from uxsim.environments.base_env import BaseEnvironment

class MyEnvironment(BaseEnvironment):
    async def observe(self):
        # Return Observation
    
    async def step(self, action):
        # Execute action, return new Observation

Testing

# Run tests
pytest

# Run with coverage
pytest --cov=uxsim

# Test specific component
pytest tests/test_agent.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
figure		figure
output		output
runs/amazon_budget_shopper		runs/amazon_budget_shopper
uxsim.egg-info		uxsim.egg-info
uxsim		uxsim
.DS_Store		.DS_Store
.env		.env
README.md		README.md
amazon_simulation.log		amazon_simulation.log
amazon_simulation.py		amazon_simulation.py
example_usage.py		example_usage.py
persona.json		persona.json
pyproject.toml		pyproject.toml
sarah_persona.json		sarah_persona.json
test_amazon.py		test_amazon.py
test_persona.json		test_persona.json
test_uxsim.py		test_uxsim.py
uxsim_examples.py		uxsim_examples.py

Folders and files

Latest commit

History

Repository files navigation

UXSim: Towards a Hybrid User Search Simulation

Abstract

Key Features

Architecture

1. Central Simulation Runner

2. Agent System

3. Simulator Components

4. Environments

Quick Start

Installation

Environment Setup

Quick Setup

Required API Keys

Environment Variables

Browser Setup

Basic Usage

Python API

Example Scenarios

Scenario A: Classic, Fast Simulation

Scenario B: High-Fidelity Agent Emulation

Decision Policies

Component Policy

Cognitive Loop Policy

Personas

Configuration

YAML Configuration

Batch Simulations

Output and Analysis

Development

Project Structure

Adding New Components

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages