📧 OpenEnv: Email Triage System

title

Email Triage System

emoji

📧

colorFrom

blue

colorTo

purple

sdk

docker

app_file

server/app.py

pinned

false

📧 OpenEnv: Email Triage System

A complete, real-world OpenEnv environment for training and evaluating AI agents on the complex task of professional email triage.

🌟 Overview

The Email Triage System simulates the workflow of a digital assistant managing a high-volume inbox. Agents must process incoming emails through a multi-stage pipeline:

🔍 Classification: Determine the nature of the email (Spam, Important, Support, etc.).
🎯 Intent Detection: Extract the specific user need (Complaint, Pricing, Booking, etc.).
✍️ Reply Generation: Draft a contextually accurate and professional response.

This environment provides a structured, reproducible benchmark for evaluating an agent's ability to maintain state, follow business logic, and generate high-quality outputs.

🚀 Key Features

Full OpenEnv Compliance: Implements the complete step(), reset(), and state() API.
Typed Pydantic Models: Strictly enforced schemas for Observations, Actions, and Rewards.
Multi-Stage Trajectory: Episodes involve sequential decision-making, moving from classification to drafting.
Sophisticated Graders: Deterministic reward function with partial progress signals and reasoning quality checks.
Baseline Included: Reproducible inference script using OpenAI-compatible APIs.

📊 Task & Difficulty Levels

Task ID	Name	Difficulty	Description
`task_easy`	Email Classification	Easy	Simple spam vs. ham detection with clear triggers.
`task_medium`	Intent Detection	Medium	Understanding customer issues like card activation or billing.
`task_hard`	Drafting Reply	Hard	Generating professional emails that resolve user queries.

🛠️ Environment Specification

Action Space (`EmailTriageAction`)

action_type: Current stage (classification, intent, reply).
content: The agent's decision or text output.
reasoning: String explaining the logic behind the action.
confidence: Float [0.0 - 1.0] representing agent's certainty.

Observation Space (`EmailTriageObservation`)

email_text: The content of the email being triaged.
current_stage: Which stage the environment is in.
history: Detailed log of previous actions and their fine-grained scores.
reward: Scalar reward for the current step.
message: Natural language feedback from the grader.

🏆 Reward Function

The environment provides a dense reward signal decomposed into three components:

Label Correctness (60%): Accuracy relative to ground truth.
Reasoning Quality (30%): Evaluation of the reasoning field for keyword relevance and length.
Formatting & Metadata (10%): Proper usage of action_type and confidence.

Difficulty-based ceilings are applied to the final score:

Easy: max 0.90
Medium: max 0.80
Hard: max 0.70

📦 Installation & Setup

1. Prerequisite

Ensure you have Python 3.10+ and docker installed.

2. Local Setup

# Clone the repository
git clone https://github.com/Risbern21/OpenENV-Email-Triage-System.git
cd OpenENV-Email-Triage-System

# Create a virtual environment
python3 -m venv venv
source venv/bin/activate

# Install dependencies
pip install -e .

3. Start the Environment

# Run via Uvicorn
uvicorn server.app:app --host 0.0.0.0 --port 7860

4. Running Validation

# Run the built-in validator
./validate.sh http://localhost:7860

🤖 Baseline Inference

To run the baseline agent against the environment:

Set your OPENAI_API_KEY (or HF_TOKEN) in .env.
Run:

python inference.py

📄 License

MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
graders		graders
server		server
tasks		tasks
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
email_triage_env.ipynb		email_triage_env.ipynb
environment.py		environment.py
inference.py		inference.py
models.py		models.py
openenv.yaml		openenv.yaml
policies.py		policies.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock
validate.sh		validate.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📧 OpenEnv: Email Triage System

🌟 Overview

🚀 Key Features

📊 Task & Difficulty Levels

🛠️ Environment Specification

Action Space (`EmailTriageAction`)

Observation Space (`EmailTriageObservation`)

🏆 Reward Function

📦 Installation & Setup

1. Prerequisite

2. Local Setup

3. Start the Environment

4. Running Validation

🤖 Baseline Inference

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📧 OpenEnv: Email Triage System

🌟 Overview

🚀 Key Features

📊 Task & Difficulty Levels

🛠️ Environment Specification

Action Space (EmailTriageAction)

Observation Space (EmailTriageObservation)

🏆 Reward Function

📦 Installation & Setup

1. Prerequisite

2. Local Setup

3. Start the Environment

4. Running Validation

🤖 Baseline Inference

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Action Space (`EmailTriageAction`)

Observation Space (`EmailTriageObservation`)

Packages