Glossary

Controller: A skill agent that makes decisions based on programmed algorithms. Also called a programmed skill agent.

Decisions: The outputs of a Composabl system - its instructions for controlling the system

Design patterns: Common structures that can be used to quickly create multi-agent systems

Episode: An entire run through the task

Functional Pattern: A machine teaching design pattern used to orchestrate skills in sequences

Iteration: One decision during agent training or performance

Learned skill agent: A skill agent that uses DRL to make decisions and learn by practicing in simulation

Orchestration: Arranging agents as modular building blocks to work together to complete tasks

Orchestrator: A special type of skill agent that assigns decision-making control to the right decision-making skill agent based on current conditions

Perceptor: An ML model or other algorithm that interprets sensor data

Plan-Execute Pattern: A machine teaching design pattern used to orchestrate skill agents in pairs that work together to make decisions

Programmed skill agent: A skill agent that makes decisions based on programmed algorithms. Also called a controller.

Project: A collection of agents for the same use case that share the same simulator and top-level goal

Scenarios: Conditions that are associated with specific skills

Sensors: The part of the system that takes in information from the simulator or real environment - the eyes, ears, and other senses of the system

Simulator: The virtual environment where agents practice and improve performance

Skill agents: Modules within a multi-agent system that make decisions to complete all or part of a task

Strategy Pattern: A machine teaching design pattern used to orchestrate skill agents in hierarchies

Teacher: An algorithm that creates a skill agent that uses DRL to learn to make decisions

Welcome to Composabl

Composabl is the platform for creating industrial-strength multi-agent AI systems that make high-impact decisions in the real world.

With Composabl, you can convert expert knowledge about how a process works into a team of agents with specialized skills that work together to make the right decision in every situation. These skills can be either programmed or learned through advanced AI techniques, and they are orchestrated so that the multi-agent system performs effectively in every part of the process and under any conditions. For skill agents that learn by practicing, Composabl trains them in realistic scenarios until the agent system can succeed at the task and outperform the alternatives.

Quick Links

Find articles on key topics with these links.

Try Out a Tutorial

Integrate with Composabl

You can use any model or Python algorithm with Composabl for training agent systems, adding perception, analysis, and communication, and making decisions. See how to configure different types of modules in the UI, and how to publish them via the data science workflow for agent system design, training, and deployment.

Create Modular Skill Agents

Composabl multi-agent systems are built on modular skills that break down a task into separate parts. Learn how to create skill agents to train with deep reinforcement learning.

Deploy Multi-Agent Systems

Once Composabl agentic systems are designed and trained, you can export them to the Composabl runtime to connect with your system. Learn how to deploy an agent within the runtime container and how to use Composabl's tools to analyze agent behavior during both training and deployment.

CLI Reference

Below, you can find the commands available in the Composabl CLI.

Login

| Command | Description |
| --- | --- |
| composabl login | Login to the Composabl platform |

Agent

| Command | Description |
| --- | --- |
| composabl agent train --json-path <PATH> | Train the agent based on the JSON structure provided |
| composabl agent visualize | Visualize the agent structure |

Sim (composabl sim)

| Command | Description |
| --- | --- |
| composabl sim new --name <NAME> --description <DESCRIPTION> --location <PATH> | Create a new sim template. This will create the boilerplate required to submit a sim |
| composabl sim validate <PATH> | Validate the simulator at the given path |
| composabl sim validate --path <PATH> | Validate the simulator at the given path |
| composabl sim validate --address <ADDRESS> | Validate the simulator running at the given address |
| composabl sim run <PATH> | Run a sim created with the command above locally (useful for development) |
| composabl sim list | List the sims associated with an organization in the Composabl UI |
| composabl sim list --local | List the Docker images for sims running locally |
| composabl sim status | Show the status of currently running local sims |
| composabl sim stop --sim-id <SIM_ID> | Stop the specified running sim |
| composabl sim stop | Stop all running sims |
| composabl sim mappings --address <ADDRESS> --image <IMAGE> | Get the observation and action space mappings for the sim at the given address or image |
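
For example, a typical local development loop with these commands might look like this (the sim name and paths are placeholders):

composabl sim new --name my-sim --description "Demo sim" --location ./my-sim
composabl sim validate ./my-sim
composabl sim run ./my-sim
composabl sim status
composabl sim stop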

Perceptor (composabl perceptor)

| Command | Description |
| --- | --- |
| composabl perceptor new --name <NAME> --description <DESCRIPTION> --location <PATH> | Create a new perceptor template. This will create the boilerplate required to submit a perceptor |
| composabl perceptor publish <PATH> | Publish the perceptor to the NoCode app |

Skill (composabl skill)

| Command | Description |
| --- | --- |
| composabl skill new --name <NAME> --description <DESCRIPTION> --location <PATH> | Create a new skill template. This will create the boilerplate required to submit a skill |
| composabl skill publish <PATH> | Publish the skill to the NoCode app |

Selector (composabl selector)

| Command | Description |
| --- | --- |
| composabl selector new --name <NAME> --description <DESCRIPTION> --location <PATH> | Create a new selector template. This will create the boilerplate required to submit a selector |
| composabl selector publish <PATH> | Publish the selector to the NoCode app |

Job (composabl job)

| Command | Description |
| --- | --- |
| composabl job list | List the running jobs |
| composabl job stop <JOB_ID> | Stop the running job |
| composabl job submit --path <PATH> --job-id <JOB_ID> --json <JSON> --template-path <TEMPLATE_PATH> | Submit a job for training |
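
For example (the path and job ID are placeholders, and some flags may not be needed for every job):

composabl job submit --path ./my-skill --job-id my-job
composabl job list
composabl job stop my-job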


Integrate a machine learning model

Use existing ML models for machine vision, prediction, or analysis in Composabl agent systems

Integrate a language model

Set up an LLM call to add communication to your agent system

Integrate a programmed algorithm

Configure any Python algorithm, such as a PID controller or optimization algorithm, as a skill agent in Composabl

Integrate third-party software

Use an API call to external software as a skill agent in Composabl

Integrate a simulator

Configure your simulator to work with Composabl

Publish to the platform

Upload configured models and algorithms to the Composabl UI with one CLI command

Create skill agents with goals and constraints

Create skills agents with subject matter expertise by configuring goals and constraints for learning

Create skill agents with rewards using the SDK

Create skill agents that learn with rewards and access additional teaching tools in Python

Publish to the platform

Upload skill agents configured with the SDK to the Composabl UI for training and deployment

Evaluate the performance of your multi-agent system

Evaluate performance using the Composabl benchmarking feature

Deploy a Composabl multi-agent system

Export a multi-agent system and connect to the Composabl runtime container

Connect the multi-agent system to your production system

Connect the Composabl runtime container to your system for deployment

Audit multi-agent system behavior with the historian

Use the Composabl historian to analyze system behavior in detail

Composabl SDK Documentation

Industrial Mixer

This tutorial will take you through the process of building agents for a realistic chemical manufacturing use case.

We wrote a white paper about this use case. Read the white paper to learn about the problem, the agent system designs, and agent system performance. Use this tutorial to build and train all the agents for yourself. The first thing you will do is create your use case in the Composabl UI.

  1. Learn about the use case (this page)

  2. Experiment with your own designs

About the Use Case

Why an industrial mixer?

Mixing and blending tasks are great use cases for intelligent agent systems because they are both complex and high-value. In the example in this tutorial, process improvements could lead to millions in ROI.

Use Case Overview

In the industrial mixer use case, raw materials are stirred together inside a tank, undergoing a reaction that produces the desired end product.

The goal of the process is to convert as much of the raw material as possible. But as the chemicals mix and the conversion occurs, the tank heats up. If the temperature gets too high, a condition called “thermal runaway” occurs, potentially causing explosions and fires.

To produce as much chemical as possible, the operator must constantly adjust the temperature in the tank, keeping it high enough to allow productivity but low enough to avoid any thermal runaway risk.

As in all machine teaching use cases, this process can be summarized in the form of a goal (maximize yield) and a constraint (avoid thermal runaway) that must be balanced against each other.

The process is controlled by adjusting the mixture's temperature in the tank using a "jacket" filled with coolant. Lowering the coolant temperature in the jacket lowers the temperature in the tank, decreasing the risk of thermal runaway.

However, cooling the tank can also reduce yield. By how much? The answer varies unpredictably – temperature changes affect chemical concentration differently at different parts of the reaction. That nonlinear relationship between temperature and yield is why this is a nuanced process that benefits so much from intelligent automation.

Simulating a Complex Reaction

The industrial mixer simulator uses principles of chemistry to model the behavior of the system in real life.

Learn more about the simulator

The reaction is modeled with continuous stirred tank reactor (CSTR) dynamics.
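
In the notation of the MPC controller code later in this tutorial, the concentration Ca and the temperature T in the tank evolve as

$$\frac{dC_a}{dt} = \frac{F}{V}(C_{af} - C_a) - k_0\,e^{-E/(RT)}\,C_a$$

$$\frac{dT}{dt} = \frac{F}{V}(T_f - T) - \frac{\Delta H}{\rho C_p}\,k_0\,e^{-E/(RT)}\,C_a - \frac{UA}{\rho C_p V}(T - T_c)$$

where F is the volumetric flow rate, V the reactor volume, Caf the inlet concentration, Tf the feed temperature, ΔH the heat of reaction, ρCp the density times heat capacity, UA the heat transfer coefficient times tank area, and Tc the cooling jacket temperature.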

For evaluating agent performance, yield is defined as the percentage of the product that is at or above the concentration defined by the manufacturer as meeting product specifications. The threshold for thermal runaway is set at 400 kelvin (K); in the real reaction, the threshold would vary somewhat depending on other factors.

The reaction is nuanced because the conversion rate and the temperature have a nonlinear correlation, expressed by the Arrhenius equation:
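
$$k = k_0\,e^{-E/(RT)}$$

where k is the reaction rate constant, k0 the pre-exponential factor, E the activation energy, R the ideal gas constant, and T the temperature. Because the rate depends exponentially on temperature, small temperature changes can produce large swings in conversion, which is what makes the control problem nonlinear.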

Agents can be tested with different levels of noise, random fluctuations that distort the ability to recognize patterns. All the agent results in this tutorial are tested with a noise level of 5%. Agents generally perform better with less noise, but as the noise level increases, some agent designs handle it better than others. The plan-execute pattern is the best of these agents at handling noise.

Benchmark

The current automation solution is a linear MPC controller. As the incumbent solution, this agent system's performance is the benchmark for the other designs.

The image below shows an MPC controller represented in Composabl's visual agent design system. The agent takes in sensor information about the temperature in the tank and the concentrations of the chemicals.

It passes that information to the skills layer of the agent system. The skills layer contains a single programmed skill agent: control reactor. This skill uses a mathematical model to determine the desired temperature set point for the tank. It also determines the control actions to take to achieve that temperature using the cooling jacket, and outputs those actions as decisions.

In simulation, this agent system's conversion rate was 82%: 82% of the reagents were turned into product, with 18% wasted. However, the risk of thermal runaway was high.

Like all technologies, model predictive control has a “personality,” a unique set of strengths and weaknesses that can be seen reflected in this performance. MPC is a rule follower. It is a way of controlling a process using mathematical relationships that can be fully mapped and programmed by human engineers. It works well in situations that are straightforward and linear.

The agent does a good job in the first steady state at the start. However, as the transition phase begins, the agent's performance starts to fail. The wide shadow shows that this agent can't adapt to the unpredictable conditions in the transition. Its performance becomes dangerously inconsistent, potentially allowing the temperature to exceed the thermal runaway threshold at nearly every point in the reaction.

When an MPC controller is used to control this process in the real world, a human operator needs to step in and take over control before the automated system lets the temperature cross the thermal runaway threshold.

Get Started

The following tutorials will walk you through the anatomy of some multi-agent systems that take different approaches to learning how to control the temperature of a mixer more effectively to maximize yield and avoid thermal runaway.

Deep Reinforcement Learning

The DRL agent system is a simple design with only one skill agent. This agent system does not use machine teaching to decompose the task into skills that can be trained separately. Instead, the entire reaction is controlled by a skill agent trained with deep reinforcement learning.

Let's get started!

1. Create your first skill agent

This agent system has a single skill agent called Control Full Reaction. To create this skill agent in the UI, go to the skill agent page and click Create new skill agent.

2. Set skill agent goals and constraints

Configure your skill agent to set the instructions for its training sessions. This agent has one goal, to maximize yield, and one constraint, to keep the temperature from going above 400 kelvin (K).

  1. Click Add goal. In the left drop-down menu, select Maximize, and in the right one, select Eps_Yield. This means the agent will train with the goal of maximizing the total product produced by the end of each episode.

  2. Click Add constraint. In the left drop-down menu, select Avoid, and in the right one, select T. After you select T, a slider appears for you to set the boundaries you want to train the system to avoid. In this case, set the boundaries from 400 to 500.

  3. Save your skill agent configuration and return to the Agent Orchestration Studio.

3. Create a Scenario

Set scenarios to tell each skill agent what specific conditions or phases of the process to practice in. This skill agent controls the full reaction, so it needs to practice with the reaction as a whole.

Go to the Scenarios page and select Add scenario, then name it Control full reaction and click Save. We're going to add two criteria to this scenario: a reference concentration and a reference temperature.

Control full reaction: Cref is 8.57, Tref is 311

4. Add the Skill Agent to Your Agent Configuration

Drag the skill agent control_full_reaction that you can now see on the left-hand side of your project onto the skills layer. Click on the skill agent once it's in the skills layer and assign the scenario.

5. Run Your Training Session

Now, we are ready to train your agent and see the results. First, select our built-in training cluster or one you own and have connected to the platform. Then set the number of training cycles; for this tutorial, we suggest running 50. You can run multiple simulations in parallel to speed up training. Under advanced options, you can use GPUs instead of CPUs, set a rollout fragment length, and set the number of benchmark runs.

Once you have everything configured, click Allocate training cycles. This agent system design has only one skill agent, so all training cycles will be allocated to our DRL agent. In a multi-agent system, you can assign a different number of training cycles to each agent depending on the complexity of its skill.

6. View Results

When the training has been completed, you can view your results in the training sessions tab in the UI. This will show you information on how well the agent is learning.

You will likely see a steep learning curve as the agent experiments with different control strategies and learns from the results. When the learning curve plateaus, that usually means that the skill is trained.

Analyze the DRL Agent's Performance

Conversion rate: 90% Thermal runaway risk: Low

We tested this fully trained agent and plotted the results.

The DRL agent system performs well. Its relatively thin shadow means that it performs consistently over different conditions and stays within the safety threshold almost every time.

This agent controls the initial steady state well, staying on the benchmark line. But during the transition, the DRL agent goes off the benchmark line quite a bit. It doesn't notice right away when the transition phase begins, staying too long in the lower region of the graph and then overcorrecting. That's because DRL works by experimentation, teaching itself how to get results by exploring every possible way to tackle a problem. It has no prior knowledge or understanding of a situation and relies entirely on trial and error. That means it is potentially well-suited to complex processes, like the transition phase, that can’t be easily represented mathematically.

However, its behavior is erratic because it can’t distinguish between the phases. The DRL agent’s skills do better than the traditional automation benchmark, but still leave room for improvement.

Get Started

This getting started guide takes you through the steps you need to complete before you can start building your teams of agents.

Prerequisites

  • A Composabl account

Create Your Use Case

Select the simulator you will use for the project and the assigned team. If you haven't created a team yet, you will need to do that before you create your first project.

Then, you can use the AI prompt to explain what you are attempting to achieve with your project and have it bootstrap the use case based on your description. We'll describe the use case's skills, goals, and constraints and call it Chemical Process Control.

Click New use case, then Create use case.

Set Use Case Goals

To finish setting up your use case, set your top-level project goal.

Now, we will set the goal for this example. In this project, we want to:

  • Maximize yield (make as much product as we can)

Goal Title and Description

Fill out the goal title and description for the top-level goal.

Goal Title: Chemical Process Control Goals

Description: The goal for this project is to maximize yield

Logical Structure

We are going to add one condition to represent our goal.

  1. To maximize yield, the objective is Maximize, and the variable is Eps_Yield.

Now you can save your changes.

Now that you have created your new use case and set the goal, you can start creating agent systems!

Optional: Set Up the Data Science Workflow

Some of the steps in this tutorial use Composabl's data science workflow, which allows data scientists and controls engineers to publish any Python module to the Composabl platform to use in multi-agent systems.

To use this workflow, complete these steps:

Now that you have your project created and your goals set, you're ready to start creating agents!

Plan-Execute Pattern

Along with the strategy pattern and the functional pattern, the plan-execute pattern is one of the major design patterns of machine teaching. In this pattern, the skill agents work together in a skill group, with the first skill agent determining what the action should be and the second skill agent determining how to achieve it.

What is special about this agent system is that it combines DRL and MPC, the technologies from the two single-skill agent systems — the worst performers — to create the best-performing agent.

In this example, the DRL skill agent first uses its powers of learning and experimentation to determine the goal temperature for the cooling jacket — the set point. It then passes this information on to the MPC skill agent, which uses its powers of control and execution to direct the agent on what action to take to achieve the desired temperature.

Remember how the strategy pattern is like a math class where each student solves the problems they are best at, as assigned by the teacher? In the plan-execute pattern, the students work in groups to solve problems together. Let’s say Student A is good at translating word problems into equations, while Student B is good at solving equations. Student A works on each problem first, and then passes it over to Student B, who produces the solution. No teacher is needed here, because the students divide each problem the same way.

This tutorial will show you how to publish the MPC controller to the platform using the data science workflow and then use it to create a multi-agent system using the plan-execute pattern.

Let's get started configuring this agent!

1. Publish the MPC Skill Agent to Your Project

This agent system has two skill agents, called control_full_reaction and mpc-skill-group. We already have control_full_reaction in our project, so we only need to publish mpc-skill-group to build this agent system in the Agent Builder UI. To publish mpc-skill-group to your use case, open your favorite code editor and terminal. In your terminal, navigate to the skills folder and use this command with the Composabl CLI:

composabl skill publish mpc-skill-group

Return to the agent orchestration studio and refresh the page. The skill agent will appear in the skills menu on the left of your page.

Explore the Code Files

All skill agents, perceptors, and orchestrators have at least two files in them: a Python file containing the code the skill agent will use, and a config file.

  1. pyproject.toml, a config file with the following information.

  2. A Python file containing the code the skill agent will use. For this skill agent, that file is controller.py, with explanations in comments inline.

File Structure

See the Code

MPC Skill Group Controller Skill Agent

pyproject.toml


controller.py


2. Build the Plan Execute Pattern Agent System in the Agent Orchestrator UI

First, drag the skill agent control_full_reaction from the left-hand side of the page to the skill layer. Once it's there, drag over the mpc-skill-group and make sure that it is dropped below the control_full_reaction skill agent and not beside it.

3. Run Your Training Session

We are ready to train your agent system and see the results. Select the cluster you want to use and the number of training cycles; we suggest running 50. The skill agents train one at a time, and you assign the number of cycles you want each skill agent to use. Composabl assigns an equal number of training cycles to each skill agent by default, but in some agent system designs, some skill agents might require more training than others.

4. View Results

When the training has been completed, you can view your results in the training sessions tab in the UI. This will show you information on how well the agent is learning.

Analyzing the Plan-Execute Pattern Agent’s Performance

Conversion rate: 95% Thermal runaway risk: Very low

We tested this fully trained agent and plotted the results.

This agent system is the best performer of the group. Combining two imperfect technologies with machine teaching produces much better results than either technology achieves alone.

Strategy Pattern with a Perception Layer

The strategy pattern agent performs well, but it's not perfect at avoiding thermal runaway. One good way to address that is to add a perception layer.

Perceptors are special skill agents that process and interpret sensor data before passing it to the rest of the agent system. To improve the strategy pattern's performance on temperature control, you can add a perception layer that uses machine learning to predict thermal runaway.

In this tutorial, you'll use Composabl's data science workflow to publish the perceptor to your use case so that you can add it to your agent configuration.

While skill agents can be built within Composabl's no-code studio or created externally and then imported, perceptors are always built outside of Composabl and then published to the platform to use in agents.

In this case, the perceptor is a pre-built ML model saved as a pickle file. This ML model is trained to interpret the sensor data and check for conditions that might indicate an elevated risk of thermal runaway, and then pass that information to the selector along with the rest of the sensor data.

Think of the perception layer as an additional set of senses that helps the agent system predict when something might go wrong, like a teacher monitoring the class for early signs of trouble.

Let's get started configuring this agent system!

1. Add the Perceptor Skill Agent to Your Use Case

This agent system has a perceptor skill agent called thermal_runaway_predictor. To publish it to your use case, open your favorite code editor and terminal. In your terminal, navigate to the perceptors folder and use this command with the Composabl CLI:

composabl perceptor publish thermal_runaway_predictor

Return to the agent orchestration studio and refresh the page. The skill agent will appear in the skill agents menu on the left side of your page.

Explore the Code Files

All skill agents, perceptors, and selectors have a minimum of two files in them: a Python file containing the code that the agent system will use, and a config file. Perceptors may also include additional files to load ML models and other Python packages.

File Structure

Thermal Runaway Perceptor

pyproject.toml


thermal_runaway_predictor.py


2. Copy the Strategy Pattern Agent System, Name It Strategy Pattern with Perceptor, and Add the Perceptor Skill Agent

Drag the Perceptor thermal_runaway_predictor that you can now see on the left-hand side of your use case onto the perception layer.

3. Run Your Training Session

We are ready to train your agent system and see the results. Select the cluster you want to use and the number of training cycles; we suggest running 150. The skill agents train one at a time, and you assign the number of cycles you want each skill agent to use. Composabl assigns an equal number of training cycles to each skill agent by default, but in some agent system designs, some skill agents might require more training than others.

4. View Results

When the training has been completed, you can view your results in the training sessions tab in the UI. This will show you information on how well the agent system is learning.

The agent system training results will be a little bit different from the strategy pattern alone. That's because the thermal runaway predictor is making a difference in how the agent system performs.

Analyzing the Strategy Pattern Agent System’s Performance with Perception

Conversion rate: 92% Thermal runaway risk: Very low

We tested this fully trained agent system and plotted the results.

Adding perception improves agent system temperature control performance.

The red lines on the graph show where the perceptor helped the agent system make adjustments to avoid thermal runaway. This agent system gets the same yield as the strategy pattern agent, but the improved temperature control has reduced thermal runaway incidents from a low rate to zero.

Composabl Simulation API

Composabl's simulation API extends the Gymnasium-style environment interface.

You can integrate your simulator with the Composabl SDK by using the ServerComposabl class. This class provides the necessary methods for the simulator to interact with the Composabl SDK.

The methods of the ServerComposabl class allow the Composabl SDK to automatically take care of serializing and deserializing the different requests and responses.

To conform your simulator to the Composabl SDK, you must define a server implementation class whose methods define how to communicate with the Composabl SDK.

Set Up the Simulation Environment Instance

Make

Make is a request to create a new instance of the environment with the specifications requested.

  • string env_Id: Identifier for the type of environment to create.

  • dictionary env_Init: Initial configuration for the environment, as defined within the runtime configuration.

Sensor_space_info

Sensor_space_info provides details about the environment’s sensor space.

Action_space_info

Action_space_info defines the agent system's action space.

Action_space_sample

The action_space_sample function returns an element of the simulator’s action space.

Run the Simulation Environment Instance

Reset

Reset is a request to reset the environment, and returns the first observation of the newly reset environment.

  • observation: Initial observation of the environment.

  • dictionary info: Additional information about the reset environment.

Step

Step provides the agent system action to be applied to the environment. The return structure is as follows:

  • observation: The observation following the action.

  • float reward: The reward received after taking the action.

  • bool terminated: Whether the episode has ended.

  • bool truncated: Whether the episode was truncated before a natural conclusion.

  • dictionary info: Additional information about the step.

Close

Close signals that the simulator is done being used and may perform any necessary cleanup.

Set_Scenario

Set_scenario tells the simulator the current scenario the agent system wishes to train on.

Get_Scenario

Get_scenario returns the scenario that the simulation is currently running.

Create Visualizations

Get_Render

Get_render provides the current rendered image of the environment, either as a numpy array or a string.
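
Putting these operations together, a server implementation class has roughly the following shape. This is a schematic sketch only: the exact base class, method signatures, and space descriptions are defined by the Composabl SDK, and the single-temperature environment here is a made-up placeholder.

import random

class ExampleServerImpl:
    """Sketch of a server implementation. Method names follow the operations
    described above; consult the SDK reference for exact signatures."""

    def __init__(self):
        self.state = {"T": 298.0}
        self.scenario = None

    async def make(self, env_id, env_init):
        # Create a new environment instance with the requested configuration
        self.state = {"T": float(env_init.get("T", 298.0))}

    async def sensor_space_info(self):
        return {"T": "float"}  # placeholder description of the sensor space

    async def action_space_info(self):
        return {"dTc": "float"}  # placeholder description of the action space

    async def action_space_sample(self):
        return {"dTc": random.uniform(-10.0, 10.0)}

    async def reset(self):
        # Return the first observation of the newly reset environment, plus info
        self.state = {"T": 298.0}
        return self.state, {}

    async def step(self, action):
        # Apply the action, then return observation, reward, terminated,
        # truncated, and info, as described above
        self.state["T"] += action["dTc"]        # toy dynamics, for shape only
        reward = -abs(self.state["T"] - 311.0)  # toy reward
        terminated = self.state["T"] >= 400.0   # e.g. a thermal runaway threshold
        return self.state, reward, terminated, False, {}

    async def close(self):
        pass  # release any resources held by the environment

    async def set_scenario(self, scenario):
        self.scenario = scenario  # scenario the agent system wishes to train on

    async def get_scenario(self):
        return self.scenario

    async def get_render(self):
        return f"T={self.state['T']:.1f}"  # render as a string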

Sample Use Cases

The examples and code samples in this documentation refer to our sample use cases. These examples are real-world use cases with complex goals and constraints. In each case, the Composabl team has built agent systems that exceed the benchmark control technology by orders of magnitude.

Industrial Mixer

About the Use Case

The industrial mixer use case is a realistic case study of a chemical process control agent system controlling a continuous stirred tank chemical reaction. The agent system controls the temperature in a tank where a chemical reaction occurs to create a product.

As the chemicals are stirred together in the tank, the reaction produces heat at a nonlinear, unpredictable rate. If the tank isn’t cooled enough, it can reach dangerous temperatures, a condition called thermal runaway. If it’s cooled too much, not enough product will be produced. The agent system needs to balance these two goals, keeping the tank at the right temperature at every moment to optimize production while ensuring safety.

Explore Agent System Components

Production Scheduling

About the Use Case

The production scheduling use case is a complex production planning problem set in an industrial bakery. The agent system must determine the right amount of cookies, cakes, and cupcakes to make each day, directing teams of workers and equipment and responding to fluctuations in costs, pricing, and demand.

The case study, developed in partnership with a Composabl simulation partner, requires the agent system to make a choice every minute between 24 possible combinations of equipment, task, employee, and product, over the course of a 400-decision day, with the ultimate goal of maximizing profit.

Anatomy of a Multi-Agent System

Multi-agent systems have structure, just like our brains. Different parts perform different functions.

This modularity is a key to building high-performing agentic systems quickly and effectively. Most complex systems, from factories to rockets to software programs, are broken into modular pieces to separate concerns, reuse design patterns, and integrate pre-built components. Like these systems, decision-making AI also works best when decisions are separated into modular concerns. Modular structure makes intelligent agent systems easier to build, test, and maintain.

How Agentic Systems Make Decisions

Multi-agent systems work through a feedback loop. The system gets information from the simulation or real environment and then makes a decision to adjust one or more of its control variables in some way. This creates a change that is reflected in new sensor information that the agent system can then process and use to decide on the next adjustment.

While agent systems are being built and trained, they are connected to simulators that give them feedback about how their actions affect the system they are learning to control. Once an agent system is deployed, the simulator is replaced by an equivalent connection to the real system.

The simulator and the multi-agent system function in a continuous feedback loop, with the system outputting actions that affect the simulation environment, and the simulator returning new information to the agent system based on every decision.
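
As a toy illustration, the loop has the following shape. The object and method names here are placeholders for explanation, not the Composabl API.

class ToySim:
    """Minimal stand-in for a simulation environment."""
    def reset(self):
        self.t = 0
        return {"T": 298.0}, {}

    def step(self, action):
        self.t += 1
        obs = {"T": 298.0 + action}    # new sensor information after the action
        reward = -abs(obs["T"] - 311.0)
        terminated = self.t >= 90      # fixed-length episode
        return obs, reward, terminated, False, {}

def decide(obs):
    # Stand-in for the multi-agent system's decision logic
    return 311.0 - obs["T"]

sim = ToySim()
obs, info = sim.reset()
terminated = truncated = False
while not (terminated or truncated):
    action = decide(obs)  # one iteration: sensor information in, one decision out
    obs, reward, terminated, truncated, info = sim.step(action)
# One complete pass through the task like this is an episode.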

Sensors are the part of the multi-agent system that take in information from the simulator or the real environment about conditions and about the results of the agent system's actions. They are the equivalent of human senses – the eyes and ears. Sensor readings come from specific variables in the simulation or control system that measure various aspects of the state space.

Decisions are the system's outputs, adjusting one or more of the control variables to control the process dynamically throughout the performance of the task. Each decision during training is known as an iteration, while a series of iterations to complete the task is called an episode.

Inside the Multi-Agent System

The team of agents is organized in layers, with information and decisions passing through each layer in sequence.

Skill Agents

The skill agent layer, or decision-making layer, is the heart of a multi-agent system. This is where the system makes its control decisions. When a specific skill agent is activated, it determines the control action the system should take.

Multi-agent systems contain multiple skill agents orchestrated together according to the design. Skill agents can work together in groups, sequences, hierarchies, or coordinated teams. This modularity allows the agent to learn more efficiently and perform better.

You can imagine skill agents being like students on a math team who are working together to complete a set of problems. Each student performs best solving a particular kind of problem: one is good at fractions and one at decimals. Depending on the type of problem, the appropriate student will use their expertise to solve the problem and produce the answer for the team. Sometimes one student might handle the entire problem, and for other problems more than one student might need to work together.

Just as different students’ capabilities make them able to solve particular problems, different skill agents may make use of different technologies. Some types of decisions are best approached through skill agents that can be programmed with mathematical calculations, rules, or optimization algorithms. Others that are more complex and nonlinear can be trained using deep reinforcement learning.

Examples of Skills

For an HVAC system regulating temperature in an office building:

  • Control the system during the day

  • Control the system at night

For a factory where responses are needed to different types of alarms:

  • Handle safety critical alarms (programmed with rules)

  • Handle simple alarms (programmed with calculations)

  • Handle complex alarms (learned with reinforcement learning)

For a drone autopilot:

  • Stabilize

  • Get to landing zone

  • Land

  • Avoid obstacles

For a robotic arm used to grab and stack objects:

  • Reach (extend the robot arm from the "elbow" and "wrist")

  • Move (move the arm laterally using the "shoulder")

  • Orient (turn the "wrist" to position the "hand")

  • Grasp (Manipulate the "fingers" to clamp down)

  • Stack (Move laterally while grasping)

Orchestrators

Orchestrators are the supervisors for your agent system. In the math class analogy, the orchestrator would be like the teacher. The teacher assesses the type of problem and assigns the right student.

In a multi-agent system, an orchestrator uses information from the sensors and perceptors to understand the scenario and then determine which skill agent is needed. Once the skill agent is called into service, it makes the decision.

For example, for HVAC control, an orchestrator would determine whether day or night control is needed, and then pass control to the appropriate skill agent. In the safety alarm example, the orchestrator determines the type of alarm and then passes the decision to the right skill agent. In the drone and robotic arm examples, the skills need to be performed in sequence. In these cases, the orchestrator assigns the appropriate skill agent as needed for each step in the process.

Perceptors

A perception layer is an optional enhancement layer. Perceptors process and translate sensor data into a format that can be used to make decisions. In other words, the perception layer inputs the sensor variables and outputs new variables deduced by calculation, machine learning, or other programming.

For example, if we design a team of agents as an autopilot for a drone, we might have sensor variables that measure pitch, yaw, roll (position of the drone), velocity in each of those three directions, and acceleration in each of those three directions. But what about stability? Stability is an important characteristic to understand while flying a drone, but there is no sensor variable that describes stability. It is too complex to be captured by a single sensor.

The perception layer allows us to create a variable for stability. It can be calculated using dynamics equations or trained with supervised machine learning. The new variable then becomes accessible to the rest of the multi-agent system along with the other sensor variables.
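
As a sketch, a stability perceptor for this drone example might look like the following. The PerceptorImpl interface mirrors the thermal runaway perceptor shown later in these docs; the stability heuristic and sensor names are made-up placeholders, not real drone dynamics.

from composabl_core import PerceptorImpl

class StabilityPerceptor(PerceptorImpl):
    """Adds a derived 'stability' variable computed from motion sensors."""

    async def compute(self, obs_spec, obs):
        # Higher angular rates mean less stability in this toy heuristic
        wobble = abs(obs["pitch_rate"]) + abs(obs["yaw_rate"]) + abs(obs["roll_rate"])
        stability = 1.0 / (1.0 + wobble)
        # The new variable becomes available to the rest of the multi-agent
        # system alongside the raw sensor variables
        return {"stability": stability}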

Examples of Perceptors

  • Computer Vision: A camera sensor passes image or video feeds into a perceptor module that identifies object types and locations

  • Auditory Perception: A microphone sensor passes machine sounds to a perceptor module that identifies which state the machine is in based on the sounds that it is making

  • Prediction: A perceptor module inputs quality measurements and past agent system actions and predicts whether current actions will lead to acceptable quality measurements

  • Anomaly Detection: A perceptor module inputs market variables and detects when the market is changing regimes.

  • Classification and Clustering: A perceptor module inputs machine and process data and classifies which of several conditions a manufacturing line is currently in.

Simulation Overview

The simulator is the part of the agent system ecosystem that tells you what will happen when you take a certain action. Whether based on historical data, physical principles and math, or other methods, the simulation environment models the real system and allows your agent systems to train.

Explore Composabl's Simulators

To look under the hood and see how a simulator works, explore Composabl's public Python simulators hosted on Docker Hub.

You can build and train agent systems using these simulations to explore Composabl, educate yourself and your team, and create demonstrations and POCs. These are relatively simple simulations that can train agent systems quickly.

The best way to access these simulations is through the CLI, using the sim commands from the CLI Reference above to see the options available and to connect to one of the simulators.

Simulation Help

If you have a simulator, this section of the documentation will explain how to connect it to Composabl so that you can use it to train agent systems.

If you don’t have a simulator, you may want to work with a Composabl partner to help you develop and connect one. Contact the Composabl team for more information about finding a simulation partner.


composabl skill publish mpc-skill-group
[project]
name = "MPC Skill Group"
version = "0.1.0"
description = "MPC prepared for Skill Group"
authors = [{ name = "John Doe", email = "john.doe@composabl.com" }]
dependencies = [
    "composabl-core",
    "scipy",
    "casadi==3.6.6",
    "do_mpc==4.6.5"
]

[composabl]
type = "skill-controller"
entrypoint = "mpc_skill_group.controller:Controller"
import math

import numpy as np
import do_mpc
from casadi import exp  # casadi's exp handles the symbolic model variables below
from scipy import interpolate

from composabl_core import SkillController

# time step (seconds) between state updates
Δt = 1

π = math.pi

class Controller(SkillController):
    def __init__(self, *args, **kwargs):
        """
        Initialize the Controller skill with default values.
        
        Args:
            *args: Variable length argument list.
            **kwargs: Arbitrary keyword arguments.
        """
        # Initialize a counter to track the number of actions computed
        self.count = 0

    async def compute_action(self, obs, action):
        """
        Compute the control action (ΔTc) based on current observations using Model Predictive Control (MPC).
        
        Args:
            obs (list or dict): Current sensor observations.
                If list, expected order: ['T', 'Tc', 'Ca', 'Cref', 'Tref']
            action: The previous action taken (not directly used but considered for ΔTc calculation).
        
        Returns:
            list: A list containing the computed change in Tc (ΔTc).
                  Example: [ΔTc]
        """
        # Convert observations to dictionary if they are provided as a list
        if type(obs) == list:
            obs = {
                'T': obs[0],
                'Tc': obs[1],
                'Ca': obs[2],
                'Cref': obs[3],
                'Tref': obs[4]
            }
        # else:
        #     # Uncomment if you need to ensure all values are floats
        #     for key, value in obs.items():
        #         obs[key] = float(value)

        # Handle action input: ensure it's a float value
        if type(action) == list or type(action) == np.ndarray:
            action = action[0]
        elif type(action) == dict:
            assert type(action['action']) == float
            action = float(action['action'])
        else:
            action = float(action)

        # Initialize noise variable (currently set to 0; can be modified for stochasticity)
        noise = 0

        # Extract and convert sensor readings to float
        CrSP = float(obs['Cref'])    # Reference concentration
        Ca0 = float(obs['Ca'])       # Actual concentration at current step
        T0 = float(obs['T'])         # Temperature at current step
        Tc0 = float(obs['Tc']) + action  # Cooling liquid temperature adjusted by action

        # Define constants for the CSTR model
        F = 1          # Volumetric flow rate (m³/h)
        V = 1          # Reactor volume (m³)
        k0 = 34930800  # Pre-exponential nonthermal factor (1/h)
        E = 11843      # Activation energy per mole (kcal/kmol)
        R = 1.985875   # Ideal gas constant (kcal/(kmol·K))
        ΔH = -5960     # Heat of reaction per mole (kcal/kmol)
        phoCp = 500    # Density multiplied by heat capacity (kcal/(m³·K))
        UA = 150       # Overall heat transfer coefficient multiplied by tank area (kcal/(K·h))
        Cafin = 10     # Inlet concentration (kmol/m³)
        Tf = 298.2     # Feed temperature (K)

        # --- MPC MODEL SETUP ---
        model_type = 'continuous'  # Define model type: 'discrete' or 'continuous'
        model = do_mpc.model.Model(model_type)

        # Define state variables
        Ca = model.set_variable(var_type='_x', var_name='Ca', shape=(1,1))  # Concentration
        T = model.set_variable(var_type='_x', var_name='T', shape=(1,1))    # Temperature

        # Define measurements (if any) with optional measurement noise
        model.set_meas('Ca', Ca, meas_noise=True)
        model.set_meas('T', T, meas_noise=True)

        # Define control input
        Tc = model.set_variable(var_type='_u', var_name='Tc')  # Cooling liquid temperature

        # Define time-varying parameters (TVPs)
        model.set_variable(var_type='_tvp', var_name='Caf')   # Inlet concentration (kmol/m³)
        model.set_variable(var_type='_tvp', var_name='Tref')  # Reference temperature (K)

        # Define model equations (right-hand side)
        model.set_rhs('Ca', (F/V * (Cafin - Ca)) - (k0 * exp(-E/(R*T))*Ca))
        model.set_rhs('T', (F/V *(Tf-T)) - ((ΔH/phoCp)*(k0 * exp(-E/(R*T))*Ca)) - ((UA /(phoCp*V)) *(T-Tc)))

        # Finalize model setup
        model.setup()

        # --- CONTROLLER SETUP ---
        mpc = do_mpc.controller.MPC(model)
        setup_mpc = {
            'n_horizon': 20,       # Prediction horizon
            'n_robust': 1,         # Number of robust steps
            'open_loop': 0,        # Open-loop setting
            't_step': Δt,          # Time step (seconds)
            'store_full_solution': True  # Store full solution
        }

        mpc.set_param(**setup_mpc)

        # Suppress IPOPT solver output for cleaner logs
        suppress_ipopt = {'ipopt.print_level': 0, 'ipopt.sb': 'yes', 'print_time': 0}
        mpc.set_param(nlpsol_opts=suppress_ipopt)

        # Scaling for states and inputs to improve numerical stability
        mpc.scaling['_x', 'T'] = 100
        mpc.scaling['_u', 'Tc'] = 100

        # --- OBJECTIVE FUNCTION ---
        _x = model.x
        _tvp = model.tvp
        _u = model.u

        # Define terminal and stage cost
        mterm = ((_x['Ca'] - CrSP))**2  # Terminal cost
        lterm = ((_x['Ca'] - CrSP))**2  # Stage cost

        mpc.set_objective(mterm=mterm, lterm=lterm)

        # Define control input penalties to discourage large control actions
        mpc.set_rterm(Tc=1.5)  # Input penalty for Tc

        # --- CONSTRAINTS ---
        # Bounds for state variables
        mpc.bounds['lower', '_x', 'Ca'] = 0.1   # Minimum concentration
        mpc.bounds['upper', '_x', 'Ca'] = 12    # Maximum concentration

        mpc.bounds['upper', '_x', 'T'] = 400    # Maximum temperature
        mpc.bounds['lower', '_x', 'T'] = 100    # Minimum temperature

        # Bounds for control inputs
        mpc.bounds['lower', '_u', 'Tc'] = 273   # Minimum cooling temperature (K)
        mpc.bounds['upper', '_u', 'Tc'] = 322   # Maximum cooling temperature (K)

        # --- TIME-VARYING PARAMETERS (TVPs) SETUP ---
        # Define templates for TVPs
        tvp_temp_1 = mpc.get_tvp_template()
        tvp_temp_1['_tvp', :] = np.array([8.5698])

        tvp_temp_2 = mpc.get_tvp_template()
        tvp_temp_2['_tvp', :] = np.array([2])

        tvp_temp_3 = mpc.get_tvp_template()
        tvp_temp_3['_tvp', :] = np.array([2])

        # Define a function to update TVPs based on current time
        def tvp_fun(t_now):
            p1 = 22    # Time step 1
            p2 = 74    # Time step 2
            time = 90  # Total time

            # Define concentration and temperature equilibrium points
            ceq = [8.57, 6.9275, 5.2850, 3.6425, 2]
            teq = [311.2612, 327.9968, 341.1084, 354.7246, 373.1311]

            # Interpolate concentration and temperature based on current time
            C = interpolate.interp1d([0, p1, p2, time], [8.57, 8.57, 2, 2])
            T_ = interpolate.interp1d([0, p1, p2, time], [311.2612, 311.2612, 373.1311, 373.1311])

            if t_now < p1:
                return tvp_temp_1
            elif p1 <= t_now < p2:
                y = float(C(t_now))
                tvp_temp_3['_tvp', :] = np.array([y])
                return tvp_temp_3
            else:
                return tvp_temp_2

        mpc.set_tvp_fun(tvp_fun)

        # Finalize MPC setup
        mpc.setup()

        # --- ESTIMATOR SETUP ---
        estimator = do_mpc.estimator.StateFeedback(model)

        # --- SIMULATOR SETUP ---
        simulator = do_mpc.simulator.Simulator(model)
        params_simulator = {
            't_step': Δt  # Time step (seconds)
        }

        simulator.set_param(**params_simulator)

        # Define templates for simulator parameters
        p_num = simulator.get_p_template()
        tvp_num = simulator.get_tvp_template()

        # Define functions for TVPs and uncertain parameters in the simulator
        def tvp_fun_sim(t_now):
            return tvp_num

        def p_fun_sim(t_now):
            return p_num

        simulator.set_tvp_fun(tvp_fun_sim)
        simulator.set_p_fun(p_fun_sim)

        # Finalize simulator setup
        simulator.setup()

        # --- INITIAL STATE SETUP ---
        # Initialize states for MPC, simulator, and estimator
        x0 = simulator.x0
        x0['Ca'] = Ca0
        x0['T'] = T0

        u0 = simulator.u0
        u0['Tc'] = Tc0

        mpc.x0 = x0
        simulator.x0 = x0
        estimator.x0 = x0

        mpc.u0 = u0
        simulator.u0 = u0
        estimator.u0 = u0

        # Set initial guess for MPC
        mpc.set_initial_guess()

        # --- MPC CONTROL LOOP ---
        # Simulate N steps (currently set to 1)
        u0_old = 0
        time_steps = 1
        for k in range(time_steps):
            if k > 1:
                u0_old = u0[0][0]

            # Make a control step using MPC
            u0 = mpc.make_step(x0)

            # Enforce a maximum change of ±10 on the control input
            if k > 1:
                if u0[0][0] - u0_old > 10:
                    u0 = np.array([[u0_old + 10]])
                elif u0[0][0] - u0_old < -10:
                    u0 = np.array([[u0_old - 10]])
            else:
                if u0[0][0] - Tc0 >= 10:
                    u0 = np.array([[Tc0 + 10]])
                elif u0[0][0] - Tc0 <= -10:
                    u0 = np.array([[Tc0 - 10]])

            # Add Gaussian noise to the measurements
            error_var = noise
            σ_max1 = error_var * (8.5698 - 2)
            σ_max2 = error_var * (373.1311 - 311.2612)
            mu = 0
            v0 = np.array([
                mu + σ_max1 * np.random.randn(1, 1)[0],
                mu + σ_max2 * np.random.randn(1, 1)[0]
            ])

            # Simulate the next step with the control input and noise
            y_next = simulator.make_step(u0, v0=v0)  # MPC simulation step

            # Reshape state values for consistency
            state_ops = y_next.reshape((1, 2))

            # --- BENCHMARK SETUP ---
            p1 = 22
            p2 = 74
            total_time = 90  # total episode time, matching tvp_fun above
            ceq = [8.57, 6.9275, 5.2850, 3.6425, 2]
            teq = [311.2612, 327.9968, 341.1084, 354.7246, 373.1311]

            # Interpolate reference concentration and temperature over the episode
            C = interpolate.interp1d([0, p1, p2, total_time], [8.57, 8.57, 2, 2])
            T_ = interpolate.interp1d([0, p1, p2, total_time], [311.2612, 311.2612, 373.1311, 373.1311])

            # Update reference concentrations and temperatures based on current step
            if k < p1:
                Cref = 8.5698
                Tref = 311.2612
            elif p1 <= k < p2:
                y = float(C(k))
                y2 = float(T_(k))
                Cref = y
                Tref = y2
            else:
                Cref = 2
                Tref = 373.1311

            # Update the estimator with the new measurements
            x0 = estimator.make_step(y_next)  # Update state estimates

        # Increment the action counter
        self.count += 1

        # Compute the change in Tc (ΔTc) based on the new control input
        newTc = u0[0][0]
        dTc = float(newTc) - float(obs['Tc'])

        # Return the computed ΔTc as a list
        return [dTc]

    async def transform_sensors(self, obs):
        """
        Process and potentially modify sensor observations before they are used.

        Args:
            obs (dict): Current sensor observations.

        Returns:
            dict: Transformed sensor observations.

        Note:
            - Currently, this method returns the observations unchanged.
            - This can be customized to apply transformations if needed.
        """
        # Currently, no transformation is applied to sensors
        return obs

    async def filtered_sensor_space(self):
        """
        Define which sensors are relevant for this controller.

        Returns:
            list: Names of the sensors to be used.

        Note:
            - Specifies a list of sensor names that this controller will utilize.
            - Helps in focusing the controller's operations on relevant data.
        """
        # Specify the sensors that this controller will use
        return ['T', 'Tc', 'Ca', 'Cref', 'Tref', 'Conc_Error', 'Eps_Yield', 'Cb_Prod']

    async def compute_success_criteria(self, transformed_obs, action):
        """
        Determine whether the success criteria have been met.

        Args:
            transformed_obs (dict): Transformed sensor observations.
            action: The action taken.

        Returns:
            bool: True if success criteria are met, False otherwise.

        Behavior:
            - Currently always returns False.
            - Can be implemented with logic to check if certain conditions are satisfied.
        """
        # Placeholder for success criteria logic
        return False

    async def compute_termination(self, transformed_obs, action):
        """
        Determine whether the training episode should terminate.

        Args:
            transformed_obs (dict): Transformed sensor observations.
            action: The action taken.

        Returns:
            bool: True if the episode should terminate, False otherwise.

        Behavior:
            - Currently always returns False.
            - Can be implemented with logic to terminate based on certain conditions.
        """
        # Placeholder for termination condition logic
        return False
composabl perceptor publish thermal_runaway_predictor
[project]
name = "Thermal Runaway Predictor - ML 1.2.2"
version = "0.1.0"
description = "ML thermal runaway predictor"
authors = [{ name = "John Doe", email = "john.doe@composabl.com" }]
dependencies = [
    "composabl-core",
    "scikit-learn==1.2.2"
]

[composabl]
type = "perceptor"
entrypoint = "thermal_runaway_predictor.perceptor:ThermalRunawayPredict"

# Include additional data files
[tool.setuptools.packages.find]
where = ["thermal_runaway_predictor"]

[tool.setuptools.package-data]
"*" = ["*.json", "*.pkl"]
from composabl_core import PerceptorImpl

#######
import os
import pickle

# Determine the directory where the current script is located
path = os.path.dirname(os.path.realpath(__file__))

class ThermalRunawayPredict(PerceptorImpl):
    def __init__(self, *args, **kwargs):
        """
        Initialize the ThermalRunawayPredict perceptor with default values and load the machine learning model.
        
        Args:
            *args: Variable length argument list.
            **kwargs: Arbitrary keyword arguments.
        """
        # Initialize the prediction output variable
        self.y = 0
        
        # Initialize a flag to indicate thermal runaway status
        self.thermal_run = 0
        
        # Load the pre-trained machine learning model from a pickle file
        # The model is expected to be located in the 'ml_models' directory relative to the script's path
        model_path = os.path.join(path, "ml_models", "ml_predict_temperature_122.pkl")
        try:
            with open(model_path, 'rb') as model_file:
                self.ml_model = pickle.load(model_file)
        except FileNotFoundError:
            print(f"Machine learning model not found at {model_path}. Please ensure the model file exists.")
            self.ml_model = None
        except Exception as e:
            print(f"An error occurred while loading the ML model: {e}")
            self.ml_model = None
        
        # Initialize a list to store historical ML predictions if needed
        self.ML_list = []
        
        # Initialize the last recorded 'Tc' value to compute its change (ΔTc)
        self.last_Tc = 0

    async def compute(self, obs_spec, obs):
        """
        Compute the thermal runaway prediction based on current sensor observations.
        
        Args:
            obs_spec: Observation specification (not used in this implementation).
            obs: Current sensor observations. Can be a list or a dictionary.
        
        Returns:
            dict: A dictionary containing the thermal runaway prediction.
                  Example: {"thermal_runaway_predict": 1}
        """
        # Ensure that 'obs' is a dictionary. If not, convert it using predefined sensor keys.
        if not isinstance(obs, dict):
            # Define the expected sensor keys
            obs_keys = ['T', 'Tc', 'Ca', 'Cref', 'Tref', 'Conc_Error', 'Eps_Yield', 'Cb_Prod']
            # Convert the list to a dictionary by zipping it with the sensor keys
            obs = dict(zip(obs_keys, obs))
            print("Converted 'obs' to dictionary format using predefined sensor keys.")
        
        # Calculate the change in 'Tc' (ΔTc) since the last observation
        if self.last_Tc == 0:
            # If this is the first observation, assume an initial ΔTc of 5
            self.ΔTc = 5
        else:
            # Compute ΔTc as the difference between current 'Tc' and the last recorded 'Tc'
            try:
                current_Tc = float(obs['Tc'])
                self.ΔTc = current_Tc - self.last_Tc
            except (KeyError, ValueError, TypeError) as e:
                # Handle cases where 'Tc' is missing or cannot be converted to float
                print(f"Error accessing or converting 'Tc': {e}")
                self.ΔTc = 0  # Default to 0 if there's an error
        
        # Initialize the prediction output
        y = 0
        
        # Check if the current temperature 'T' exceeds or equals 340
        try:
            current_T = float(obs['T'])
        except (KeyError, ValueError, TypeError) as e:
            print(f"Error accessing or converting 'T': {e}")
            current_T = 0  # Default to 0 if there's an error
        
        if current_T >= 340:
            # Prepare the feature vector for the ML model
            try:
                Ca = float(obs['Ca'])
                current_Tc = float(obs['Tc'])
            except (KeyError, ValueError, TypeError) as e:
                print(f"Error accessing or converting 'Ca' or 'Tc': {e}")
                Ca = 0
                current_Tc = 0

            # Feature vector: [Ca, T, Tc, ΔTc]
            X = [[Ca, current_T, current_Tc, self.ΔTc]]
            
            # If the ML model was loaded successfully, make a prediction
            if self.ml_model:
                try:
                    # Predict the probability of thermal runaway
                    y_proba = self.ml_model.predict_proba(X)
                    
                    # Get the predicted class label (e.g., 0 or 1)
                    y = self.ml_model.predict(X)[0]
                    
                    # Optionally, use the probability to adjust prediction confidence
                    # For example, set y=1 only if the probability of class 1 is >= 0.3
                    if y_proba[0][1] >= 0.3:
                        y = 1
                    else:
                        y = 0
                except Exception as e:
                    print(f"Error during ML model prediction: {e}")
                    y = 0
            else:
                print("ML model is not loaded. Cannot make predictions.")
                y = 0
        
        # Update the last recorded 'Tc' with the current value for the next computation
        try:
            self.last_Tc = float(obs['Tc'])
        except (KeyError, ValueError, TypeError) as e:
            print(f"Error accessing or converting 'Tc' for updating last_Tc: {e}")
            pass  # Keep the previous value if there's an error
        
        # Optionally, store the prediction in ML_list for historical tracking
        self.ML_list.append(y)
        
        # Update the prediction output variable
        self.y = y
        
        # Return the prediction as a dictionary
        return {"thermal_runaway_predict": y}

    def filtered_sensor_space(self, obs):
        """
        Define which sensors are relevant for this perceptor.
        
        Args:
            obs: Current sensor observations (not used in this implementation).
        
        Returns:
            list: Names of the sensors to be used.
        """
        # Specify the sensors that this perceptor will use
        return ['T', 'Tc', 'Ca', 'Cref', 'Tref', 'Conc_Error', 'Eps_Yield', 'Cb_Prod']
    async def make(self, env_id: str, env_init: dict):
        self.env_id = env_id if env_id else self.env_id
        self.env_init = env_init if env_init else self.env_init

        self.env = Sim(self.env_init)

        return {
            "id": "my_simulator",
        }

    async def sensor_space_info(self) -> gym.Space:
        return self.env.sensor_space

    async def action_space_info(self) -> gym.Space:
        return self.env.action_space

    async def action_space_sample(self):
        return self.env.action_space.sample()

    async def reset(self):
        return self.env.reset()

    async def step(self, action):
        return self.env.step(action)

    async def close(self):
        self.env.close()

    async def set_scenario(self, scenario):
        self.env.scenario = scenario

    async def get_scenario(self):
        if self.env.scenario is None:
            return None

        return self.env.scenario

    async def get_render(self):
        return self.env.render()
composabl sim list
composabl sim run

Create a Use Case

A use case is a collection of agent systems that address the same problem. The best way to use Composabl is to build multiple agent systems within the same use case using different design patterns and variations. This allows you to iterate and improve your agent systems for the best possible performance.

All agent systems within a use case share the same goals and the same simulator.

Create a Use Case in the UI

To create a use case, click New Use Case in the upper right-hand corner of your dashboard.

You'll be prompted to enter the simulator associated with your use case and then choose your team.

Use the AI Use Case Creation Assistant

You also have the option to use Composabl's AI assistant to help set up the use case. This specialized AI assistant is designed to help solve the "cold start" problem and get you started designing your agent system more quickly.

Based on your written use case description, the AI will automatically create customized goals and agent skills within your use case. You can then access those goals and agent skills in the Agent Orchestration studio and edit and adjust them as needed.

When prompting the assistant, use as much detail as you can. You can tell the AI:

  • What the problem or use case is

  • What equipment or process the agent system will control

  • What the overall goal is

  • Anything you know about phases of the process or different scenarios

Set Goals, Constraints, and Success Criteria

The performance goal is the most important KPI or metric you will use to evaluate your agent system's success. This goal directs your agent as it trains. The AI learning technology within the agent will reward the agent system when it gets closer to the goal, helping it to improve.

For many business use cases, the top-level goal will be expressed in terms of profit or ROI. There are many factors to consider in the production scheduling use case, but profit is the ultimate goal, so it is the most important KPI.

Setting Goals, Constraints, and Success Criteria in the UI

To edit a goal created by the copilot or to create a new goal, follow these steps:

  1. Navigate to the use case page

  2. Click Set up goal and enter a name and description for your goal.

  3. Click Add condition to define the goal.

Use the dropdown menus to select the variables and then define the parameters for each goal, constraint, or success criterion you want to include.

Configure Goals

Goals define what an agent system should do. Goals apply to one of the sensor variables and are defined using one of three possible directives:

  • Maximize: Maximize the value of the variable

  • Minimize: Minimize the value of the variable

  • Maintain: Keep the variable at a specified value

Configure Constraints

Constraints set the boundaries for the skill agent. They define rules that the agent system must follow during operation. Constraints are defined using one of two possible directives:

  • Avoid: The skill agent learns to keep the variable from reaching a specified value or range by withholding rewards

  • Terminate: When the actions of the skill agent lead to certain conditions within a variable, the skill agent has failed and must stop and start a new episode

Configure Success Criteria

Success criteria tell the skill agent when it's doing something right. They are defined using one of two possible directives:

  • Approach: The skill agent learns to get close to a specified value by getting an increased reward

  • Succeed: When the success criteria are achieved, the session ends, and a new one begins so that the skill agent can keep practicing and learn to win every time

Using Goals, Constraints, and Success Criteria for Multi-Agent AI System Design

Goals, constraints, and success criteria transform complex AI agent building into an intuitive process that eliminates the need for programming expertise.

You directly apply your domain knowledge without coding by simply defining what you want the agent to achieve and what conditions to respect. This approach reduces development time while ensuring AI agents embody real-world operational wisdom, bridging the gap between technical AI capabilities and practical industrial knowledge in complex physical environments.

Create Skill Agents

Skills are the foundational building blocks for your intelligent agent system. They take action to achieve goals in key scenarios where your agent system needs to succeed. To build an agent with Machine Teaching, you will create multiple skill agents and then orchestrate them together.

You can use three different types of skill agents within Composabl.

For learned skills, you can use either the UI or the SDK to create teachers, but the SDK includes some fine-tuning options that are not available in the UI.

Controllers for programmed skills can be created only through the SDK. They can then be published to the UI for use in agent systems.

You can use either the UI or the SDK to create selectors.

Create Skill Agents with Rewards Using the SDK

The Composabl SDK offers a suite of advanced tools to train skills using deep reinforcement learning. Using the Python teacher class, you can fine-tune the rewards for your skills. Once you have configured a skill with the SDK, you can publish it to the UI to use in agent system designs.

Create a New Skill

To create a skill in the Python SDK, begin by logging in from the CLI with composabl login.

Then type composabl skill new.

Give the skill a name and a description in response to the prompts that follow. Choose whether your skill should be a teacher (learned with AI) or a controller (a programmed module like an optimization algorithm or MPC controller).

Specify the folder where you’d like to create the skill.

The Composabl SDK will create a folder and a Python teacher file from the template.

The Python Teacher Class

The Python teacher class offers several functions that you can use to fine-tune the training of your skills.

Functions for Training

Train with Rewards: the compute_reward Function

python
def compute_reward(self, transformed_sensors, action, sim_reward):
        self.counter += 1
        if self.past_sensors is None:
            self.past_sensors = transformed_sensors
            return 0
        else:
            if self.past_sensors["state1"] < transformed_sensors["state1"]:
                return 1
            else:
                return -1

The compute_termination function tells the Composabl platform when to terminate a practice episode and start over with a new practice scenario (episode). From a teaching perspective, it makes the most sense to terminate an episode when the agent system succeeds, fails, or is pursuing a course of action that you do not find likely to succeed. This function returns a Boolean flag (True or False) indicating whether to terminate the episode. You can calculate this criterion however seems best.

python
def compute_termination(self, transformed_sensors, action):
        return False

The compute_success_criteria function provides a definition of skill success and a proxy for how completely the agent system has learned the skill. The platform uses the output of this function (True or False) to calculate when to stop training one skill and move on to training the next. It is also used to determine when to move to the next skill in a fixed-order sequence: the agent system cannot move from one skill in a fixed-order sequence to the next until the success criteria for that skill are reached.

python
def compute_success_criteria(self, transformed_sensors, action):
        return self.counter > 100

Here are some examples of success criteria definitions:

  • A simple but naive success criterion might return True if the average reward for an episode or scenario crosses a threshold, but False if it does not.

  • A more complex success criterion might calculate root mean squared error (RMSE) for key variables across the episode and return True if the error is less than a customer-specified benchmark, but False otherwise (see the sketch after this list).

  • An even more complex success criterion might compare the agent system to a benchmark controller or another agent system across many key variables and trials. It returns True if the agent system beats the benchmark on these criteria, but False otherwise.
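
As an illustration, the RMSE-based criterion might look something like the following sketch. This is a minimal example, not platform code: the sensor names ("T" and "Tref"), the benchmark value, and the way errors are accumulated are all assumptions for illustration.

python
import math

class RMSETeacher(Teacher):  # assumes the Composabl SDK Teacher base class is imported
    def __init__(self, *args, **kwargs):
        self.errors = []
        self.benchmark_rmse = 2.5  # assumed customer-specified benchmark

    def compute_success_criteria(self, transformed_sensors, action):
        # Accumulate the error between the controlled variable and its reference
        self.errors.append(transformed_sensors["T"] - transformed_sensors["Tref"])
        rmse = math.sqrt(sum(e ** 2 for e in self.errors) / len(self.errors))
        return rmse < self.benchmark_rmse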

Train with Goals

Training with goals lets you use a predefined reward structure rather than configuring the rewards individually. When you use a goal, your agent system will inherit the compute reward, compute termination, and compute success functions from the goal. (You will still have the option to further customize those functions as needed.)

The five goal types you can use are:

  • AvoidGoal

  • MaximizeGoal

  • MinimizeGoal

  • ApproachGoal

  • MaintainGoal

Goals are added using specialized teacher classes rather than the general teacher class that you would otherwise use to teach skills. For example, for a skill named Balance that you wanted to train with a goal to maintain a specific orientation, you would use the MaintainGoal teacher class.

python
class BalanceTeacher(MaintainGoal):
    def __init__(self, *args, **kwargs):
        super().__init__("pole_theta", "Maintain pole to upright", target=0, stop_distance=0.418)

The parameters you can use for goals include the target value and the stopping distance, as shown in the example above.

You can also use more than one goal for a single skill using the CoordinatedGoal teacher class. This is useful when your agent system needs to behave in a way that creates a balance between two goals that are both important.

Functions to Guide Agent System Behavior with Rules

Just like rules guide training and behavior for humans, providing rules for the agent system to follow can guide the agent system's decision-making more quickly to success. Rules guide the behavior of an agent system based on expertise and constraints.

Add Rules: the compute_action_mask Function

The compute_action_mask teaching function expresses rules that trainable agent systems must follow.

python
 # The action mask provides rules at each step about which actions the agent system is allowed to take.
    def compute_action_mask(self, transformed_sensors, action):
        return [0, 1, 1]

The compute_action_mask teaching function works only for discrete action spaces (where the actions are integers or categories), not for continuous action spaces (where decision actions are decimal numbers). If you specify a mask for a skill whose actions are continuous, the platform will ignore the action mask.

The function returns a list of 0 and 1 values. Zero means that the action is forbidden by the rule. One means that the action is allowed by the rule. The function may change the returned value after each decision. This allows complex logic to express nuanced rules.

In the example above, the first action is forbidden for the next decision, but the second and third actions are allowed. The logic in the skill itself (whether learned or programmed) will choose between the allowed second and third actions.
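
To illustrate how the returned mask can change from decision to decision, here is a minimal sketch of a dynamic action mask. The sensor name "T", the threshold, and the three-action layout are illustrative assumptions.

python
def compute_action_mask(self, transformed_sensors, action):
    # Forbid the third action once the temperature nears its limit (assumed threshold)
    if transformed_sensors["T"] > 390:
        return [1, 1, 0]
    # Otherwise all three actions are allowed
    return [1, 1, 1]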

All selectors have a discrete action space (they choose which child skill to activate), so you can always apply the compute_action_mask function to teach them.

Functions to Manage Information Inside Agent Systems

As information passes through perceptors, skills, and selectors in the agent system, sometimes it needs to change format along the way. You can use three teaching functions to transform sensor and action variables inside agent systems: transform_sensors, transform_action, and filtered_sensor_space.

To transform sensor variables, use the transform_sensor function to calculate changes to specific sensors, then return the complete set of sensor variables (the observation space).

python
def transform_sensor(self, sensor, action):
        return sensor

Two of the most common reasons for transforming sensor variables are conversion and normalization. For example, if a simulator reports temperature values in Fahrenheit, but the agent system expects temperature values in Celsius, use the transform_sensor function to convert between the two.

Normalization is when you transform variables into different ranges. For example, one sensor variable in your agent system might have very large values (in the thousands), but another variable might have small values (in the tenths), so you might use the transform_sensor function to transform these disparate sensor values to a range from 0 to 1 so that they can be better compared and used in the agent system.
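
For example, a transform_sensor implementation covering both cases might look like this sketch, assuming sensors named "T" (reported in Fahrenheit) and "flow_rate" (with values in the thousands); the names, units, and ranges are illustrative.

python
def transform_sensor(self, sensor, action):
    # Conversion: Fahrenheit to Celsius for the temperature sensor (assumed units)
    sensor["T"] = (sensor["T"] - 32) * 5 / 9
    # Normalization: scale a large-valued sensor into the 0-1 range (assumed max of 5000)
    sensor["flow_rate"] = sensor["flow_rate"] / 5000
    return sensor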

You may want to transform action variables for the same reasons as sensor variables.

python
def transform_action(self, transformed_sensor, action):
    return action

Use the filtered_sensor_space function to pare down the list of sensor variables you need for a particular skill. Pass only the information that a skill or module needs in order to learn or perform well.

python
def filtered_sensor_space(self):
        return ["state1"]
Return a list of all the sensor variables that you want passed to the skill by this teacher.

Configure Programmed Algorithms as Skill Agents

Controllers are programmed skill agents used in Composabl agent systems. These may be optimizers, PID or MPC controllers, heuristics, or any other Python algorithms that are useful to control parts of a process. Configure controllers in the SDK and then publish them to the UI to use in agent systems.
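
For instance, a simple PID controller configured as a programmed skill agent might look like the following sketch. The gains, sensor names, and base class usage are illustrative assumptions rather than platform requirements.

python
class PIDController:  # in practice this would inherit from the Composabl controller class
    def __init__(self, *args, **kwargs):
        self.kp, self.ki, self.kd = 1.0, 0.1, 0.05  # assumed gains
        self.integral = 0.0
        self.last_error = 0.0

    async def compute_action(self, obs, action):
        # Drive the temperature T toward the reference Tref (assumed sensor names)
        error = obs["Tref"] - obs["T"]
        self.integral += error
        derivative = error - self.last_error
        self.last_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative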

Create a New Skill Agent

To create a skill agent in the Python SDK, begin by logging in from the CLI with composabl login.

Then type composabl skill new.

Give the skill agent a name and a description in response to the prompts that follow. Choose your skill agent type as a controller (a programmed module like an optimization algorithm or MPC controller).

Specify the folder where you’d like to create the skill agent.

The Composabl SDK will create a folder and a Python controller.py file from the template.

The Python Controller Class

The Python controller class offers several functions that you can use to build your algorithms or rule-based controllers in Composabl.

Functions for Training

Controllers don't need training, since they are based on programming rather than learning, but they include functions that connect them to the rest of the agent system during training.

Initialize your algorithm: the __init__ Function

The __init__ function can be used to initialize your algorithm and define initial configuration variables. It is called once when the Runtime starts. Suppose, for example, that you want to use an MPC algorithm, as in the code samples at the end of this section.

Process Observation to Compute Action: the compute_action Function

The compute_action function processes the observation and returns an action. This function returns a number that represents the action that will be sent to the simulation.

The compute_termination function tells the Composabl platform when to terminate a practice episode and start over with a new practice scenario (episode). From a controller perspective, it makes the most sense to terminate an episode when the agent system succeeds, fails, or is pursuing a course of action that you do not find likely to succeed. This function returns a Boolean flag (True or False) indicating whether to terminate the episode. You can calculate this criterion however seems best.

The compute_success_criteria function provides a definition of the skill agent's success and a proxy for how completely the agent system has learned the skill. The platform uses the output of this function (True or False) to calculate when to stop training one skill agent and move on to training the next. It is also used to determine when to move to the next skill agent in a fixed-order sequence: the agent system cannot move from one skill agent in a fixed-order sequence to the next until the success criteria for that skill agent are reached.

Here are some examples of success criteria definition:

  • A simple but naive success criterion might return True if a certain sensor or KPI value for an episode or scenario crosses a threshold, but False if it does not.

  • A more complex success criterion might compare the agent system to a benchmark controller or another agent system across many key variables and trials. It returns True if the agent system beats the benchmark on these criteria, but False otherwise.

Functions to Manage Information Inside Agent Systems

As information passes through perceptors, skill agents, and selectors in the agent system, sometimes it needs to change format along the way. You can use three teaching functions to transform sensor and action variables inside agent systems: transform_sensors, transform_action, and filtered_sensor_space.

To transform sensor variables, use the transform_sensor function to calculate changes to specific sensors, then return the complete set of sensor variables (the observation space).

Two of the most common reasons for transforming sensor variables are conversion and normalization. For example, if a simulator reports temperature values in Fahrenheit, but the agent system expects temperature values in Celsius, use the transform_sensor function to convert between the two.

Normalization is when you transform variables into different ranges. For example, one sensor variable in your agent system might have very large values (in the thousands), but another variable might have small values (in the tenths), so you might use the transform_sensor function to transform these disparate sensor values to a range from 0 to 1 so that they can better be compared and used in the agent system.

You may want to transform action variables for the same reasons as sensor variables.

Use the filtered_sensor_space function to pare down the list of sensor variables you need for a particular skill agent. Pass only the information that a skill agent or module needs in order to learn or perform well.

Create a New Perceptor

Perceptors use the SDK and CLI workflow.

To access a template for a perceptor, type composabl perceptor new into the CLI. Composabl will then generate a perceptor template that you can populate with your information.

In this simple perceptor example, we calculate the perceptor outputs that will be added as new sensor variables, and we create a list of perceptors that comprise the perception layer (see the DeltaCounter code sample below).

If you choose not to use the AI assistant, just click Skip. You can then configure your goals and skill agents manually.

Some use cases will have multiple goals the agent system must meet simultaneously. In fact, the need to balance more than one goal is a sign that a use case is a good fit for Machine Teaching. In the industrial mixer use case, the agent system needs to get the right chemical product concentration and keep the tank's temperature from becoming dangerously hot.

When you set up a use case in the UI, you can use natural language to describe your goals for the use case. Composabl's copilot assistant uses this information to create a starter goal in the format the platform can use for training. This goal may be ready to use, but you will likely need to adjust it.

For example, for the industrial mixer, we want to maximize the concentration of the product, Ca.

For example, we want to avoid the temperature, T, getting above 400 degrees Kelvin in the industrial mixer use case.

Create a teacher to implement a learned skill agent that uses DRL to make decisions. You can set up your teacher to train the skill agent using goals or to train with rewards. The skill agent will then train and practice in simulation until it can make decisions successfully.

Create a controller to implement a programmed skill agent that makes decisions based on programming rather than learning. A controller can use math, rules, optimization, or any other technology that can be expressed in a Python program. Configure controllers with the SDK and publish them to the UI to use in agent systems.

Create a selector to orchestrate skills together. Selectors are specialized skills that direct the agent system how to choose between different decision-making skills. Selectors can be either learned or programmed.

You can also create skills in two different ways, using the UI and using the SDK. If you use the SDK, you can then publish the skills to the UI to be included in agent system designs.

The compute_reward function provides the bulk of the feedback after each agent system action about how much that action contributed to the success of the skill. This function returns a number that represents the reward signal the agent system will receive for its last decision. Reward functions, as they are called in reinforcement learning, can be tricky to craft. Learn more about how to write good reward functions.

End Training: the compute_termination Function

Define Success: the compute_success_criteria Function

These have the same parameters and work the same way as the goal types in the UI.

Transform Sensor Variables: the transform_sensors function

Transform Decisions within the Agent System: the transform_action function

Filter the Sensor List: the filtered_sensor_space function

End Simulation: the compute_termination Function

Define Success: the compute_success_criteria Function

Transform Sensor Variables: the transform_sensors function

Transform Decisions within the Agent System: the transform_action function

Filter the Sensor List: the filtered_sensor_space function

def __init__(self, *args, **kwargs):
    self.counter = 0
    self.mpc = LinearMPC()  # assumes an MPC implementation is available in scope

async def compute_action(self, obs, action):
    self.counter += 1
    action = self.mpc.solve(obs)
    return action
python
async def compute_termination(self, transformed_sensors, action):
        return False
python
async def compute_success_criteria(self, transformed_sensors, action):
        return self.counter > 100
python
async def transform_sensor(self, sensor, action):
        return sensor
python
async def transform_action(self, transformed_sensor, action):
    return action
python
def filtered_sensor_space(self):
        return ["state1"]
Return a list of all the sensor variables that you want passed to the skill agent by this teacher.
python
class DeltaCounter():
    def __init__(self):
        self.key = "state1"
        self.previous_value = None

    def compute(self, sensors):
        if self.previous_value is None:
            self.previous_value = sensors[self.key]
            return {"delta_counter": 0, "state2": 0}

        delta = sensors["state1"] - self.previous_value
        self.previous_value = sensors["state1"]
        return {"delta_counter": delta, "state2": 0}

    def filtered_sensor_space(self, sensors):
        return ["state1"]

delta_counter = Perceptor(["delta_counter", "state2"], DeltaCounter, "the change in the counter from the last two steps")

Configure an ML Model as a Perceptor

In this tutorial, we will walk through how to integrate a trained machine learning (ML) model into your Composabl agent system as a Perceptor. A perceptor allows your agent system to interpret data from sensors, process it using a machine learning model, and output new variables that will help the agent system make better decisions.

The goal is to publish a pre-trained ML model as a perceptor that adds a new layer of perception to your agent system, enabling it to process sensor data in a more advanced way. This could be useful in a variety of scenarios, such as predictive maintenance, anomaly detection, or autonomous decision-making.


Step 1: Understanding the Perceptor

A Perceptor in Composabl is a module in the perception layer that inputs sensor data, processes it (potentially using an ML model), and outputs new variables that are automatically added to the list of available sensors.

For this example, let’s assume we are building a perceptor that uses a trained machine learning model to predict thermal runaway in a system.


Step 2: Setting Up the Trained Model

We will use a pre-trained ML model stored as a pickle file to predict thermal runaway based on certain temperature and chemical sensor readings. Here’s how to set up the trained ML model for use as a perceptor.

  1. Store the ML Model: Assume the ML model has been trained and saved as a .pkl file. For this example, the model is stored in the path: ml_models/ml_predict_temperature.pkl.

  2. Load the ML Model in the Perceptor: In the perceptor class, we will load the model and define how it processes the sensor data.
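
For context, a model like this could have been trained and saved with scikit-learn along the following lines. This is a hypothetical training-side script: the feature rows, labels, and classifier choice are placeholders, not part of the tutorial.

python
import pickle
from sklearn.linear_model import LogisticRegression

# Placeholder training data: rows of [Ca, T, Tc, delta_Tc]; label 1 = thermal runaway
X_train = [[8.5, 320.0, 295.0, 2.0], [4.2, 360.0, 305.0, 8.0]]
y_train = [0, 1]

model = LogisticRegression().fit(X_train, y_train)

# Save the model where the perceptor expects to find it
with open("ml_models/ml_predict_temperature.pkl", "wb") as f:
    pickle.dump(model, f)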


Step 3: Creating the Perceptor

Now, we’ll create the perceptor using the trained ML model to process the sensor data and predict thermal runaway events. The perceptor will be responsible for calling the model and returning the prediction as a new sensor variable.

We can start by creating the perceptor using the Composabl CLI with the following command:

composabl perceptor new

The new perceptor will have the following file structure:

.
└── perceptor_name/
    ├── perceptor_name/
    │   ├── __init__.py
    │   └── perceptor.py
    ├── pyproject.toml
    └── README.md

3.1. Configuring your pyproject.toml file

[project]

name = "perc" # Keep this as the same name as the directory
version = "0.1.0"
description = "perc" # You can change this to a description of your project
authors = [    
    { name = "John Doe", email = "john.doe@composabl.com" },
] # Change this to your name and email
dependencies = [    
    "composabl-core",    
    "<third-party-lib>", # Add any third-party dependencies here
]

[composabl]
# This metadata should not be changed with the exception of the entrypoint
# after the ":" marker if you change the name of the class. Those names
# should match.
type = "perceptor"
entrypoint = "perc.perceptor:DemoPerceptor"

3.2. Implementing the Perceptor in the perceptor.py file

Here’s the Python code to create the perceptor:

import pickle
from composabl import Perceptor, PerceptorImpl
from sensors import sensors  # Assuming sensors are already defined

class ThermalRunawayPredict(PerceptorImpl):
    def __init__(self, *args, **kwargs):
        self.last_Tc = 0  # Store the last observed temperature
        self.ml_model = pickle.load(open("ml_models/ml_predict_temperature.pkl", 'rb'))  # Load the trained ML model

    async def compute(self, obs_spec, obs):
        # Convert the observation to a dictionary using sensor names
        if not isinstance(obs, dict):
            obs_keys = [s.name for s in sensors]
            obs = dict(zip(obs_keys, obs))

        # Calculate delta temperature (change in Tc)
        if self.last_Tc == 0:
            delta_Tc = 5  # Initial change
        else:
            delta_Tc = float(obs['Tc']) - self.last_Tc

        # Prepare the input for the ML model
        X = [[float(obs['Ca']), float(obs['T']), float(obs['Tc']), delta_Tc]]
        
        # Get the prediction from the ML model
        prediction = self.ml_model.predict(X)[0]

        # Update last_Tc for the next iteration
        self.last_Tc = float(obs['Tc'])

        # Return the prediction as a new sensor variable
        return {"thermal_runaway_predict": prediction}

    def filtered_sensor_space(self, obs):
        # Define which sensor values the ML model needs
        return ['T', 'Tc', 'Ca']

In this perceptor:

  • We load the trained machine learning model from a pickle file.

  • The compute() method takes in sensor data (e.g., temperature, chemical concentrations), processes it, and uses the ML model to predict whether a thermal runaway event will occur.

  • The perceptor outputs the prediction as a new sensor variable, thermal_runaway_predict.

3.3. Adding the Perceptor to Your Agent System

Once the perceptor is defined, you can log in to the Composabl editor and add it to your agent system.


Conclusion

In this tutorial, we covered how to publish a trained ML model as a perceptor in Composabl. This allows the agent system to integrate more advanced decision-making by processing raw sensor data through a machine learning model and outputting predictions as new sensor variables. This method can be applied in various domains, such as predictive maintenance, anomaly detection, and control systems.


Add a Perception Layer

Adding perception modules to your agent can provide richer, complex, condensed, and nuanced information to the decision-making parts of the agent system. For example, you might include a computer vision model in your perception layer that inputs images or video from a camera and outputs classifications of objects that it identifies. You can also add large language models as perceptors to take in and interpret information in natural language.

Each module in the perception layer for a Composabl agent system inputs the sensor variables, processes those variables in some way, and outputs one or more new variables that the platform will automatically add to the list of sensors.

Perceptors can use any supported Python function or library to calculate outputs. They can even call machine learning and large language models or their APIs.

The next three pages explain how to use the SDK and CLI workflow to create new perceptors or configure existing models as perceptors to use in Composabl agent systems.

Add Perceptors to Agent Systems

Just like skill agents, perceptors can be dragged and dropped into agent systems using the UI. Perceptors will always be situated in the Perception layer that comes before orchestrators and skill agents. That’s because perception needs to be applied to the sensor inputs to create new variables that are then passed to the skills layer for the agent system to use in decision-making.

Configure Scenarios

Scenarios are situations where your agent system needs to behave differently to succeed.

Scenarios are created by carving out sections of the simulation space, as defined by specific configurations of variables and variable ranges. This allows you to train skill agents only in the scenarios where they will need to perform, leading to more efficient training and better performance. Orchestrators learn to recognize scenarios and pass control to the appropriate specialist skill agent to handle the scenario’s demands.

Types of Scenarios

How you define scenarios depends on the type of sensor variables you are working with.

Discrete variables are categories that describe a particular scenario. For each scenario, there is likely a perceptor in the agent system, such as a machine learning model, that inputs the sensors, processes the sensor values, and outputs the discrete variable category.

Continuous variables are numbers. When they are used to define a scenario, one number value determines the section of the space that your agent system will explore for decisions.

Sometimes a scenario is better defined by a range of continuous values than by a single continuous value. In that case, the scenario would be defined by a continuous variable range.

Here are some examples of how scenarios can be defined in different ways:

Use Case | Discrete Variables | Continuous Variables | Continuous Variable Ranges
Drone Control | windy, far_from_charger, low_battery | windspeed=20 (knots) | windspeed between 20-40 (knots)
Autonomous Driving | city, highway, high_traffic | vehicle_speed=65 (miles per hour) | vehicle_speed between 65-85 (miles per hour)
Cybersecurity Network Optimization | normal, high_traffic, cyberattack | site_traffic=0 (clicks per second) | site_traffic between 100-150 (clicks per second)
Process Control | startup, steady_state, shutdown | product_thickness=50 (millimeters) | product_thickness between 49.94-50.06 (millimeters)
Machine Control | break-in, normal, wearing_out | rpm=280 (revolutions per minute) | rpm between 250-295 (revolutions per minute)

Scenarios Example

Let’s say that we are creating an agent system to plan the operations of a restaurant with varying demand for three different recipes. We could create three different scenarios:

  • Low demand: 30 recipe A, 20 recipe B, and 10 recipe C

  • Normal demand: 60 recipe A, 45 recipe B, and 10 recipe C

  • High demand: 100 recipe A, 50 recipe B, and 25 recipe C

Once you have defined these scenarios, you could set the agent system to train in the low-demand scenario until the success criteria are reached. Then it would move to normal demand, and then to high demand. We would expect the agent to learn faster as it progressed through the scenarios, with cumulative knowledge building over time.
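
As a sketch, these three scenarios could be captured as simple dictionaries of condition values (variable names are illustrative; the exact scenario type used by the SDK is not shown here):

python
low_demand    = {"recipe_A": 30,  "recipe_B": 20, "recipe_C": 10}
normal_demand = {"recipe_A": 60,  "recipe_B": 45, "recipe_C": 10}
high_demand   = {"recipe_A": 100, "recipe_B": 50, "recipe_C": 25}

# Train in order of increasing difficulty so knowledge accumulates
training_curriculum = [low_demand, normal_demand, high_demand]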

Set Up Scenarios in the UI

Add Scenarios to a Use Case

Add scenarios to your use case by clicking on Scenarios from the left-hand menu to open the Scenarios page.

Click Add scenario to create a new scenario. Give your scenario a name and then click Add condition to configure it. You will then have the option to choose from any of the available sensor variables and apply conditions to them.

  • For continuous variables, select Is and enter the exact value.

  • For continuous variable ranges, select Is between and enter the range.

  • For discrete variables, select Is element of and enter the possible values.

Create Scenario Flows

In addition to identifying scenarios, Composabl also lets you teach your agent systems about how scenarios relate to each other. To do this, you can build scenario flows to establish sequences of scenarios for the agent system to practice.

Without scenario flows, the training platform will connect scenarios at random. But sometimes it is important for agent systems to practice scenarios in a specific order. For example, it might be important to practice flying a drone in high winds and then landing in the same conditions. Or an agent managing a production line might need to practice going from a scenario in which raw materials are scarce to one in which material costs rapidly rise. By creating a scenario flow, you can ensure that a given sequence of scenarios will be included in training.

Once you have scenarios built, create sequential sections of the task by clicking Add section.

After creating all the sections you need, drag and drop the scenarios into the sections.

Create as many flows as you need to capture the different scenario sequences your agent system will need to navigate.

Add Scenarios to Skill Agents

Scenarios are added to agent systems as part of configuring skill agents, so that each skill agent knows what specific conditions it needs to master. Not all scenarios are relevant to all skill agents. For example, a drone landing skill doesn’t need to practice taking off in windy weather.

The configurations you set up when building the scenario flows will appear in the skill agent configuration modal. For each section of the process, as defined by the flows, tell the orchestrator which scenarios it should apply by checking the boxes next to each scenario.

Orchestrate Skill Agents

Skill agents can be arranged in sequences or hierarchies, in skill groups, or as coordinated skills that output multiple decisions together. The orchestration structures reflect common design patterns that can be used to accelerate the design and creation of agent systems.

Orchestrate Skills in Hierarchies and Sequences

For some agent system designs, the task will be broken down into different skill agents that each control the system under certain conditions. For these agent systems, a special skill called an orchestrator chooses the right skill agent at the right time to accomplish the task. Orchestrators are the specialized supervisor skill agents that orchestrate the skill agents together, determining which skill agent to activate based on the conditions the system needs to respond to.

To add an orchestrator to an agent system, drag the orchestrator into your agent system above the skills layer.

You will then be prompted to configure the orchestrator.

You will also be prompted to choose between an additional set of options that correspond to two separate Machine Teaching design patterns.

  • Fixed-order sequence: perform the skills in a set order. This is used in the functional pattern, a design pattern that is useful for tasks that involve fixed sequences of actions.

  • Variable-order sequence: perform the skills in any order based on the orchestrator’s determination. This is used in the strategy pattern, a design pattern that is useful for tasks that require different control strategies in different situations or conditions.

Orchestrate Skill Agents in Groups

Unlike agent system designs that use an orchestrator to assign control to skills one at a time, agent systems with skill groups use skills working together to make decisions.

Skill groups always consist of two or more skill agents. To create a skill group, simply drag the second skill under the first, and a skill group will automatically be created.

Skill groups are used for the plan-execute pattern, where one skill determines what the action should be and a second skill then “turns the knobs” to implement the decision.

In the industrial mixer example, the DRL skill agent is able to train effectively because the actions of the MPC controller are predictable. That means that it can practice and learn knowing that variations in performance are due to its own actions.

In agent systems with multiple DRL skill agents arranged in plan-execute patterns, Composabl will always train the skills from the bottom to the top. In other words, the execute skill will have to achieve competence before the plan skill will start training. That allows each skill to effectively interpret the feedback from the system without confusion from each other.

Orchestrate Coordinated Skills

Some tasks require multiple skill agents to work together on a single decision, but in parallel rather than in sequence. Skill agents for these tasks use coordinated skills that learn to take action together toward a goal. Also known as Multi-Agent Training, coordinated skills are trained using a coach, rather than a teacher.

Examples of Coordinated Skill Agents

Traffic Optimization: Enhancing traffic flow and safety by teaching individual vehicles to navigate optimally and cooperate with each other.

Collaborative Robotics: Enabling robots to work together on tasks such as assembly in manufacturing or coordination in logistics.

Smart Grids: Optimizing energy distribution by having agents represent power plants, storage, and consumers to improve efficiency and stability.

Multiplayer Games: Creating adaptive and intelligent NPCs that can offer dynamic challenges to players in competitive or cooperative game settings.

Communication Networks: Improving network performance by optimizing resource allocation and traffic routing through agents representing network components.

Environmental Management: Balancing economic, ecological, and social goals in land use and resource management by simulating stakeholders as agents.

Healthcare Logistics: Strategizing resource allocation and treatment plans in scenarios like pandemics by considering the actions of hospitals, pharmacies, and patients as agents.

Supply Chain Optimization: Minimizing costs and delivery times in supply chains by coordinating agents representing various stages of the supply chain process.

Orchestrate Coordinated Skills with the SDK

Coordinated skill agents are not yet available in the UI. In the SDK, we have expanded the API to integrate coordinated skills through the add_coordinated_skill method on your agent. This method accepts a configured CoordinatedSkill class. Just as with the Teacher or Controller classes, you implement the coach for a coordinated skill by inheriting from the Coach class.

The coordinated skill agent takes the incoming observation and action spaces and passes them to the sub-skills as a shared environment for observation and action. The sub-skills then return their observations and actions, which are passed back to the coordinated skill agent. The coordinated skill agent then returns the combined observations and actions to the agent system.

python
# ####################################################################################################
# Define the Coordinated Coach
# ####################################################################################################
class CoordinatedCoach(Coach):
    def __init__(self):
        self.counter = 0

    def compute_reward(self, transformed_sensors, action, sim_reward):
        """
        Computes the reward for the given transformed observation and action
        :param transformed_sensors: The transformed observation
        :param action: The actions dict
        :param sim_reward: The reward from the simulation
        :return: The reward, as a dictionary, with each key the sub-skill name and the value the reward
        """
        self.counter += 1
        return {"skill1": 1, "skill2": 1}

    def compute_success_criteria(self, transformed_sensors, action):
        # keep the episodes short to make testing quicker
        return self.counter > 100

    def compute_termination(self, transformed_sensors, action):
        # keep the episodes short to make testing quicker
        return self.counter > 150

    def transform_action(self, composabl_sensors, action):
        return action


# ####################################################################################################
# Construct your Agent
# ####################################################################################################
s1 = Skill("skill1", IncrementTeacher)
s2 = Skill("skill2", IncrementTeacher)

a = Agent()
a.add_coordinated_skill(CoordinatedSkill(
  "my-coordinated-skill",
  CoordinatedCoach,
  [s1, s2]
))

Get Started

A Two-Part Platform

Composabl is a two-part platform with a no-code UI and a Python SDK and CLI. The interplay of these parts gives Composabl its combination of usability and power.

The two parts enable teams to work together effectively. People and roles who primarily use code, such as data scientists and controls engineers, use the SDK to create modules like ML models and deep reinforcement learning skills. Then, subject matter experts, process engineers, and others can use the no-code interface to easily create teams of agents from these modular building blocks and train them to succeed.

We designed the platform this way because, for complex, high-value processes, there are some tasks that can only be done through coding and some team members who prefer to work in code, while other tasks are better performed through a visual interface. Both parts of the platform work together.

Most users use both parts of the platform to some extent but spend more time in the no-code UI or the SDK, depending on their expertise and role. How you will use the platform depends on your role and what part of the process you are working on.

Set Up Your Environment

Access Composabl

You can access Composabl via a no-code UI and an SDK. They work together to enable you to build, train, and deploy autonomous agent systems.

No-Code UI

Python CLI and SDK Installation

Installing the SDK

pip install -U composabl

Development

pip install -U composabl-dev

Note: The Composabl CLI and SDK require Python version 3.10.x
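
If you are not sure which Python version is active in your environment, you can check before installing (shell commands shown; adjust for your setup):

python3 --version   # should report Python 3.10.x
pip --version       # confirms which interpreter pip is bound to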

Workflow Steps

  • Step 3: UI and SDK | Create agent systems

  • Step 7: Notebook | Export Historian training data and perform detailed analysis

Configure an LLM Model as a Perceptor

You can use an LLM as a perceptor to add language and communication capabilities to your agent system.

This allows you to create human-like assistants or copilots who can contribute natural language capabilities to your agent system. Composabl has several personas for LLM assistants to help structure your agent design.

  • The analyst interprets sensor data and passes it to an interface that the user can access, allowing real-time monitoring of conditions and the agent system's responses.

  • The executive reads external data sources in text and reports information to the agent system, such as trends in the business press that would help to anticipate demand for a product.

  • The plant manager allows operators to communicate directly with the agent system and gives it instructions based on information that would not otherwise be available in its sensor space.

LLM perceptors can either:

  1. Output language to the operator about what the agent system is doing (e.g., the analyst)

  2. Take in inputs in natural language and then transform them into information that the decision-making layer of the agent can use (e.g., the executive and plant manager)

Create an LLM Perceptor

Step 1: Create the perceptor shell

From the CLI, when logged into Composabl, type composabl perceptor new. You will be prompted for a location to save your new perceptor, and then a new directory with your perceptor will be created.

This will include the pyproject.toml file that will allow you to publish the perceptor to the UI once it is created.

Step 2: Define the perceptor class

Within the perceptor.py file, create the API call and prompt for the LLM.

Analyst Perceptor Code Sample

The analyst displays information to the human user, but doesn't send information to the decision-making layer of the agent, so the perceptor returns 0.
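
A minimal sketch of an analyst-style perceptor is shown below, assuming an OpenAI-style chat client; the client, model name, prompt, and sensor names are all illustrative assumptions, not the original sample.

python
import os
from openai import OpenAI  # assumed LLM client; any chat API would work
from composabl_core import PerceptorImpl

class AnalystPerceptor(PerceptorImpl):
    def __init__(self, *args, **kwargs):
        self.client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

    async def compute(self, obs_spec, obs):
        # Summarize current readings for the human operator
        prompt = f"Summarize these reactor readings for an operator: {obs}"
        response = self.client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model name
            messages=[{"role": "user", "content": prompt}],
        )
        print(response.choices[0].message.content)  # displayed to the operator
        # The analyst reports outward only, so it returns 0 to the decision layer
        return {"analyst_summary": 0}

    def filtered_sensor_space(self, obs):
        return ["T", "Tc", "Ca"]  # illustrative sensor names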

Executive Code Sample

The executive in this sample, related to the industrial mixer use case, automatically queries a chemical engineering LLM for advice about control actions to take.

The perceptor returns an action that it recommends that the decision-making layer of the agent system take. This becomes a new sensor variable that the skill agent teacher(s) will take into account when training the agent system in simulation.
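
An executive-style perceptor follows the same shape but feeds its output back into the agent system. A compact sketch, again with an assumed LLM client and an illustrative prompt and parsing strategy (it reuses the imports from the analyst sketch above):

python
class ExecutivePerceptor(PerceptorImpl):
    def __init__(self, *args, **kwargs):
        self.client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

    async def compute(self, obs_spec, obs):
        prompt = (
            f"Reactor temperature is {obs['T']} K and concentration is {obs['Ca']}. "
            "Reply with a single number: the recommended coolant temperature change."
        )
        response = self.client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model name
            messages=[{"role": "user", "content": prompt}],
        )
        try:
            recommendation = float(response.choices[0].message.content.strip())
        except ValueError:
            recommendation = 0.0  # fall back if the reply is not a clean number
        # Exposed as a new sensor variable for the decision-making layer
        return {"llm_recommended_action": recommendation}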


Step 3: Filter the Sensor Space

Composabl agent systems can include text fields in perceptors, but they must be transformed or filtered out in the teacher.py file before training with DRL. For any text variables that are not transformed into a different data type, use the filtered_sensor_space method of the teacher to remove them.
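
For example, if a perceptor adds a free-text variable, the teacher's filtered_sensor_space can simply omit it so that only numeric variables reach DRL training (sensor names are illustrative):

python
def filtered_sensor_space(self):
    # Keep numeric sensors; omit the raw text variable added by the LLM perceptor
    return ["T", "Tc", "Ca", "llm_recommended_action"]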

Step 4: Publish the Perceptor

Publish the perceptor to the UI.

Navigate to the folder above your perceptor. Then publish your perceptor.

Select the organization and project where you want to add your perceptor. Then refresh your Agent Orchestration Studio to see the perceptor and add it to agent systems.

Strategy Pattern

The strategy pattern is one of the key design patterns of machine teaching. When you use the strategy pattern, you break down the task into specific skill agents that each handle one aspect of the process to be controlled. This allows you to "teach" the agent system using subject matter expertise.

In the strategy pattern, each skill agent is either trained using deep reinforcement learning or controlled with a programmed algorithm. Then, a special skill agent called an orchestrator decides which skill agent should make the decision based on the current conditions.

In the industrial mixer problem, the process is divided into three skill agents based on the phase of the process. All three action skill agents and the orchestrator are trained with DRL: each skill agent practices in the conditions it will face and learns to control its part of the process by experimenting over time.

Think of the strategy pattern as like a math class with three students. Student A loves fractions, Student B is great at decimal problems, and Student C thinks in percentages. The orchestrator is their teacher. She reads each question, sees what kind of problem it is, and then assigns it to the student who can solve it best, because of their own special math talent.

Let's get started configuring this agent system!

1. Create your skill agents

This agent system has three skill agents called Start Process, Control Transition, and Produce Product. To create these skill agents in the UI, go to the skill agent page and click Add skill agent. Create all three skill agents and then set the goals and constraints.

2. Set Skill Agent Goals and Constraints

The goal for these skill agents is to maximize yield, and the constraint is to keep the temperature from going above 400 degrees Kelvin.

The goals and constraints are exactly the same in all three skill agents. The agents become specialized during training, as each skill agent trains in a different scenario, corresponding with the three phases of the process. We will create these scenarios later in the tutorial.

  1. Click Add goal. In the left drop-down menu, select Maximize, and in the right one, select Eps_Yield.

  2. Click Add constraint. In the left drop-down menu, select Avoid, and in the right one, select T. After you select T, you'll see a slider appear for you to set the boundaries you want to train the system to avoid. In this case, set the boundaries from 400 to 500.

  3. Save your skill agent and return to the Agent Orchestration Studio.

3. Add the Skill Agents to Your Strategy Pattern Agent System

Drag the skill agents Start Process, Control Transition, and Produce Product that you can now see on the left-hand side of your use case onto the skills layer. Drag the skill agents from the side in the order you want them to be used.

4. Configure the Orchestrator

The green diamond that appears when you place multiple skill agents alongside each other is the orchestrator. This is the "math teacher" agent that decides which specialized skill agent should be chosen to make each decision.

The goals of the top-level orchestrator in an agent system will typically be the same as the goals of the agent system as a whole. So, we can set it to Maximize Eps_Yield.

A fixed-order sequence is appropriate for a phased process like the industrial mixer reaction. That means the orchestrator applies the skill agents one at a time, rather than switching back and forth between them.

5. Configure Scenarios

Configuring scenarios is what allows the skill agents to differentiate from each other. The three specialized skill agents practice only in their designated phase of the process and learn to succeed in their own specific conditions. The orchestrator practices with the whole process so that it knows which skill agent to choose at any point.

5.1 Add Scenarios

Go to the Scenarios page using the left-hand navigation menu. Click Add Scenario to create a new scenario for your agent to use in training.

When building an agent system for your use case, you will define the scenarios based on your knowledge of the task and process. In this case, we provide the values that define the phases of the chemical manufacturing process. Create these scenarios for your agent:

  • Full reaction: Cref Is 8.57, Tref Is 311

  • Startup: Cref Is 8.5698, Tref Is 311.2612

  • Transition: Cref Is 8.56, Tref Is 311, Is 22

  • Production: Cref Is 2, Tref Is 373.1311

5.2 Create Scenario Flows

Scenario flows allow you to connect scenarios that have a sequential relationship to ensure that your agent gets practice in navigating the different conditions in the order in which they will occur.

For this problem, you do not need to create sequential connections between the scenarios. Drag all the scenarios to the first column to make them available to your skill agents and orchestrators.

5.3 Add Scenarios to Skill Agents and Orchestrators

Once you have your scenarios set up and connected with scenario flows, you can add them to skill agents and orchestrators to define the conditions each one needs to practice in. This helps them develop their specialized expertise.

In the Agent Orchestration Studio, click on each skill agent and the orchestrator in turn. Check the box for each scenario to apply to the skill agent.

  • Start Process: Startup

  • Control Transition: Transition

  • Produce Product: Production

  • Orchestrator: Full reaction

6. Run Your Training Session

We are ready to train your agent system and see the results. Select the cluster you want to use and the number of training cycles. We suggest you run 150 training cycles.

You will see the skill agents training one at a time, and you can assign the number of cycles you want each skill agent to use. Composabl automatically assigns an equal number of training cycles to each skill agent, but in some agent system designs, some skill agents require more training than others. For example, in this use case, the transition phase is more difficult to control than the two steady states, so the Control Transition skill agent may need more training time than the others to become effective.

7. View Results

When the training has been completed, you can view your results in the training sessions tab in the UI. This will show you information on how well the agent is learning.

You will likely see a steep learning curve as the agent system experiments with different control strategies and learns from the results. When the learning curve plateaus, that usually means the skill agent is trained.

Analyzing the Strategy Pattern Agent's Performance

We tested this fully trained agent system and plotted the results.

  • Conversion rate: 92%

  • Thermal runaway risk: Low

Orchestrators can be trained with DRL just like the other skill agents, and you set goals for them the same way. Click on Configure Orchestrator to be taken to the menu to set your orchestrator’s goals. Note that the goals of the agent system’s top-level orchestrator should be the same as the goals for the agent system as a whole.

A common plan-execute design is the set point pattern, where a DRL plan skill agent determines what the set point of a system should be, and then a traditional automation technology (a controller that uses model predictive control or optimization) determines what actions are necessary to get the system to the set point. For example, in the industrial mixer use case, the DRL plan skill agent decides what temperature the liquid in the tank should be. Then the MPC execute skill agent determines what temperature of coolant needs to flow around the tank to achieve the desired set point.

The first thing you will need to do after you log in to your account is to connect a simulation to Composabl.

Step 1: UI | Create your first use case

Step 2: UI | Set performance goals

Create skill agents to train with rewards using deep reinforcement learning in the UI or the SDK.

Create or package ML models to import to the UI with the SDK to add advanced perception to agent systems.

Create or package LLMs to import to the UI with the SDK to add natural language to agent systems.

Create or package controllers and optimization algorithms to import to the UI with the SDK to add programmed decision-making to skill agents.

Step 4: SDK | Publish to the UI with one CLI command

Step 5: UI | Orchestrate modular components together to create teams of agents in the UI

Step 6: UI | Train your teams of agents at scale with one click using the UI

Step 8: UI and SDK | Export trained multi-agent systems and connect them to the Composabl runtime for deployment

Scenarios are key to successfully training an agent system with the strategy pattern. Scenarios are different possible conditions represented within the simulation. Skill agents are trained to specialize in different scenarios - for example, the Start Reaction skill agent specializes in controlling the reaction when the temperature and concentration levels are those found at the beginning of the reaction.

This agent system's performance is not perfect, but it stays closer to the benchmark line than either of the two single-skill agent systems. It just needs some help avoiding thermal runaway. We can provide that by adding a perception layer. The code below shows two example perceptors - an analyst that posts LLM commentary to a factory console, and a chemical engineer that returns a recommended action - followed by the CLI commands to publish them.
from fake_llm import llm_client
from fake_factory_console import factory_console_client

from composabl_core import PerceptorImpl

class AnalystPerceptor(PerceptorImpl):
    """
    The analyst type that displays information to the human operators but doesn't send any information to the agent.
    """
    def __init__(self, *args, **kwargs):
        # Set up the clients for the LLM and the factory console
        self.llm_client = llm_client()
        self.factory_console_client = factory_console_client()

    async def compute(self, obs_spec, obs):
        # First, ask the LLM for its thoughts on the current state of the plant
        llm_response = self.llm_client.ask(f"You are controlling a CSTR plant, the current state of the plant is {obs}. What are your thoughts on the current state of the plant?")

        # Second, post the LLM's thoughts to the factory console for a human to read
        self.factory_console_client.post(f"The LLM thoughts on the current state of the plant are: {llm_response}")

        return {"chemical_engineer_llm": 0}
from fake_llm import llm_client
from composabl_core import PerceptorImpl

class ChemicalEngineerPerceptor(PerceptorImpl):
    """
    The perceptor for the text agent
    """
    def __init__(self, *args, **kwargs):
        self.llm_client = llm_client()

    async def compute(self, obs_spec, obs):
        """
        Asks the LLM for its thoughts on the current state of the plant, and returns a recommended action
        """
        llm_response = self.llm_client.ask(f"You are controlling a CSTR plant, the current state of the plant is {obs}. What action do you recommend?")
        # Extract the recommended action from the response text (placeholder parsing for the fake client)
        llm_action = llm_response.find("action")
        return {"chemical_engineer_llm": llm_action}
composabl login
composabl perceptor publish foldername

Analyze Data in Detail with the Historian

In this tutorial, we will explore how to use the historian to validate trained agent systems in Composabl and analyze training logs. The historian stores historical time-series data in an optimized format, parquet (https://www.databricks.com/glossary/what-is-parquet), which helps in evaluating how the agent system is performing during training.

Step 1: Accessing the Historian Data

The historian file stores time-series data essential for validating agent system training. There are several ways to access and store the historian data, but the recommended format is a delta file (parquet).

  1. Understanding the Format:

    • The historian data is typically large, around 500 megabytes for standard operations. It is stored in a Delta Lake file format, optimized for time-series data and supporting efficient queries.

  2. Downloading the Historian File:

    • From the Composabl UI, download the historian file. This file will come in a compressed format (e.g., .gz).

    • After extracting it, you should see the delta file containing time-series data.

Step 2: Setting Up for Validation

  1. Unpacking the Historian File:

    • If the historian file is compressed (e.g., .gz), unpack the file using a tool like gzip:

      
      gunzip -k historian_file.gz 
      
    • Once unzipped, you’ll see a 10 MB+ delta file with historical time-series data.

  2. Understanding the Delta File:

    • The delta file is optimized for fast reads and writes of time-series data.

    • It supports an append-only structure, which ensures that each new piece of data can be added efficiently without modifying the existing data.

Step 3: Querying the Historian Data

  1. Setting Up a Query Environment:

    • To validate your agent system’s training, you’ll need to set up an environment that allows you to query the delta file. Delta Lake integrates well with systems like Apache Spark, but for simple querying, you can use tools like pandas in Python.

  2. Querying for Agent Training Logs:

    • Extract and analyze relevant historical data from the delta file. Here's a simple Python example for querying the delta file using pandas:

    
    import ast
    import json

    import matplotlib.pyplot as plt
    import pandas as pd

    # Load the historian delta file
    df = pd.read_parquet('historian_delta_file.parquet')
    df = df.sort_values(by=['timestamp'])

    # Keep step and skill-training records
    df_data = df[df['category_sub'].isin(['step', 'skill-training', 'skill-training-cycle'])]

    # Keep rows whose "data" column contains composabl_obs, plus the training records
    df_data = df_data[
        (df_data['data'].str.contains('composabl_obs'))
        | (df_data['category_sub'].str.contains('skill-training'))
        | (df_data['category_sub'].str.contains('skill-training-cycle'))
    ]

    def convert_to_dict(x):
        """Parse the serialized data column, trying JSON first and Python literals second."""
        try:
            return json.loads(x)
        except (TypeError, ValueError):
            try:
                return ast.literal_eval(x)
            except (TypeError, ValueError, SyntaxError):
                return None

    df_data['data'] = df_data['data'].apply(convert_to_dict)

    # Extract the skill name, reward, observations, and training cycle from each record
    df_data['skill_name'] = df_data['data'].apply(lambda x: x['name'] if x and 'is_done' in x else None)
    df_data['skill_name'] = df_data['skill_name'].bfill()

    df_data['reward'] = df_data['data'].apply(lambda x: x['teacher_reward'] if x and 'composabl_obs' in x else None)
    df_data['obs'] = df_data['data'].apply(lambda x: x['composabl_obs'] if x and 'composabl_obs' in x else None)

    df_data['cycle'] = df_data['data'].apply(lambda x: x['cycle'] if x and 'cycle' in x else None)
    df_data['cycle'] = df_data['cycle'].bfill()

    df_data = df_data[df_data['category_sub'] == 'step']
    print(df_data)

    # Mean reward per run, skill, and training cycle
    df_group = df_data.groupby(['run_id', 'skill_name', 'cycle'])['reward'].mean()

    # Unpack the observation dicts into one column per sensor
    df_obs = pd.DataFrame(
        data=[[v[0] for v in list(x.values())] for x in df_data['obs'].values],
        columns=list(df_data['obs'].iloc[0].keys()),
    )
    df_obs['cycle'] = df_data['cycle'].values
    df_obs['run_id'] = df_data['run_id'].values
    df_obs['skill_name'] = df_data['skill_name'].values

    # Plot mean episode reward by cycle for each run and skill
    for run_id in list(set([x[0] for x in df_group.index])):
        for skill in list(set([x[1] for x in df_group.index])):
            plt.plot(df_group[run_id][skill])
            plt.ylabel('Mean Episode Reward')
            plt.xlabel('Cycle')
            plt.title(f'{run_id} - {skill}')
            plt.show()

Key Benefits of Using the Historian for Validation:

  • Optimized Data Handling: The Delta Lake format is designed for fast querying, making it ideal for time-series data.

  • Efficient Storage: The append-only nature ensures that new data can be added without overwriting or modifying existing data, making it easy to track data over time.

  • Continuous Monitoring: By continuously adding data to the historian, you can validate your agent system's long-term impact on machine performance, uptime, and safety.

Analyze Agent System Behavior

Analyzing agent system behavior is a key part of using Composabl. This capability is essential for:

  • Building agent systems that beat performance benchmarks

  • Monitoring the performance of your agent systems and simulators in real-time

  • Analyzing your agent systems and simulators to understand what went wrong in case of problems

  • Creating dashboards and reports to share with your team or customers

Composabl offers multiple options for analyzing agent system behavior. For high-level summaries, you can use the benchmark reporting features in the UI. For more in-depth information, you can use the Historian and raw benchmark data in the SDK, which creates a database of all the information gathered during training and deployment. You can also design an LLM module in your agent to provide customizable communications capacity using natural language.

See full code samples and more examples.

Access a Trained Agent System

Once you have completed training an agent system and are ready to deploy it, you can download it from the UI.

  1. Go to the Training tab in your project.

  2. Select the training session for the agent you want to deploy.

  3. Click on the Artifacts drop down in the upper right of the screen.

  4. Click Inference Model and you will download a file called agent.json.

Create Skill Agents

You can use Composabl to create skill agents that learn with deep reinforcement learning. Configure the Composabl teacher by setting goals, constraints, and success criteria for your skill agent. Composabl then turns these into reward functions and trains the skill agent.

Create a New Skill Agent

To create or edit a skill agent, follow these steps:

  1. Navigate to the Skill Agents page

  2. Click on a skill agent, or click + to create a new skill agent

  3. Configure the skill agent

Use the dropdown menus to select the variables and then define the parameters for each goal, constraint, or success criterion you want to include.

Choose Implementation Method

To create a new skill agent that learns with deep reinforcement learning, select Teacher under the Implementation Method dropdown. The other option, Controller, is used for a programmed skill agent that is configured with the Composabl SDK and published to the Agent Orchestration Studio.

Configure Goals

Goals define what a skill agent should do. Goals apply to one of the sensor variables and are defined using one of three possible directives:

  • Maximize: Maximize the value of the variable

  • Minimize: Minimize the value of the variable

  • Maintain: Keep the variable at a specified value or track a specified variable (such as a set point)
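
For example, for the industrial mixer, we want to maximize the yield of the product over the course of an entire episode, eps_yield.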

Configure Constraints

Constraints set the boundaries for the skill agent. They define rules that the skill agent must follow during operation. Constraints are defined using one of two possible directives:

  • Avoid: The skill agent learns to keep the variable from reaching a specified value or range through withholding rewards

  • Terminate: When the actions of the skill agent lead to certain conditions within a variable, the skill agent has failed and must stop and start a new episode
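
For example, we want to avoid the temperature, T, getting above 400 degrees Kelvin in the industrial mixer use case.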

Configure Success Criteria

Success criteria tell the skill agent when it's doing something right. They are defined using one of two possible directives:

  • Approach: The skill agent learns to get close to a specified value by getting increased reward

  • Succeed: When the success criteria are achieved, the session ends, and a new one begins so that the skill agent can keep practicing and learn to win every time
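
Composabl turns these directives into reward functions for you, so you never write one by hand in the UI. As a rough mental model only - not the Composabl implementation - a Maximize goal, an Avoid constraint, and a Succeed criterion combine like the sketch below, where the variable names and thresholds are illustrative:

def reward_step(obs: dict) -> tuple[float, bool]:
    """Illustrative only: Maximize Eps_Yield, Avoid T in [400, 500], Succeed at a target yield."""
    reward = obs["Eps_Yield"]          # Maximize: reward grows with the variable

    if 400 <= obs["T"] <= 500:         # Avoid: withhold reward inside the banned range
        reward = 0.0

    done = obs["Eps_Yield"] >= 0.95    # Succeed: end the episode so practice can restart
    return reward, done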

Connect a Simulator to Composabl

In this tutorial, we will learn how to upload simulators via the Composabl CLI, as well as how to upload your custom, adapted simulators to the Composabl UI.

Upload simulators via Composabl CLI

To upload a simulator that already follows the Composabl simulation specification, all you need to do is run the following command from the simulation folder:

composabl sim publish
  • After that, you can go to the Composabl editor and connect that sim to any project.

Upload Third-Party Simulators via Docker

Prerequisites

If your simulator isn't already compatible with the Composabl platform, you will need to create gRPC bindings and upload it as a Docker image in the Composabl editor. You can follow along with these Composabl API patterns and the following structure:

.
└── your-simulator-folder/
    ├── docker/
    │   └── entrypoint.sh
    ├── src/
    │   ├── exceptions/
    │   │   └── invalid_usage.py
    │   ├── __init__.py
    │   ├── main.py
    │   ├── server_impl.py
    │   └── sim.py
    ├── Dockerfile
    └── requirements.txt

Going through the files:

  • docker/entrypoint.sh: This file is the entrypoint of your Docker container.

  • src/exceptions/invalid_usage.py: This file contains the exception class InvalidUsage that is used to raise exceptions in the simulator.

  • src/__init__.py: This file is the initialization file of the module. No code is needed here, but for Python to recognize the folder as a module, this file is necessary.

  • src/main.py: This file is the main file of the simulator. It uses the composabl_core.networking module to expose the simulator to the Composabl platform. This file is also available zipped along with this tutorial.

  • src/server_impl.py: This file contains the implementation of the server that will be used to run the simulator.

  • src/sim.py: This file contains your implementation of the simulator itself. Usually, an Env class (inheriting from gym.Env) is implemented here, and it is used to run the simulator (see the sketch after this list).

  • Dockerfile: This file is the Dockerfile that will be used to build the Docker image of your simulator.

  • requirements.txt: This file contains the Python packages that are necessary to run your simulator. It is used to install the necessary packages in the Docker image.
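
If you are writing src/sim.py from scratch, the minimal sketch below may help. It assumes the gymnasium package and a toy one-variable process; the spaces, dynamics, and set point are placeholders for your own simulator logic, not a Composabl requirement:

import gymnasium as gym
import numpy as np
from gymnasium import spaces

class Env(gym.Env):
    """Toy simulator: a single temperature-like state nudged by a continuous action."""

    def __init__(self):
        self.observation_space = spaces.Box(low=0.0, high=500.0, shape=(1,), dtype=np.float32)
        self.action_space = spaces.Box(low=-1.0, high=1.0, shape=(1,), dtype=np.float32)
        self.state = np.array([300.0], dtype=np.float32)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        self.state = np.array([300.0], dtype=np.float32)
        return self.state, {}

    def step(self, action):
        # Placeholder dynamics: the action nudges the state directly
        self.state = (self.state + np.asarray(action, dtype=np.float32)).clip(0.0, 500.0)
        reward = -float(abs(self.state[0] - 311.0))  # stay near an illustrative set point
        terminated = bool(self.state[0] >= 400.0)    # overheating ends the episode
        return self.state, reward, terminated, False, {}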

gRPC Spec for Implementation

Under the hood, the Composabl SDK uses gRPC to communicate with the Composabl platform.

To create a simulator that works with the Composabl API, you have to implement the following gRPC methods:

service Composabl {
  // Creates the environment with specified configurations.
  // - MakeRequest: Contains parameters to configure the environment.
  // - MakeResponse: Returns an identifier for the created environment and possibly other initial setup information.
  rpc make(MakeRequest) returns (MakeResponse) {}

  // Advances the environment by one timestep using the action provided in the request.
  // - StepRequest: Includes the action to be taken in the current state of the environment.
  // - StepResponse: Returns the new state of the environment, reward received, and a flag indicating if the episode has ended.
  rpc step(StepRequest) returns (StepResponse) {}

  // Resets the state of the environment, returning it to its initial conditions.
  // - ResetRequest: May include parameters for resetting to specific scenarios.
  // - ResetResponse: Provides the initial observation of the reset environment.
  rpc reset(ResetRequest) returns (ResetResponse) {}

  // Performs any necessary cleanup before the environment is closed.
  // - CloseRequest: May be empty or include specific closing instructions.
  // - CloseResponse: Acknowledges the environment has been successfully closed.
  rpc close(CloseRequest) returns (CloseResponse) {}

  // Generates a sample action from the environment's action space.
  // - ActionSpaceSampleRequest: May be empty or specify particular sampling criteria.
  // - ActionSpaceSampleResponse: Provides a sample action from the action space.
  rpc action_space_sample(ActionSpaceSampleRequest) returns (ActionSpaceSampleResponse) {}

  // Retrieves information about the environment's action space.
  // - ActionSpaceInfoRequest: May be empty or include parameters for the information request.
  // - ActionSpaceInfoResponse: Returns detailed information about the action space.
  rpc action_space_info(ActionSpaceInfoRequest) returns (ActionSpaceInfoResponse) {}

  // Retrieves information about the environment's observation space.
  // - ObservationSpaceInfoRequest: May be empty or include parameters for the information request.
  // - ObservationSpaceInfoResponse: Returns detailed information about the observation space.
  rpc observation_space_info(ObservationSpaceInfoRequest) returns (ObservationSpaceInfoResponse) {}

  // Sets the current scenario for the environment.
  // - SetScenarioRequest: Includes parameters defining the scenario to set.
  // - SetScenarioResponse: Acknowledges the scenario has been set.
  rpc set_scenario(SetScenarioRequest) returns (SetScenarioResponse) {}

  // Retrieves the current scenario of the environment.
  // - GetScenarioRequest: May be empty if simply retrieving the current scenario.
  // - GetScenarioResponse: Returns details of the current scenario.
  rpc get_scenario(GetScenarioRequest) returns (GetScenarioResponse) {}

  // Sets the render mode of the environment.
  // - SetRenderModeRequest: Includes parameters for the desired render mode.
  // - SetRenderModeResponse: Confirms the render mode has been set.
  rpc set_render_mode(SetRenderModeRequest) returns (SetRenderModeResponse) {}

  // Retrieves the current render mode of the environment.
  // - GetRenderModeRequest: May be empty if simply querying the current mode.
  // - GetRenderModeResponse: Returns the current render mode.
  rpc get_render_mode(GetRenderModeRequest) returns (GetRenderModeResponse) {}

  // Retrieves the current render of the environment.
  // - GetRenderRequest: May include parameters specifying the render details.
  // - GetRenderResponse: Provides the current render of the environment.
  rpc get_render(GetRenderRequest) returns (GetRenderResponse) {}
}

After making sure that your simulator is compatible with the Composabl platform, you can proceed to the next section.

Docker


  • Navigate to the folder where your simulator is located and take a look at what is inside it.

  • After that, ensure that you have Docker installed. You can check if Docker is installed by running the following command:

    docker --version
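
    If Docker is installed, you should see the Docker version. If not, you can install Docker by following the instructions on the official Docker website.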

DockerHub
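
Then, before building the image, ensure that you have a Docker Hub account. If you don't have one, you can create one by going to the Docker Hub website.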


  • To log in to Docker Hub, run the following command:

    docker login

    You will be prompted to enter your Docker Hub username and password. After that, you should see a message saying that you are logged in.

  • Now, we can build the Docker image of the simulator. To do so, run the following command:

    docker build -t <your-docker-hub-username>/<simulator-name> .

    This command will build the Docker image of the simulator. The -t flag is used to tag the image with the name <your-docker-hub-username>/<simulator-name>. The . at the end of the command indicates that the Dockerfile is in the current directory.

  • After building the image, you can check if it was built successfully by running the following command:

    docker images

    And then push the image to Docker Hub:

    docker push <your-docker-hub-username>/<simulator-name>

Composabl UI


After that, you can go to the Composabl UI and upload your simulator. To do so, follow the steps below:
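
  • Go to the Composabl UI by accessing the following link: https://app.composabl.com/. You'll be asked to log in and then redirected to your dashboard page.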

  • Then, on the left sidebar, click on the "Simulators" tab. You should see a list of simulators that are already available on the platform.

    On the top right corner, you should see a button to "New Simulator". Click on it.

  • A pop-up will appear, asking you to select between "Internal" and "External" simulators. Select "External".

  • Then, fill in the Title and Description of the simulator. We suggest a short name for the Title and a more detailed description for the Description.

  • After clicking Next Step, a brief tutorial will open. Read it carefully and then click on "Next Step" again.

  • After that, you can fill the Docker Image field with the name of the Docker image you pushed to Docker Hub <your-docker-hub-username>/<simulator-name>.

    If the image is public, no more fields are needed. If the image is private, you need to fill the Docker Username and Docker Password fields with your Docker Hub username and password, respectively. Then, click on "Validate and next step".

Conclusion

If you've followed all the steps correctly, you should have successfully uploaded your simulator to the Composabl UI. You can now use your simulator to train agents and run simulations on the platform. If you have any questions or need help, feel free to reach out to us.

Connecting to Agent System Runtime and Plotting Results of Agent System Operations

In this tutorial, we will cover how to connect to the agent system runtime, load a pre-trained agent system, run inference, and visualize the results in a production-like environment. The provided script, agent_inference.py, is a key component that demonstrates connecting to the Composabl agent system runtime, initializing the environment, and plotting agent system operation results.


Step 1: Understanding agent_inference.py

The script agent_inference.py connects to the runtime, loads a pre-trained agent system, connects to a local simulation, collects sensor data from the sim, and plots the results. Here is an outline of the core steps in the process:

  1. Start Runtime and Load Agent System: The script initializes the trainer and loads a pre-trained agent system from a model folder.

  2. Set Up the Simulation Environment: It connects to a simulation environment.

  3. Run Inference: The pre-trained agent system interacts with the simulation to perform inference (decisions), collecting observations and giving actions at each step.

  4. Collect Data and Plot Results: Sensor data and actions are collected in a Pandas DataFrame, and the results are plotted using Matplotlib to visualize how the agent system is performing over time in a production-like environment.


Step 2: Connecting to the Runtime and Loading the Agent System

The first task is to connect to the Composabl runtime and load the pre-trained agent system. This is accomplished using the Trainer and Agent classes. The agent system's model is loaded from the directory where the model was saved during training.
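
async def run_agent():
    # Start Runtime
    trainer = Trainer(config)

    # Load the pre-trained agent
    agent = Agent.load(PATH_CHECKPOINTS)

    # Prepare the agent for inference
    trained_agent = await trainer._package(agent)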

In this code:

  • Trainer(config) initializes the runtime with a configuration file.

  • Agent.load(PATH_CHECKPOINTS) loads the saved agent from the specified checkpoint directory.

  • trainer._package(agent) prepares the agent for inference by packaging it.


Step 3: Connecting to the Simulation Environment

Next, we connect the agent system to the simulation environment. The make() function creates a connection to the local simulator, and the environment is initialized.
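
    # Inference
    print("Creating Environment")
    sim = make(
        run_id="run-benchmark",
        sim_id="sim-benchmark",
        env_id="sim",
        address="localhost:1337",
        env_init={},
        init_client=False
    )

    print("Initializing Environment")
    await sim.init()
    print("Initialized")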

In this code:

  • The simulator is configured to run locally (localhost:1337); you must start it locally and manually before running the script.

  • The environment is initialized with sim.init(), and the agent system is connected to it.


Step 4: Setting the Scenario and Running Inference

After connecting to the simulator, you need to set up the specific scenario that the agent system will operate in. This scenario determines the environment's initial state.

With the environment set, the agent can now run inference for a set number of iterations. At each iteration, the agent observes the environment, takes an action, and collects the results (observations and rewards). This is done in a loop.
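
    # Set scenario
    noise = 0.0
    await sim.set_scenario(Scenario({
        "Cref_signal": "complete",
        "noise_percentage": noise
    }))
    obs_history = []
    df = pd.DataFrame()
    print("Resetting Environment")
    obs, info = await sim.reset()
    obs_history.append(obs)
    action_history = []

    for i in range(90):
        action = await trained_agent._execute(obs)  # Get action from agent
        obs, reward, done, truncated, info = await sim.step(action)  # Step the environment

        # Create a temporary DataFrame for the current observation
        df_temp = pd.DataFrame(columns=[s.name for s in sensors] + ['time'], data=[list(obs) + [i]])
        # Concatenate the new data to the existing DataFrame
        df = pd.concat([df, df_temp])

        obs_history.append(obs)
        action_history.append(action)

        if done:
            break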

In each iteration:

  • The agent system performs an action based on the current observations.

  • The environment advances one step with sim.step(action), and the agent receives a new observation and reward.

  • Sensor data and actions are logged into a Pandas DataFrame for later analysis.


Step 5: Saving Data and Plotting Results

Once the inference loop is complete, the collected data is saved, and the results are visualized. The results are plotted using Matplotlib.
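
    # Save the DataFrame to a pickle file for later use
    df.to_pickle(f"{PATH_HISTORY}/inference_data.pkl")

    # Plot results
    plt.figure(figsize=(10, 5))

    # Plot Temperature Controller Data (Tc)
    plt.subplot(3, 1, 1)
    plt.plot(df.reset_index()['time'], df.reset_index()['Tc'])
    plt.ylabel('Tc')
    plt.legend(['Tc'], loc='best')
    plt.title(f'Agent Inference DRL - Noise: {noise}')

    # Plot Temperature and Reference Temperature (T, Tref)
    plt.subplot(3, 1, 2)
    plt.plot(df.reset_index()['time'], df.reset_index()['T'])
    plt.plot(df.reset_index()['time'], df.reset_index()['Tref'], 'r--')
    plt.ylabel('Temp')
    plt.legend(['T', 'Tref'], loc='best')

    # Plot Concentration and Reference Concentration (Ca, Cref)
    plt.subplot(3, 1, 3)
    plt.plot(df.reset_index()['time'], df.reset_index()['Ca'])
    plt.plot(df.reset_index()['time'], df.reset_index()['Cref'], 'r--')
    plt.legend(['Ca', 'Cref'], loc='best')
    plt.ylabel('Concentration')
    plt.xlabel('Iteration')

    # Save plot
    plt.savefig(f"{PATH_BENCHMARKS}/inference_figure.png")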

This code generates three subplots:

  1. Temperature Controller (Tc) over time.

  2. Temperature (T) and Reference Temperature (Tref) over time.

  3. Concentration (Ca) and Reference Concentration (Cref) over time.

The plots provide a visual representation of the agent system's performance during the simulation. Finally, the figure is saved as inference_figure.png in the benchmarks directory.


Step 6: Running the Script

To run the script, execute agent_inference.py in your terminal:
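
python agent_inference.py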


Conclusion

In this tutorial, we demonstrated how to:

  • Connect a pre-trained Composabl agent system to a runtime and simulation environment.

  • Set up a scenario and run inference.

  • Collect observations and actions, and plot the results using Matplotlib.

By following these steps, you can visualize the performance of your agent system and gain insights into how it interacts with the environment over time.

Set KPI and ROI

The Composabl benchmarking feature allows you to compare the performance of different multi-agent systems against key performance indicators (KPIs). This tool helps you evaluate your agent system's effectiveness, track its improvements, and calculate potential return on investment (ROI).

Benchmarks are generated only after training is complete. You won't see benchmark data while training progresses, even for long-running jobs. If a training job fails, no benchmark data will be generated.

Note: Benchmarking does not influence agent system training, goals, or rewards. It is purely an analysis tool that helps you answer the question: "Based on how this agent system was trained and how we believe it should perform in real life, how much money is this agent system generating?"

Getting Started

Define the KPI for Your Project

The key performance indicator (KPI) for your project is the top-level objective that you will use to compare and evaluate your multi-agent systems' performance.

To define the KPI for your project, choose the sensor variable that represents the metric that best indicates performance success. For most processes, this is a metric with a clear business impact, such as product yield or energy use.

Setting up the KPI

To set up the KPI:

  1. Navigate to the Benchmark page from the main dashboard

  2. Click the "Settings" button in the top right of the KPI Performance Metrics section

  3. In the "Set up KPI" section, configure the following:

    • Sensor Name: Select a sensor or perceptor variable to track (e.g., "Eps_Yield")

    • Target Value: Set the desired target for this KPI (e.g., 0.33)

    • Benchmark Unit: Select the unit of measurement (e.g., "% of Max Theoretical Yield"). Note that this is for UI readability only and doesn't affect calculations.

Setting up ROI Calculations

Return on Investment (ROI) calculations help quantify the financial impact of your agent systems' performance. The ROI is calculated based on the monetary value of the difference between your target value and the actual value of the KPI generated by that agent system, minus training costs.

  1. In the Settings modal, navigate to the "Enter ROI criteria" section

  2. Define the conditions for calculating ROI:

    • When: Select the metric to track (e.g., "Eps_Yield")

    • Condition: Choose how the metric changes (e.g., "increases by")

    • Percentage: Enter the percentage change that matters (e.g., 2%)

    • Value: Set the financial value of this change (e.g., $1,000,000)

    • Period: Select the time period (e.g., "per year")

For example: "When Eps_Yield increases by 1% of max theoretical yield, it is worth $1,000,000 per year."

You can change KPI and ROI settings at any time after training completes, and the benchmark results will be automatically recalculated. This allows you to explore different business scenarios without retraining your agent systems.

Reading the Benchmarking Dashboard

Understanding the Overview Panel

At the top of the benchmarking page, you'll find an Overview panel containing three key metrics:

  • Highest performing Agent System: Displays the outcome of the highest performing agent system

  • Lowest performing Agent System: Displays the outcome of the lowest performing agent system

  • ROI: Shows the calculated financial return based on your KPI settings

Each metric displays both the value and the relevant unit (e.g., "% of Max Theoretical Yield").

Bar Graph Visualization

The bar graph in the KPI Performance Metrics section provides a visual comparison of your agent systems:

  • Each bar represents an agent system's performance on your selected KPI (all bars correspond to the same KPI)

  • The height of the bar indicates the specific value of the KPI for that agent system (Y-axis)

  • Horizontal dashed lines show benchmark averages

  • The actual value of the KPI for each agent system is shown above the respective bar

Performance Ranking Table

Below the graph is a detailed table showing:

  • Agent System Number: Sequential ID for each agent system

  • Agent System Name: Name or description of the agent system

  • ROI: Calculated financial return based on the agent system's performance

    • Green arrows (↑) indicate positive ROI with the percentage and absolute value

    • Red arrows (↓) indicate negative ROI with the percentage and absolute value

  • Eps_Yield (variable) per % of Max Theoretical Yield: Performance metric with indicators for highest agent system performers

Connect Runtime Container to Your Operation

This tutorial explains the steps for building the Docker container, deploying the agent system, and accessing the runtime for inference or interaction.


Tutorial: Accessing the Agent System Runtime After Deploying to Docker

Once you have packaged and deployed your agent system inside a Docker container, the next step is accessing its runtime for operations like model inference. This tutorial will guide you through the process of building and running the Docker container and then connecting to the agent system's runtime for further interactions.


Step 1: Preparing the Dockerfile and Environment

To deploy the agent system to Docker, we need to first create a Dockerfile. The Dockerfile will package the necessary runtime, model, and environment for the agent system.

  1. Dockerfile Setup: Your Dockerfile should contain the following key components:

    • Base Image: Use a Python base image (or any base that supports the necessary libraries).

    • Copy Model Files: Copy the pre-trained model (e.g., .gz file) to the container.

    • Install Dependencies: Install any required Python libraries (such as aiohttp or other packages the agent system needs).

Here’s an example Dockerfile:
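
# Use an official Python runtime as the base image
FROM python:3.10-slim

# Set the working directory
WORKDIR /usr/src/app

# Copy the necessary files into the Docker image
COPY . .

# Install any dependencies specified in requirements.txt
RUN pip install --no-cache-dir -r requirements.txt

# Expose port 8000 for the HTTP server
EXPOSE 8000

# Command to run the server when the container starts
CMD ["python", "agent_inference.py"]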


Step 2: Building the Docker Image

  1. Building the Image: You can build the Docker image by running the following command in the terminal. This will take the Dockerfile and the associated files (like the pre-trained model) and create an image.
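
docker build -t my-agent-runtime .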

  • The -t flag allows you to tag the image (my-agent-runtime), which makes it easier to reference later.

  • Make sure that the model file (agent.gz) and all relevant scripts are reachable within the Docker context (i.e., the directory from which you are building).

  2. Checking the Image: Once the build is complete, you can verify that the image was created successfully by running:
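
docker images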


Step 3: Running the Docker Container

Now that the image is built, the next step is to run it in a container. You will run the Docker container in an interactive mode to access the runtime.
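
docker run -it -p 8000:8000 my-agent-runtime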

  • -it: Runs the container interactively.

  • -p 8000:8000: Maps port 8000 from the container to port 8000 on your local machine so that you can access the HTTP server for the agent system runtime.

The HTTP server should now be up and running within the container, ready to handle model inference or other tasks.


Step 4: Accessing the Agent System Runtime

With the Docker container running, you can now connect to the agent system's runtime, which is exposed as an HTTP server. You can access it through a POST request for model inference or other operations.

  1. Sending Requests to the Agent System: You can send a POST request to the running server using a tool like curl, Postman, or any Python HTTP library (such as requests).

Here’s an example using curl:
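
curl -X POST http://localhost:8000/infer -d '{"input_data": "your_input_here"}'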

This request will:

  • POST data to the /infer endpoint on localhost:8000, which is being forwarded from the Docker container.

  • The agent system will handle the request, infer the model, and return the result.

  2. Interacting with the Agent System: If you prefer to interact with the agent system directly, you can also enter the container’s interactive mode and run commands.
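
docker exec -it <container_id> bash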

This will open a shell inside the running Docker container, allowing you to execute any runtime commands manually.


Step 5: Automating the Process

For convenience, you can automate the entire process of building the image, running the container, and interacting with the agent system by creating a script.

Here’s a basic example of an automation script:
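
#!/bin/bash

# Build the Docker image
docker build -t my-agent-runtime .

# Run the Docker container
docker run -it -p 8000:8000 my-agent-runtime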

Save this as run_agent.sh, and then execute it:
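
bash run_agent.sh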

This script will:

  • Build the Docker image.

  • Run the container, mapping the necessary port and exposing the HTTP server for inference.


Step 6: Troubleshooting and Debugging

If the container fails to start, or if the server doesn't respond, you can debug the container by checking the logs:
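
docker logs <container_id>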

This command displays the output of the running container, which can help diagnose issues such as missing dependencies or server errors.


Conclusion

In this tutorial, we walked through the process of:

  • Building a Docker image with your agent system and its runtime.

  • Running the Docker container interactively to expose the agent’s HTTP server.

  • Accessing the agent system runtime by sending HTTP requests for inference or other tasks.

By following these steps, you can deploy and interact with your Composabl agent system in a Dockerized environment.

Manual

Overview

Due to the many options available to you for installing Kubernetes clusters, this document will not go into the specifics of setting up the cluster. Rather, it will provide you with guidance and requirements for your cluster.

Nodes

Depending on whether you want to use GPUs or not, you need the following nodes:

Nodes that are always required:

  1. "main": This means nodes to run the control plane. The Composabl controller does not interact with these nodes, so they should be provisioned as recommended by the Kubernetes distribution you use.

  2. "composabl": This node or nodes will be where the Composabl controller and Historian software are scheduled.

  3. "envrunners": These nodes will handle training workloads. If you're not using GPUs, all training will be done on these nodes. If you are, these nodes will manage the communication with the simulators, and can be reduced in size

  4. "simscpu": These nodes are where the simulators will be scheduled. Sizing depends on the simulator.

If you want to use GPU training, you need the following node pool:

  5. "learners": These nodes with GPUs will accelerate the learning step of the training process.

If your simulator can be accelerated using a GPU, you can add the final node pool:

  6. "simsgpu": These nodes will run simulators, assigning a GPU to them.
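
A note on GPUs: Currently, only Nvidia GPUs are supported. The cluster must have the nvidia-gpu-operator installed for training on GPU to be enabled.

Whether or not you use autoscaling using cluster-autoscaler, each node type must be sized accordingly.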

1. Sizing

  1. main: As required by your Kubernetes distribution

  2. composabl: In total, 16GB of memory and 4 CPU, with at least one node having 8GB of memory.

  3. envrunners: If not using GPUs, we recommend 8 CPU and 8 or 16 GB of memory. In any case, the number of simulators that can be managed by each envrunner instance depends on the number of CPUs.

  4. simscpu: The sizing of these nodes depends on the resource requirements of your simulator

  5. learners: These nodes should have 1 Nvidia GPU. Other resources can be limited - 2 CPU and 8GB of memory is sufficient

  6. simsgpu: As with simscpu, depends on the simulator requirements.

2. Labels

All groups of nodes must be labeled accordingly. The name, as given in the sizing guide, is the name you should set as the agentpool label.

You may be able to define this during your cluster setup, but if not, you can use the following commands:
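
kubectl label node <my-composabl-node> agentpool=composabl --overwrite
kubectl label node <my-envrunners-node> agentpool=envrunners --overwrite
kubectl label node <my-simulator-node> agentpool=simscpu --overwrite
kubectl label node <my-learners-node> agentpool=learners --overwrite
kubectl label node <my-simulator-gpu-node> agentpool=simsgpu --overwrite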

Replace the values in between <> with the name of the nodes you'd like to assign to a specific pool.

Storage

The components also need access to (semi)persistent, shared storage. This section will detail the types and amount of storage needed.

The cluster needs the following PersistentVolumeClaims in the composabl-train namespace:

  1. pvc-controller-data with a size of ±1Gi and ReadWriteOnce (or better) accessMode. When using Azure, you will need to set the nobrl mountOption for this PVC, as this is required for the Composabl controller to function.

  2. pvc-training-results with a suitable size - this is where your final agent system data will be stored before it is uploaded to the No-code application. It needs accessMode to be ReadWriteMany (RWX). A good initial size is to match historian-tmp.

  3. historian-tmp is used as temporary storage for historian data. It needs to have an accessMode of ReadWriteOnce and the size will depend on the length of your training sessions. We recommend starting with 5Gi.

The size of pvc-training-results and historian-tmp depends on the number and size of training jobs you want to run simultaneously on your cluster. If you plan on running long-lived training sessions with many cycles, you may want to increase the capacity of both.

Private image registry

If you want to use a private registry for simulator images, you will need to set up this private registry yourself, and make sure the cluster is able to pull images from this registry.

Next steps
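
Once your cluster is running, and you have verified your setup is working, you can continue to Installing Composabl.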

Configure API Connections to Third-Party Software as Skill Agents

In this tutorial, we will walk through how to set up a skill agent in Composabl that integrates with a third-party API. This type of integration allows your agent system to communicate with external systems, such as machine performance APIs, and use the data to make informed decisions.

We will create a programmed skill agent that connects to a mock third-party API, processes its response, and returns an action based on the data received. This tutorial will also touch on orchestrating this skill agent within your agent system.


Step 1: Defining the Programmed Skill Agent

A programmed skill agent in Composabl is created by specifying the logic for interacting with the external API and processing the response. In this case, we will create a simple API connection to a fake endpoint that returns data about machine performance. The agent system will act based on the information received.

1.1. Creating the API Integration Skill Agent

We’ll define a programmed skill agent for making the API request. Here's an example of how to define the skill agent using a controller function that calls the API and processes the response.
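
import requests
from composabl import SkillController

# Define the programmed skill
class ThirdPartyAPISkill(SkillController):

    def __init__(self, *args, **kwargs):
        self.api_url = "https://api.example.com/machine-status"

    async def compute_action(self, obs, action):
        # Send sensor data to the third-party API
        response = self._call_api(obs)
        # Process the response and return an action
        action = self._process_response(response)
        return action

    def _call_api(self, observation):
        try:
            response = requests.post(
                self.api_url,
                json=observation,
                headers={'Content-Type': 'application/json'}
            )
            response.raise_for_status()
            return response.json()

        except requests.RequestException as e:
            print(f"API call failed: {e}")
            return None

    def _process_response(self, response):
        if not response:
            # Default action
            return 0.0

        action = float(response.get("action"))
        reason = response.get("reason", "No reason provided")

        print(f"Action: {action} - Reason: {reason}")
        return action

    async def transform_sensors(self, obs):
        return obs

    async def filtered_sensor_space(self):
        return ['sensor1', 'sensor2', 'sensor3']

    async def compute_success_criteria(self, transformed_obs, action):
        return False

    async def compute_termination(self, transformed_obs, action):
        return False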

In this example:

  • The compute_action() method sends observation data (e.g., from sensors) to a third-party API.

  • The _call_api() function makes the API call and handles any errors that might occur.

  • The _process_response() function processes the response from the third-party API and determines the appropriate action for the agent system to take based on the data.

Step 2: Adding the Programmed Skill Agent to the Agent System

2.1. Adding the Skill Agent to Composabl UI

Once the skill agent is defined, you can add it to your agent system in the UI using the methods below:

  1. Create a new Skill Agent using the Composabl CLI with a given name, description, and implementation type, which in this case will be a controller. The name will be "third_party_api_skill".
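
composabl skill new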

  2. Change the controller.py code to use the class that you created: ThirdPartyAPISkill(). Change the pyproject.toml file to include your class ThirdPartyAPISkill in the entrypoint, along with its name:
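
[project]
name = "Third Party API Skill Agent"

entrypoint = "third_party_api_skill.controller:ThirdPartyAPISkill"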

  3. Publish the Skill Agent to the UI:
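
composabl login
composabl skill publish third_party_api_skill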

Select your organization and project that you want to publish it to.

Reference: https://docs.composabl.io/changelog/0-8-0.html

2.2. Adding the Skill Agent to Composabl SDK

Once the skill agent is defined, you can add it to your agent system using the add_skill() SDK method. This allows the agent system to execute the API connection skill agent when necessary.

Here’s how to add the ThirdPartyAPISkill to the skill agent:
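
# Define and add the third-party API skill agent
third_party_skill = Skill("third_party_api", ThirdPartyAPISkill)
agent.add_skill(third_party_skill)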

By importing and creating the class with SkillController, you are indicating that this skill agent is programmed and does not require training. It will use predefined logic to interact with the third-party API and make decisions based on the data returned.


Conclusion

By following these steps, you’ve successfully defined and integrated a programmed skill agent that communicates with a third-party API into your Composabl agent system. The agent system can now take actions based on external data and dynamically respond to scenarios.

This approach allows agent systems to interface with a wide range of external systems, from monitoring equipment to adjusting machine settings, all through programmable skill agents.

Orchestration of skill agents through selectors ensures the agent system executes the correct skill agents at the right time, whether the skill agents are learned or programmed.

Creating a Cluster

Introduction
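
Composabl agent systems use clusters to train at scale. A cluster is a collection of computers that work on large tasks simultaneously. This provides enough compute to complete large training tasks as efficiently as possible.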

To train Composabl agent systems on a cluster, you need to install the Composabl controller onto a compatible Kubernetes cluster.

This document will guide you through setting up this cluster using various options:

  • Azure: Set up a cluster through our Azure Marketplace Composabl Training Cluster offer

  • Automated: Set up a compatible Kubernetes cluster on Azure through the use of Pulumi (IaaS)

  • Manually: Set up a compatible Kubernetes cluster through any tool and infrastructure of your choice

Components

The entire Composabl controller installation consists of two main components:

  • Controller: This component connects to the Composabl No-code UI and allows you to initiate training on your cluster.

  • Historian: The historian component records detailed statistics regarding the training process of your agent system. After training, these will be available alongside the agent system in the No-code UI.

RBAC

The installation of Composabl will introduce 2 namespaces, composabl-train and composabl-sims, for training workers and simulators respectively.

For normal operation, a series of permissions are required by the ServiceAccounts of the Composabl installation.

Cluster Setup

We currently have the following guides to setting up your own cluster:
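
  • Azure, our recommended way of setting up a training cluster

  • Automated, which describes the use of Pulumi for setting up a compatible cluster on Azure

  • Manual, if you're experienced in setting up a cluster, or have existing standard tooling for setting up Kubernetes clusters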

Submit a Training Job through the UI

About Cluster Training

Composabl offers two options for cluster training:

  • Use Composabl's Training as a Service offering to train on our clusters

  • Use your own compute clusters through Azure, AWS, or another provider

Ensure that Your Agent System is Ready

Before you submit your job for training on a cluster, make sure that your agent system is fully configured and all the parameters have been set. That means checking all the agent system components:

  • Goals

  • Perceptors

  • Orchestrators, including goals for learned selectors and scenarios

  • Skill agents, including goals for learned skills

  • Scenarios, including scenario flows

Any component of the agent system with a warning sign is not fully configured and not ready for training. Go back to edit that component and make sure that all of the fields are filled out.

Choose the Right Cluster

You can train on your own cluster or on Composabl’s clusters using training as a service (TaaS) credits. If you want to use Composabl’s clusters, ensure that you have credits available.

To train on your own cluster, make sure that you have set your cluster up and installed Composabl successfully.

Click Train and then choose the cluster option in the menu. You will then have the option to configure your training session.

Configure Your Training Session

Training session configuration options are the same whether you’re using TaaS or training on your own cluster.

Set the Number of Training Cycles

A training cycle is a complete pass through the entire task, with the agent system continuing until it reaches success or some other stop criteria. Your agent system will train each skill one at a time for the selected number of training cycles, starting from the bottom of the agent system design.

A training cycle involves about 1,000 agent decisions. Depending on the complexity of the task, agent systems may need to complete anywhere between 100 and several thousand training cycles to become proficient.

Set the Number of Simulators

You can run multiple simulators in parallel to speed up training. If you run more than one simulator during a training, the number of training cycles selected will be multiplied by the number of simulators, so 5 training cycles with 3 simulators selected would lead to 15 training cycles total.

You can use the Advanced Configuration to choose how powerful each machine running a simulator should be. If you choose Small, each training cycle selected will result in one training cycle completed. If you choose GPU, you will get 4 completed training cycles for each training cycle selected.

More training cycles running simultaneously will speed up training, but also increase costs. How long your training takes also depends on the complexity of your agent system and your simulator.

Start Training

When you have configured your settings correctly, click Start Training.

You will then be taken to the Training Sessions page. There you can follow the agent system training progress by viewing the real-time plots or the console output.

Note that it will take a few minutes for the visualization to begin.

import requests 
from composabl import SkillController

# Define the programmed skill 
class ThirdPartyAPISkill(SkillController): 

    def __init__(self, *args, **kwargs):
        self.api_url = "https://api.example.com/machine-status" 

    async def compute_action(self, obs, action):
        # Send sensor data to the third-party API 
        response = self._call_api(obs) 
        # Process the response and return an action 
        action = self._process_response(response) 
        return action
  
    def _call_api(self, observation): 
        try: 
            response = requests.post( 
                self.api_url,  
                json=observation,  
                headers={'Content-Type': 'application/json'} 
            ) 
            response.raise_for_status() 
            return response.json() 

        except requests.RequestException as e: 
            print(f"API call failed: {e}") 
            return None 

    def _process_response(self, response): 
        if not response:
            # Default action 
            return 0.0

        action = float(response.get("action"))
        reason = response.get("reason", "No reason provided") 

        print(f"Action: {action} - Reason: {reason}") 
        return action

    async def transform_sensors(self, obs):
        return obs

    async def filtered_sensor_space(self):
        return ['sensor1', 'sensor2', 'sensor3']

    async def compute_success_criteria(self, transformed_obs, action):
        return False

    async def compute_termination(self, transformed_obs, action):
        return False
composabl skill new
[project]
name = "Third Party API Skill Agent"

entrypoint = "third_party_api_skill.controller:ThirdPartyAPISkill"
composabl login
composabl skill publish third_party_api_skill
# Define and add the third-party API skill agent
third_party_skill = Skill("third_party_api", ThirdPartyAPISkill) 
agent.add_skill(third_party_skill) 

Publish Skill Agent Components to the UI

Once you have created new skill agent components in the SDK or configured existing algorithms and models, you can publish them with a simple CLI workflow to make them available in the UI to drag and drop into agent system designs.

Agent system components are published into projects. They will then be available to use for all the agent systems you create for that use case.

Publish Skill Agents and Orchestrators

  1. Log in to Composabl: composabl login. The system will redirect you to the UI to enter your credentials and log in.

  2. Return to the command line and navigate to the folder containing the skill agent or orchestrator you want to publish.

  3. Publish the skill agent or orchestrator: composabl skill publish or composabl selector publish.

  4. Select your organization from the dropdown menu.

  5. Select your project from the dropdown menu. Save your new skill agent or orchestrator there.

  6. Your skill agent or orchestrator will begin publishing. When the process is complete, go to the UI, navigate to the Agent Orchestration Studio, and refresh your browser to see your new skill agent or orchestrator in the sidebar.
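
Putting the steps together, a typical publish session from the terminal might look like this (the folder and skill names are illustrative):

cd <folder containing your skill agents>
composabl login
composabl skill publish third_party_api_skill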

Publish Perceptors

  1. Log in to Composabl: composabl login. The system will redirect you to the UI to enter your credentials and log in.

  2. Navigate to the perceptors folder (one level above the individual perceptor folder).

  3. Publish the perceptor: composabl perceptor publish perceptor_name

  4. Select your organization from the dropdown menu.

  5. Select your project from the dropdown menu. Save your new perceptor there.

  6. Your perceptor will begin publishing. When the process is complete, go to the UI, navigate to the Agent Orchestration Studio, and refresh your browser to see your new perceptor in the sidebar.

Connecting a Cluster

Introduction

This page covers manually installing the Composabl components onto the cluster.

Requirements

To install Composabl on your cluster, you'll need access to the Composabl No-Code UI and a valid license key. Additionally, to run the installation script, you need to have kubectl set up with access to the cluster.

Steps

  1. Log in to the No-Code app

  2. Navigate to the Clusters page

  3. Click "Create"

  4. In the pop-up, choose a display name for your cluster

  5. Select the cloud vendor you're using, or "custom" if none

  6. Copy the command

  7. Run the command in a terminal where you have kubectl access to the cluster

  8. When the command exits without errors, the installation is complete

  9. Refresh the Clusters page. Your custom cluster should now say "Connected" and list its specifications.

You can now select the cluster from the dropdown menu when training.

View Training Session Information

The Training Sessions page allows you to view agent systems' training in real time and analyze their performance in training.

When you begin a training session, the graphs for each trained skill agent will begin to generate. You can watch your skill agents learn by viewing the graphs, or you can click on the Console Output tab for detailed information about each training decision.

The shape of the curve can help you understand how your skill agents are learning. When the curve plateaus, that usually means that the skill has been successfully trained and will not learn more. If the curve shows jagged ups and downs, then the skill isn't performing consistently and has more learning to do. Sometimes this is a sign that you should go back and adjust the training settings.

The training sessions page shows a list of all the training sessions for a project in a menu on the left of the screen, allowing you to jump between different agents, as well as different training sessions for the same agent system.

Analyze Data

Benchmark Testing and Data Generation

After training a multi-agent system in Composabl, the system automatically runs a series of standardized tests to evaluate its performance. This benchmarking process:

  1. Places the system in controlled testing environments

  2. Records detailed metrics at each step of operation

  3. Aggregates results to provide comprehensive performance statistics

The output of this testing process is compiled into a structured benchmark.json file, which contains rich performance data that can be analyzed to assess effectiveness, identify improvement opportunities, and compare different design approaches. This file is a performance record and a valuable analytics resource for optimizing your agent systems.

Downloading Benchmark Artifacts

To download benchmark data for further analysis:

  1. Navigate to the "Training Sessions" page

  2. Click the artifacts dropdown at the top right of the page for a trained system

  3. Select "Benchmark"

  4. The benchmark.json file will be saved to your local machine

Understanding the Benchmark.json File

The benchmark.json file contains structured data about the performance of a trained agent system. Here's how to interpret this file:

File Structure

{
  "skill-name": {
    "scenario-0": {
      "scenario_data": { ... },
      "episode-0": [ ... ],
      "aggregate": { ... }
    }
  }
}

Key Components

Scenario Data: Contains reference values for the scenario:

"scenario_data": {
  "sensor_one": {"data": 8.57, "type": "is_equal"},
  "sensor_two": {"data": 373, "type": "is_equal"}
}

Episode Data: Array of state-action pairs showing how the agent performed in each step:

[
  {
    "state": "{'sensor_one': array([311.2639], dtype=float32), ...}",
    "action": "[-1.253192]",
    "teacher_reward": 1.0,
    "teacher_success": false,
    "teacher_terminal": null
  },
  ...
]

Aggregate Statistics: Summary statistics for the entire benchmark:

"aggregate": {
  "mean": { ... },
  "medians": { ... },
  "std_dev": { ... },
  "max": { ... },
  "min": { ... }
}

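
Because benchmark.json is plain JSON, you can analyze it with standard Python tooling. The sketch below, which assumes the key layout shown above, computes the mean teacher_reward per episode:

import json

# Load the benchmark file downloaded from the Training Sessions page
with open("benchmark.json") as f:
    benchmark = json.load(f)

# Walk skills -> scenarios -> episodes and summarize the reward signal
for skill_name, scenarios in benchmark.items():
    for scenario_name, scenario in scenarios.items():
        for key, steps in scenario.items():
            if key.startswith("episode-"):
                rewards = [step["teacher_reward"] for step in steps]
                mean_reward = sum(rewards) / len(rewards) if rewards else 0.0
                print(f"{skill_name}/{scenario_name}/{key}: "
                      f"steps={len(rewards)}, mean reward={mean_reward:.3f}")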

Automated

Introduction

This guide covers setting up your Composabl training cluster using Pulumi, an Infrastructure as Code tool. This example uses Azure Kubernetes Service, but can be adapted to other supported providers.

Prerequisites

  1. An Azure subscription with sufficient permissions to create and update various resources

  2. A working installation of Pulumi, and a new Pulumi project set up as per the Pulumi documentation

  3. If you're following along in TypeScript, a working installation of NodeJS

Overview

We will be deploying the following resources to your Azure subscription:

  1. Resource group, containing all resources

  2. A container registry, to hold simulator images

  3. An AKS cluster

Resource group

The resource group will contain all resources. It is also what determines in what Azure location the resources will be deployed.

import * as pulumi from "@pulumi/pulumi";
import * as resources from "@pulumi/azure-native/resources/index.js";

const resourceGroup = new resources.ResourceGroup('my-resource-group-', {
  location: 'eastus'
});

export const rgName = pulumi.interpolate`${resourceGroup.name}`;

At the end, we export the name of the resource group (which will be randomized by Pulumi) for further use in our definition.

Container registry

The container registry is where you will be able to privately store your simulator docker images, if any.

import * as containerregistry from "@pulumi/azure-native/containerregistry/index.js";

const registry = new containerregistry.Registry("registry", {
  resourceGroupName: resourceGroup.name,
  sku: {
    name: "Basic",
  },
  adminUserEnabled: true,
});

export const registryName = pulumi.interpolate`${registry.name}`;

Kubernetes Cluster

The cluster is where both the Composabl components and your training will be running. This configuration is more complex, so additional information will be provided as comments in the typescript definition:

import * as containerservice from "@pulumi/azure-native/containerservice/index.js";

const k8sCluster = new containerservice.ManagedCluster("aks", {
  resourceGroupName: resourceGroup.name, // Here, we reference the resourceGroup we created earlier
  location: resourceGroup.location,

  dnsPrefix: "composabl-aks",
  kubernetesVersion: "1.31.1", // you can get supported versions using the Azure CLI: az aks get-versions -l <location> -o table - replace <location> with the location you set in your resourcegroup.
  enableRBAC: true,

  // Assign a managed identity to the cluster
  // Note: appMiAKS is a user-assigned managed identity assumed to be created
  // elsewhere in your Pulumi program (e.g. with azure-native's managedidentity module)
  identity: {
    type: "UserAssigned",
    userAssignedIdentities: [appMiAKS.id],
  },

  // Configure 3 pools
  // 1. Main (the kubernetes control plane nodes)
  // 2. Train (Composabl system components and training workers)
  // 3. Sims (Composabl simulators)
  agentPoolProfiles: [
    // The Main pool has 3 small nodes to act as a control plane
    {
      name: "main",
      count: 3,
      vmSize: "Standard_B2s", // (2 core, 4GB RAM, 0.041/hour)
      osType: "Linux",
      osSKU: "Ubuntu",
      mode: "System",
    }
  ],
  sku: {
    name: "Base",
    tier: "Standard"
  },
  // This part is optional unless you are using very large clusters with several hundred nodes.
  networkProfile: {
    networkPlugin: "azure",
    networkPolicy: "calico",
  }
});

// the "Composabl" agent pool will run the composabl system components (Controller, Historian)
const composablPool = new containerservice.AgentPool("composabl",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "composabl",
    count: 1,
    vmSize: "Standard_D4s_v3", // (4 core, 16GB RAM)
    osType: "Linux",
    osSKU: "Ubuntu",
    mode: "System",
    osDiskSizeGB: 100,
    osDiskType: "Premium_LRS",
  },
  { replaceOnChanges: ["vmSize"] }
);

// the Env Runners will contain the part of the SDK that deals with data gathering from the simulators
// If training using GPU is disabled, all training will happen on these nodes as well
const envrunnersPool = new containerservice.AgentPool("envrunners",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "envrunners",
    vmSize: "Standard_D8d_v4",
    count: 1,
    minCount: 1,
    maxCount: 10,
    enableAutoScaling: true,
    osType: "Linux",
    osSKU: "Ubuntu",
  },
  { replaceOnChanges: ["vmSize"] }
);

// The Sims-CPU pool will run all simulator instances
const simsCpuPool = new containerservice.AgentPool("simscpu",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "simscpu",
    vmSize: "Standard_D8d_v4",
    count: 2,
    minCount: 2,
    maxCount: 1000,
    enableAutoScaling: true,
    osType: "Linux",
    osSKU: "Ubuntu",
  },
  { replaceOnChanges: ["vmSize"] }
);

GPU Training and simulators

If you want to enable GPU training and GPU-enhanced simulators, you will also need to add the following pools.

In addition, you will need to install the nvidia-gpu-operator on the cluster, following the instructions on the project website. Finally, GPU_ENABLED must be set to true on the Composabl controller deployment, if it hasn't been already.

// The learners will run the learning part of the training, accelerated by GPU
const learnersPool = new containerservice.AgentPool("learners",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "learners",
    vmSize: "Standard_NC4as_T4_v3", // (4vCPU, 28GB RAM, 1GPU (Nvidia Tesla T4), 0.0570$/hour)
    count: 1,
    minCount: 1,
    maxCount: 10,
    enableAutoScaling: true,
    osType: "Linux",
    osSKU: "Ubuntu",
    osDiskSizeGB: 100,
    osDiskType: "Premium_LRS",
  },
  { replaceOnChanges: ["vmSize"] }
);

// Optional - if you also want to run Simulators on machines with GPUs, provision this pool as well:
const simsGpuPool = new containerservice.AgentPool("simsgpu",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "simsgpu",
    vmSize: "Standard_NC4as_T4_v3", // (4vCPU, 28GB RAM, 1GPU (Nvidia Tesla T4), 0.0570$/hour)
    count: 1,
    minCount: 1,
    maxCount: 10,
    enableAutoScaling: true,
    osType: "Linux",
    osSKU: "Ubuntu",
  },
  { replaceOnChanges: ["vmSize"] }
);

Notes:

  1. Autoscaling:

    • This template enables autoscaling to have the cluster automatically scale to the required size and back down afterward to reduce costs.

    • You can disable autoscaling by removing the minCount, maxCount, and enableAutoScaling properties, but you'll have to set the count value accordingly (see the sketch after these notes).

  2. vmSize: The vmSizes used above can be adjusted to instance types that better fit your needs.
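
For example, a fixed-size version of the simscpu pool above, with autoscaling disabled, might look like this (a sketch, not part of the original template):

// A fixed-size simulator pool: no minCount/maxCount/enableAutoScaling,
// so the count value alone determines the pool size
const simsCpuPoolFixed = new containerservice.AgentPool("simscpu",
  {
    resourceGroupName: resourceGroup.name,
    resourceName: k8sCluster.name,
    agentPoolName: "simscpu",
    vmSize: "Standard_D8d_v4",
    count: 4,
    osType: "Linux",
    osSKU: "Ubuntu",
  },
  { replaceOnChanges: ["vmSize"] }
);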


Deploy an Agent System in a Container

This tutorial explains how to access the agent system runtime after deploying it to Docker, covering the steps for building the Docker container, deploying the agent system, and accessing the runtime for inference or interaction.


Tutorial: Accessing the Agent System Runtime After Deploying to Docker

Once you have packaged and deployed your agent system inside a Docker container (https://docs.composabl.com/deploy-agents/deploy-an-agent-in-a-container), the next step is accessing its runtime for operations like model inference. This tutorial will guide you through the process of building and running the Docker container and then connecting to the agent system's runtime for further interactions.


Step 1: Preparing the Dockerfile and Environment

To deploy the agent system to Docker, we first need to create an image from the Dockerfile (https://docs.composabl.com/deploy-agents/deploy-an-agent-in-a-container). The Dockerfile packages the necessary runtime, model, and environment for the agent system.

Step 2: Building the Docker Image

  1. Building the Image: You can build the Docker image by running the following command in the terminal. This will take the Dockerfile and the associated files (like the pre-trained model) and create an image.

docker build -t composabl_agent_api .
  • The -t flag allows you to tag the image (composabl_agent_api), which makes it easier to reference later.

  • Make sure that the model file (agent.json) and all relevant scripts are reachable within the Docker context (i.e., the directory from which you are building).

  2. Checking the Image: Once the build is complete, you can verify that the image was created successfully by running:

docker images

Step 3: Running the Docker Container

Now that the image is built, the next step is to run it in a container. You will run the Docker container in an interactive mode to access the runtime.

docker run -e COMPOSABL_LICENSE="<your_license>" -it -p 8000:8000 composabl_agent_api
  • -it: Runs the container interactively.

  • -p 8000:8000: Maps port 8000 from the container to port 8000 on your local machine so that you can access the HTTP server for the agent system runtime.

  • -e COMPOSABL_LICENSE="<your_license>": Sets the environment variable that holds your Composabl license inside the container.

The HTTP server should now be up and running within the container, ready to handle model inference or other tasks.


Step 4: Accessing the Agent System Runtime

With the Docker container running, you can now connect to the agent system's runtime, which is exposed as an HTTP server. You can access it through a POST request for model inference or other operations.

  1. Sending Requests to the Agent System: You can send a POST request to the running server using a tool like curl, Postman, or any Python HTTP library (such as requests).

Here’s an example using curl:

curl -X POST http://localhost:8000/predict -H "Content-Type: application/json" -d '{"observation": {"T": 311.0, "Tc": 292.0, "Ca": 8.56, "Cref": 8.56, "Tref": 311.0, "Conc_Error": 0.0, "Eps_Yield": 0.0, "Cb_Prod": 0.0}}'

This request will:

  • POST data to the /predict endpoint on localhost:8000, which is being forwarded from the Docker container.

  • The agent system will handle the request, run model inference, and return the action as a result.
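
The same request can be sent from Python using the requests library (a sketch mirroring the curl example above):

import requests

# The same observation payload used in the curl example
observation = {
    "T": 311.0, "Tc": 292.0, "Ca": 8.56, "Cref": 8.56,
    "Tref": 311.0, "Conc_Error": 0.0, "Eps_Yield": 0.0, "Cb_Prod": 0.0,
}

# POST the observation to the agent system's /predict endpoint
response = requests.post(
    "http://localhost:8000/predict",
    json={"observation": observation},
)
print(response.json())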


Conclusion

In this tutorial, we walked through the process of:

  • Building a Docker image with your agent system and its runtime.

  • Running the Docker container interactively to expose the agent system’s HTTP server.

  • Accessing the agent system runtime by sending HTTP requests for inference or other tasks.

By following these steps, you can deploy and interact with your Composabl agent system in a Dockerized environment.


Industrial Mixer

Build four agent systems for a realistic use case with step by step instructions

[Tutorial figures: MPC controller performance results; diagram of the DRL agent; plan-execute agent performance tracking closely to the reference line; strategy pattern performance.]

Resolving Certificate Issues for Installing the Composabl SDK on WSL

If you use Windows Subsystem for Linux (WSL) on a corporate machine with internal security tools, such as Zscaler, you may encounter issues installing the SDK due to certificate replacement during HTTPS traffic inspection. Below is a method to resolve this issue by importing the appropriate certificates into WSL.

Step-by-Step Guide to Import Certificates into WSL

  1. Identify the Certificates

    • Open https://app.composabl.com in your browser and click on the padlock symbol next to the URL to view the site certificates. This method may vary slightly depending on your browser.

    • Export the certificates in the chain needed to trust the "app.composabl.com" certificate, which may be replaced by Zscaler or similar security tools. The intermediate and root certificates are often issued by your organization's IT department.

  2. Export the Certificates

    • Export the certificates in Base 64 encoded X.509 (PEM) format.

    • Make sure the exported certificate files have a .crt extension.

  3. Copy Certificates to WSL

    • Copy the exported certificates to /usr/local/share/ca-certificates in WSL. You can use any file transfer method between Windows and WSL.

  4. Update the Certificate Store

    • Run the following command to update the certificate store in WSL: sudo update-ca-certificates

  5. Verify the Certificate

    • You can verify if the certificates are correctly recognized by running: openssl s_client -connect api.composabl.com:443 -prexit

This command should connect and print the certificate chain and other relevant information.

  6. Removing Certificates

    • If you need to remove the certificates, delete them from /usr/local/share/ca-certificates and then run: sudo update-ca-certificates --fresh

This command will rebuild the certificate bundle from the remaining individual certificates.

  7. Alternative Method: Export from Windows Certificate Store

    The intermediate/root certificates may already be available in the Windows certificate store. You can identify and export the required certificates from there if needed.
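
Steps 3 through 5 condensed into a short terminal session (a sketch; the certificate filename is illustrative):

# Copy the exported certificate into the WSL trust store (filename is an example)
sudo cp corporate-root.crt /usr/local/share/ca-certificates/

# Rebuild the certificate bundle
sudo update-ca-certificates

# Verify that the certificate chain is now trusted
openssl s_client -connect api.composabl.com:443 -prexit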

By following these steps, you should be able to configure your WSL environment to trust certificates replaced by corporate security tools, enabling the installation of the SDK without issues.

Deploy an Agent System as an API

Deploying an agent system means exporting the trained agent system, loading it into your production environment, and then asking the agent system for decisions. In Composabl, the export is a JSON file called agent.json that contains everything you need to deploy your agent system. You can load the agent system file into your IT infrastructure in many ways.

This document will show how you can deploy your agent system as an API using Python and Flask.

Step 1: Accessing and Preparing the Files

To deploy the agent system as an API, we need the agent.json model, the agent_api.py script to start the API, and requirements.txt to install the packages. You can find a sample of these files in our GitHub repo: https://github.com/Composabl/examples.composabl.io/tree/main/deploy_agent

This is the structure needed for the API (matching the paths used in agent_api.py below):

deploy_agent/
├── agent_api.py
├── requirements.txt
└── model/
    └── agent.json

How to extract the agent.json

  1. Log into the Composabl UI (https://app.composabl.com/onboarding), train your agent system, and navigate to the training sessions section.

  2. Check the status of the agent system:

    • Green status (Done) indicates finished training.

  3. Download the agent system file:

    • The agent system file (a .gz compressed file).

    • Extract the agent system file agent.json to the model folder, as shown below.
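
For example, assuming the downloaded artifact is named agent.json.gz (the actual filename may differ):

# Decompress the downloaded artifact into the model folder (filename is illustrative)
gunzip -c agent.json.gz > model/agent.json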

Step 2: Get the API Python file (agent_api.py, shown in full below)

Step 3: Install the requirement packages

Run the following in your terminal:

It will install these packages:

  • composabl

  • flask[async]

  • numpy

Step 4: Export your license and start the API

To start Composabl API, you will need to export your license as an environment variable and then use Python to start the Flask API with your agent system.

Step 5: Test your API

After running the API, you can test it by opening a terminal and running the script below:

In the POST request, we pass the use case "observation" with sensor variables and their values and receive an action from the agent system. This example corresponds to the agent.json demo for Chemical Process Control.

import asyncio
import os

from composabl import Agent, Trainer
import numpy as np

from flask import Flask, request, jsonify

# Initialize Flask app
app = Flask(__name__)

# Global objects (initialized once)
trainer = None
trained_agent = None

license_key = os.environ["COMPOSABL_LICENSE"]

PATH = os.path.dirname(os.path.realpath(__file__))
PATH_CHECKPOINTS = f"{PATH}/model/agent.json"

# Initialize the runtime, load the model, and package it when the app starts
async def init_runtime():
    """
    Initializes the trainer and agent before the first request is processed.
    This sets up the AI model for inference, loading it from checkpoints and preparing the agent.
    """
    global trainer, trained_agent

    # Assuming 'config' is required to initialize the Trainer
    config = {
        "license": license_key,
        "target": {
            "local": {"address": "localhost:1337"}
        },
        "env": {
            "name": "sim-deploy",
        },
        "trainer": {
            "workers": 1
        }
    }

    # Initialize the Trainer with the config
    trainer = Trainer(config)

    # Load the agent from the given checkpoint path
    agent = Agent.load(PATH_CHECKPOINTS)

    # Package the agent for inference using the Trainer's _package function (asynchronously)
    trained_agent = await trainer._package(agent)


# Asynchronous POST route to receive observation and return action
@app.route('/predict', methods=['POST'])
async def predict():
    """
    Receives an observation through a POST request, processes it using the pre-trained agent,
    and returns the corresponding action.
    """
    global trained_agent

    # Check if the agent has been successfully initialized
    if not trained_agent:
        return jsonify({"error": "Agent not initialized"}), 500

    # Extract the observation from the request's JSON body
    obs = request.json.get("observation")

    # Validate that the observation was provided in the request
    if obs is None:
        return jsonify({"error": "No observation provided"}), 400

    # Convert the observation values into a flat float array
    obs = np.array([float(x) for x in dict(obs).values()])

    # Asynchronously process the observation to generate the action
    action = await trained_agent._execute(obs)

    # Return the generated action in the response
    return jsonify({"action": str(action)})


if __name__ == "__main__":
    # Run the Flask application with async support on localhost, port 8000
    loop = asyncio.get_event_loop()
    loop.run_until_complete(init_runtime())

    app.run(host="0.0.0.0", port=8000, debug=True)
pip install --no-cache-dir -r requirements.txt
export COMPOSABL_LICENSE='<your license here>'

python agent_api.py
curl -X POST http://localhost:8000/predict -H "Content-Type: application/json" -d '{"observation": {"T": 311.0, "Tc": 292.0, "Ca": 8.56, "Cref": 8.56, "Tref": 311.0, "Conc_Error": 0.0, "Eps_Yield": 0.0, "Cb_Prod": 0.0}}'

Azure

Introduction

This will set up a fully functional Composabl Training Cluster on your Azure tenant.

Prerequisites

  1. An active Azure subscription

  2. Quota for the supported VM sizes - see VM sizing and types below for details. If using GPU training, you'll also need quota for GPU-enabled nodes (such as the NCasT4_v3 series).

Overview

Setting up your cluster consists of the following steps:

  1. Registering a new cluster in Composabl

  2. Running through the wizard in Azure

Registering a new cluster in Composabl

  1. Head to the Composabl UI and navigate to the Clusters page

  2. Choose "Create Cluster"

  3. Select "Azure Cluster"

  4. Take note of the Cluster ID and License key - you'll need them during the setup wizard on Azure, then click "Create on Azure"

Creating your cluster on Azure

You can find the Composabl Training Cluster application on the Microsoft Azure Marketplace.

  1. Click "Create" to start the setup wizard

  2. On the first step, select a resource group to create the cluster in. We recommend you use a new Resource Group made specifically for this purpose. Give the resource a descriptive name.

  3. On the second step, fill in the following:

  • Cluster name - this is a technical name and should be unique for all AKS clusters in your Azure subscription

  • Kubernetes version - select the latest version, unless a policy prevents you from doing so. Only supported versions are listed.

  • Container registry name - this is the name of the container registry you'll be pushing your simulator images to. As with the Cluster name, this name has to be unique for all Azure Container Registries in your Azure subscription.

  • GPU Training - Check this if you want to use GPUs to accelerate training.

  • Scaling settings - Whether to enable autoscaling, and limits for the autoscaling, if enabled. Note that we recommend you leave autoscaling enabled, as this can significantly reduce costs.

  • The advanced settings should not be used unless you require specific features to be enabled for testing or support purposes.

On the final step, fill in the Cluster ID and License key you received during cluster registration. Finally, click "Review + Create" to review the settings and create your training cluster.

VM sizing and types

A Composabl cluster consists of up to 6 different node types:

  1. Main nodes - these are Kubernetes system nodes, used for running the Kubernetes master processes

  2. Composabl nodes - these nodes run the Composabl software - e.g. the Composabl Controller and the Historian processor

  3. Environment runner nodes - The Composabl SDK will manage the simulator instances from these nodes. If GPU training is disabled, the training process will run on these nodes as well.

  4. Simulator nodes: These nodes are where the simulator(s) will be running

If you have enabled GPU training and/or GPU simulators, you will also have the following:

  5. (GPU Training enabled) Learner nodes - these GPU-enabled nodes will perform the actual training on GPU, after samples are collected from the simulators

  6. (GPU Training + GPU Sims enabled) GPU simulator nodes - these nodes will run GPU-enabled simulators

For most cases, we recommend leaving the default sizing. If you don't have quota available for one of the default sizes, you can change any of them to a similar node size.
