Technical architecture
Edge-first observability, not another control system.
RobotOps runs as a small edge gateway on the same network as your robots and WMS. It ingests the telemetry you already have access to—APIs, topics, logs, and reports—normalizes it into a vendor-neutral model, and pushes summarized signals into the cloud for fleet-wide analytics. It never sits in the robot control loop.
This page is for engineers, integrators, and reliability teams who want to understand exactly where RobotOps runs, how it moves data, and what it does and does not do.
High-level flow
At a high level, RobotOps follows a simple edge-first pattern:
- Collect telemetry from vendor APIs, message buses, logs, and reports on your own network.
- Normalize that telemetry into a unified schema for robots, missions, incidents, and sites.
- Summarize and enrich signals at the edge (health, anomalies, counters, aggregates).
- Send low-rate, vendor-neutral metrics and events to the cloud for dashboards, alerts, and analytics.
The edge gateway does feature extraction and buffering; the cloud focuses on cross-fleet visibility, historical analysis, reliability metrics, and alert routing.
Edge gateway responsibilities
The edge gateway runs on-premise, typically on an industrial PC or VM in the same VLAN as your robots and WMS.
- Connects to vendor APIs and webhooks for robot state, missions, alarms, and battery.
- Subscribes to message buses you already operate (MQTT, Kafka, etc.) for robot and mission events.
- Ingests logs, DB tables, and batch exports from OEM systems.
- Performs lightweight aggregation (counters, rates, rolling windows, anomaly flags).
- Buffers telemetry locally so short network blips do not cause data loss.
RobotOps does not packet-sniff proprietary protocols or bypass OEM safety layers. If an OEM exposes no telemetry, we show that gap instead of pretending it is observable.
Cloud responsibilities
The cloud turns normalized signals into fleet-wide observability:
- Multi-site, multi-vendor time series for robot, mission, and incident metrics.
- Health and reliability scores (MTBF, MTTR, fault rates, stranded events) per robot/vendor/site.
- Performance views for throughput, queue times, blocked time, and idle vs. productive time.
- Incident detection based on patterns in telemetry, not raw fault codes.
- APIs and export endpoints for data warehouses or BI tools.