[D1] Back To Systems

Operational Systems Portfolio

Deployment Preparation

Onyx / Edge Fleet Observability

See the fleet as it actually is. Operate it with confidence.

Onyx gives fleet operators, network teams and SREs a production-ready observability layer for distributed infrastructure, combining telemetry, logs, topology, alerting, device operations and resilient agent delivery into one operational platform.

Fleet observability for teams that need faster detection, better incident context and stronger operational control across distributed systems.

Product Maturity

Functionally complete and prepared for deployment readiness.

Onyx is beyond concept stage. The platform, agent architecture, telemetry pipeline, dashboard surfaces and supporting operator workflows are built; the current focus is deployment readiness, hardening and commercial rollout for early production environments.

01

Core product workflows for fleet telemetry, device visibility, topology, alerting, incidents, monitoring and audit are already in place.

02

The Rust agent, Next.js dashboard and Python platform services are built around a production deployment model rather than a prototype stack.

03

The product is now in deployment preparation, infrastructure hardening and early commercial onboarding rather than foundational product invention.

What Onyx Is

A fleet observability and operational context system for distributed infrastructure.

Onyx helps teams understand distributed infrastructure as a live fleet rather than a collection of disconnected signals. It brings telemetry, topology, incidents, device state and operational workflows together so teams can see the fleet clearly and act with more confidence.

01

Collect The Signals

Onyx captures telemetry, logs, device health and reachability across distributed environments without assuming perfect connectivity or ideal operating conditions.

02

Map The Relationships

Topology and infrastructure context help operators understand how devices, networks and operational dependencies relate instead of treating the fleet as isolated endpoints.

03

Detect The Drift

Alerting, health signals and investigative context help teams surface degraded devices, abnormal behaviour and fleet changes earlier.

04

Operate The Fleet

Incidents, monitoring, audit and device operations surfaces turn observability into an operating layer that supports response and ongoing fleet control.

Core Value

Onyx turns fragmented fleet observability into a usable operating picture.

Instead of leaving telemetry, topology, alerts and device context spread across separate systems, Onyx brings them together so operators can understand distributed infrastructure as a fleet rather than a collection of partial signals.

01

Fleet Visibility With Real Context

Onyx brings telemetry, logs, health signals and device relationships into one operational view so teams can reason about the fleet as a system rather than a list of endpoints.

02

Operational Awareness Beyond Dashboards

Topology, alerting, incidents, monitoring and device-level investigation work together, helping operators move from signal to action faster.

03

Resilient Edge Collection

The agent and collection model are built for real distributed environments, with buffering, failover and self-healing behaviour that support continuity when connectivity is imperfect.

Why Teams Look For Onyx

The same distributed observability problems surface across serious fleets.

Teams usually start looking for a stronger system when fleet visibility is fragmented, incident context is slow and edge conditions expose the limits of tooling built around centralised assumptions.

01

Fragmented Visibility

Teams often have telemetry, logs and alerts, but not a coherent fleet picture that shows what is unhealthy, where it sits and how it relates to the wider environment.

02

Slow Incident Context

Operators lose time stitching together dashboards, logs and network knowledge before they can understand what is really happening during a fleet issue.

03

Blind Spots At The Edge

Distributed and field infrastructure often sits outside the assumptions of centralised observability tooling, leaving gaps in visibility and confidence.

04

Collection That Breaks Under Real Conditions

Many environments need collection and transport layers that tolerate imperfect connectivity, endpoint churn and operational variance instead of failing under them.

Operational Impact

What improves when the fleet becomes easier to understand.

For teams already dealing with fragmented visibility, distributed failure modes and slow incident context, the payoff is not another dashboard. The payoff is faster understanding, earlier detection and stronger operational control.

01

Earlier Detection Of Real Issues

Operators get a clearer picture of degraded devices, blind spots and abnormal fleet behaviour before those issues become larger operational problems.

02

More Useful Context During Response

Telemetry, logs, topology and device state stay closer together, helping teams move from alert to understanding with less manual stitching across tools.

03

Stronger Control Across Distributed Environments

Onyx improves confidence in environments where infrastructure is geographically distributed, operationally diverse and difficult to reason about from a single vantage point.

Capabilities

Onyx improves fleet visibility, the speed of operational understanding and the resilience of data collection across distributed environments.

Unified visibility into fleet telemetry, device health, logs, alerts and infrastructure relationships across distributed environments.

Topology-aware fleet operations that help teams understand not just what is unhealthy, but how devices, networks and operational conditions relate.

A resilient agent layer with store-and-forward buffering, self-healing behaviour, endpoint failover and controlled command workflows for real operational use.

Operational surfaces for alerts, incidents, monitoring, audit and fleet investigation rather than monitoring dashboards alone.

Observability workflows that extend into discovery, audit and topology through the wider Onyx operator tool suite.

Where It Fits

The strongest fit is in environments where device state, topology, reachability and incident context all matter together, and where centralised monitoring alone is not enough.

Teams operating distributed devices, field infrastructure or edge systems where visibility is fragmented today.

Environments where device health, topology, reachability and alert context all matter together rather than in isolation.

Operators who need a stronger investigative and response surface across fleets that are operationally hard to observe from centralised tooling alone.

Typical Applications

Edge infrastructure and fleet operations.

Distributed observability for platform, SRE and network engineering teams.

Operational monitoring where device state, topology and fleet health all matter together.

Environments where device reachability, telemetry continuity and incident context need to hold up outside the datacentre.

Before Onyx

What teams usually rely on before they adopt a stronger fleet observability system.

Onyx is not entering a blank space. Most teams already have some mix of monitoring dashboards, logs, alerts, network tooling and incident workflows. The issue is that those pieces rarely come together into a coherent fleet operating picture on their own.

01

Separate Dashboards For Separate Signals

Many teams manage fleet state through disconnected dashboards for telemetry, logs, networking and incidents, which makes real operational understanding slower than it should be.

02

Centralised Monitoring Assumptions

Traditional monitoring stacks often assume stable connectivity and datacentre-like conditions that do not reflect how distributed infrastructure behaves in practice.

03

Topology And Device Context Outside The Workflow

Operators frequently need human memory, separate documentation or external tools to understand how the fleet is connected and where a problem actually sits.

Proof

Built to earn trust in distributed operating environments.

Onyx is built to support real fleet operations with resilient collection, topology-aware context and operational surfaces that help teams investigate, respond and stay oriented as the environment changes.

01

Production-Oriented Agent

The Rust agent is built for long-running fleet deployment with buffering, supervised recovery, controlled command execution and self-healing behaviour rather than lightweight demo collection.

02

Topology-Aware Operations

Onyx gives operators more than device metrics, combining telemetry with network relationships and investigative context across the fleet.

03

Operational Depth

The platform includes alerting, incidents, monitoring, audit and fleet operations surfaces so teams can work from one operational picture instead of fragmenting response across tools.

What That Means In Practice

Built to keep telemetry, topology, logs and incident context closer together during real operator workflows.

Designed for fleets where connectivity is imperfect and collection resilience matters as much as the dashboard view.

Structured to support investigation, response and ongoing fleet operations rather than passive monitoring alone.

Why Onyx

Onyx is built for teams that need fleet observability to be operationally useful, not just visually informative.

What makes Onyx distinct is not just telemetry collection in isolation. It is the combination of resilient edge collection, topology-aware context and fleet operations surfaces inside one production-oriented workflow.

01

Fleet-Native By Design

Onyx is built for distributed infrastructure and edge fleets where device state, topology and operational conditions have to be understood together, not approximated from centralised infrastructure assumptions.

02

Operationally Useful Context

Telemetry, logs, topology, alerts and investigation surfaces support real operator workflows, not just passive monitoring.

03

Resilient Collection Model

The agent and collection pipeline are designed for continuity under imperfect connectivity, using buffering, failover and self-healing rather than assuming ideal conditions.

Operational Intelligence

Onyx turns fleet signals into a more useful operational understanding layer.

Onyx does more than collect metrics. Its intelligence layer helps operators interpret what matters across telemetry, topology and fleet history so the platform becomes a stronger system for understanding live distributed environments.

01

Operational Analysis With Context

Intelligence workflows can reason over telemetry, topology and operational history to help teams interpret fleet state with more context than dashboards alone.

02

Fleet Understanding That Compounds

As operators build a richer picture of device state, topology and recurring issues, Onyx becomes a stronger system for interpreting what matters across the fleet.

03

Production-Oriented Assistance

The intelligence layer supports operational investigation and decision-making inside the platform rather than acting as a disconnected assistant beside it.

Deployment Foundation

Built for distributed infrastructure on AWS, with a deployment posture that supports real fleet operations.

Onyx is built on AWS so the platform can be deployed with the security, resilience and operational control serious fleet environments require. Ingest, fleet-facing services and backend platform components run inside an enterprise-ready foundation built for real distributed operations.

01

AWS-Native Delivery

Onyx uses an AWS deployment model that supports secure ingress, fleet-facing endpoints and production infrastructure control for distributed environments.

02

Resilient Data Path

The collection and platform model preserve operational continuity across telemetry ingest, buffering, transport and backend services when conditions are less than perfect.

03

Platform Architecture For Operations

Dashboard, APIs, agent services and backend telemetry infrastructure are structured to support serious fleet operations, not just visualisation.

FAQ

Common questions from teams evaluating Onyx.

These are the questions we expect from operators and infrastructure teams assessing whether Onyx is relevant to their fleet and operating model.

Q01

What is Onyx?

Onyx is a fleet observability platform for distributed infrastructure. It helps operators understand telemetry, topology, incidents, device state and operational health through one production-oriented fleet view.

Q02

Who is Onyx for?

Onyx is designed for operators, SRE teams, network engineers and infrastructure teams managing edge systems or distributed fleets where centralised monitoring alone is not enough.

Q03

How is Onyx different from standard monitoring dashboards?

Standard monitoring often stops at isolated charts and alerts. Onyx combines telemetry, topology, fleet context, incidents and operational workflows so teams can understand and operate the fleet more effectively.

Q04

What gives Onyx an enterprise-ready deployment posture?

Onyx is built on an AWS-native foundation with secure ingress, resilient data paths and production-grade platform services. That gives teams a deployment posture built for control, resilience and real fleet operations rather than a lightweight monitoring demo.

Design Partner Access

Bring Onyx into view before it reaches general availability.

Design partner access is for teams that want an earlier look at how Onyx supports fleet observability in their environment. It gives you a clearer understanding of the platform, a direct way to share operational context and a practical path into deployment planning if the fit is strong.

Relevant For 01

For operators who need clearer visibility across edge and distributed environments.

Relevant For 02

For teams that need observability to extend into topology, device operations and fleet response rather than stopping at dashboards.

What You Get

Early Access

01

See how Onyx supports telemetry, topology, investigation and fleet operations in distributed environments.

02

Share the fleet shape, operating constraints and observability gaps that matter most in your environment.

03

Move into deployment planning and a closer onboarding conversation if Onyx is a strong fit for your team.

Register interest to start a conversation around your fleet, your operating constraints and design partner availability.