AxonOps AI Engine

AI built into your Apache Kafka and Apache Cassandra operations. Chat with your cluster, get root cause analysis on every alert, and read continuously generated reports.

How the AxonOps AI Engine works

The AI engine is a model trained on the Apache Kafka and Apache Cassandra source code and official documentation, deployed inside the AxonOps control plane. AxonOps agents on each cluster node feed metrics, logs, and configuration into the control plane, the AI reads from that data, and chat, alert analysis, and report outputs are returned to the same console your engineers already use. Nothing has to leave your environment for a third-party SaaS.

AxonOps AI architecture diagram
Live Cluster Data
  • Cassandra configurations and schema
  • Kafka configurations, topics, and ACLs
  • Kafka Connect configurations
  • Cassandra logs
  • Kafka logs
  • Cassandra metrics
  • Kafka metrics
  • Linux metrics
  • Linux configurations including system resources (CPU, memory, storage) and kernel tunings
Trained Knowledge
  • Cassandra source code
  • Cassandra documentation
  • Cassandra knowledge base
  • Kafka source code
  • Kafka documentation
  • Kafka knowledge base
AI Outputs
  • Root Cause Analysis on every metrics and logs alert
  • Cluster Health and Reporting kept continuously up to date
  • Chat Service for engineers to ask questions about the cluster
Chat

Ask questions about your cluster

A natural-language chat service connected directly to your AxonOps environment. Engineers ask questions in plain English and the AI answers using your live metrics, logs, and cluster configuration. The model is trained on Apache Kafka and Apache Cassandra source code and documentation, so it can reason about specific implementation details rather than offering generic infrastructure advice.

  • Natural-language questions over live metrics, logs, and configuration
  • Trained on Apache Kafka and Apache Cassandra source code and documentation
  • Answers cite the underlying telemetry and configuration they reason about
  • Available to operators, SREs, and developers from the same console
Ask questions about your cluster
Alert Analysis

Root cause analysis on every alert

Every metrics or logs alert is automatically analysed for likely root cause. The AI correlates the firing alert with related signals across the cluster, surfaces the probable cause, and recommends next steps so on-call engineers spend less time triaging and more time fixing.

  • Automatic root cause analysis on metrics and logs alerts
  • Correlation across nodes, topics, keyspaces, and consumers
  • Recommended remediation steps grounded in your cluster state
  • Shorter MTTR for on-call engineers
Root cause analysis on every alert
Reports

Cluster recommendations on demand

Generate a report on demand for your Kafka and Cassandra clusters and receive a structured set of recommendations across performance, reliability, configuration, capacity, and security. Findings are grouped with severity levels so engineers can prioritise what needs attention first.

  • Recommendations grouped across performance, reliability, configuration, capacity, and security
  • Severity levels to help teams prioritise remediation work
  • Operational findings grounded in live cluster state
  • Shareable with platform teams, SREs, DBAs, and management
Cluster recommendations on demand

See the AxonOps AI Engine on your clusters

AxonOps dashboard