AI Data Center Operations

All Systems Normal
⚡ 2.41 TB/s
🔔
12
🌐
Join the waitlist
A
Admin ▾
AI Cluster HealthLIVE
🛡0/100
Healthy
GPU UtilizationLIVE
0%
DCOS Java process
Power UsageLIVE
0MW
68% of 3.45 MW
PUE
💧0
Excellent
Active AlarmsLIVE
🔔0
Critical 2 Warning 10
Open TicketsLIVE
🎫0
In Progress
Data Center 3D View
normal
Temperature
Humidity
Power
GPU
GPU Count
Utilization
Network
Storage I/O
Overlay
18°C
45°C
GPU Clusters View all
AI-Training-01
85%
1,280/1,504
AI-Training-02
72%
1,152/1,600
AI-Inference-01
65%
1,024/1,568
AI-Inference-02
90%
1,386/1,536
Research-Cluster
45%
698/1,552
Power Trend
Cooling Overview
24Total
Running20 (83%)
Standby3 (13%)
Maintenance1 (4%)
Avg. Supply
18.2 °C
Avg. Return
27.8 °C
Total Assets
0
across all types
Online
0
healthy
Warning
0
action soon
Critical
0
attention now
Asset ID Type Model Location Capacity Utilization Status
Coming soon
This module is part of the Sensaka roadmap.
Join the waitlist
Active Alarms View all
▲ Critical
Rack A-04 Over Temperature
Temperature 32.6°C — threshold 30°C
2 min ago
▲ Critical
GPU Node 192.168.1.45 Unreachable
No response from BMC
5 min ago
⚠ Warning
Power Usage High — Rack A-07
32.1 kW exceeds threshold
8 min ago
⚠ Warning
Fan Speed Low — Rack A-02
Fan 3 at 40%
15 min ago
ℹ Info
Backup Job Completed
Daily backup finished
20 min ago
AI Insights View all
🤖 Potential Risk Detected
Rack A-04 shows a +0.4°C/hr rising trend. Recommend inspecting CRAC unit 3.
Network Overview View all
Spine Switches
8
Healthy
Leaf Switches
48
Healthy
Network Utilization
42%
1.2 / 2.8 Tbps