Know what changed.
Know what failed.

Ember helps engineering teams trace failures, track releases, and investigate issues across complex hardware systems.

Faster root-cause analysis
Complete traceability
Offline-first telemetry capture
01 · Fleet Status

Monitor hardware in real time.

Live health and operational status across devices and environments.

Find systems quickly.

Filter by status, location, ownership, or deployment group.

Coordinate shared hardware.

Track availability, reservations, and device usage across teams.

ember › fleet
Fleet Robots Versions Schedule
Fleet
7 robots · live telemetry
Register Cards
Status Available In Use Maintenance Offline
Total Robots: 7 · Available: 7 · In Use: 0

Robot    ID              Status     Location  Computers  Online  Peripherals  Tags
Baymax   DEMO-BMX-001    Available  Lab 4     5          5       4
BB-8     DEMO-BB8-001    Available  Lab 3     5          5       3
Bender   DEMO-BND-001    Available  Sim Lab   5          5       3
Dummy    DEMO-DUMMY-001  Available  Demo Lab  5          5       4            demo
Optimus  DEMO-OPT-001    Available  Test Bay  5          5       2
R2-D2    DEMO-R2D2-001   Available  Lab 1     6          6       3

Computers on each robot: Eng Customer GUI computer-0 Operator computer-1 (R2-D2 also lists Eng 6)
02 · System Status

Inspect any device.

Devices, services, operational history, and deployment context in one place.

Investigate failures faster.

Trace recurring issues across logs, telemetry, and system events.

Live health and operational history.

System metrics, deployments, connectivity, and historical events captured automatically.

ember › fleet › bender
Bender
DEMO-BND-001
Available
Last used by Jordan Lee · Eng
5
Computers
5
Online
3
Peripherals
5 pkgs
Software
Overview Computers 5 Telemetry Peripherals 3 Calendar 11 SW History Sentinels
Available Error Online
Reservations
11
Notes
0
Peripherals
3
Computers
5/5
all online
Location
Sim Lab
Installed
May 2026
ACTIVE SENTINEL · OPEN
USB camera dropped — /dev/video0
Customer GUI Computer · 22m ago
3×
seen this week
Why this happened

Power-draw spike on USB hub B during nav startup briefly disconnected /dev/video0. Driver: uvcvideo · Kernel 5.15. Camera reconnected after 1.4s but two frames were lost.

Recommended fixes
  • Move camera to powered hub (port 3)
  • Update uvcvideo driver → 1.18+
  • Add disconnect guard in ember-agent watchdog
First seen 9 days ago · 7× across fleet · View runbook · Apply fix
Asset Lifecycle Timeline 20
Reserved by Priya Shah
Customer Site - SF · Training Session
Fri, Jun 5, 10:00 AM
Reserved by Theo Nguyen
Customer Site - SF · Integration Test
Mon, Jun 1, 08:00 AM
Eng Console connected
Agent 1.4.11 on demo-engineering-bender — telemetry, health, alerts active
Wed, May 27, 06:42 AM
Customer GUI Computer connected
Agent 1.4.11 on demo-media-bender — telemetry, health, alerts active
Wed, May 27, 06:42 AM
USB camera dropped — /dev/video0
Customer GUI Computer · first occurrence · uvcvideo driver disconnect
Mon, May 18, 11:34 AM
Quick Actions
Component Health
Eng Console
Eng · demo-engineering-b…
HEALTHY
CPU 44%
MEM 68%
DISK 60%
Customer GUI Computer
Customer GUI · demo-media-bender
HEALTHY
CPU 48%
MEM 60%
DISK 55%
computer-1
computer-1 · demo-computer-1…
HEALTHY
CPU 36%
MEM 35%
DISK 30%
03 · Release Tracking

Track every deployed version.

Software, firmware, and deployment history across systems and environments.

See what changed.

Compare deployments and quickly identify systems running different versions.

Connect releases to failures.

Trace operational issues back to the deployments and updates that introduced them.

ember › versions › nvidia-container-toolkit
Versions
Software, firmware, hardware, and peripherals across the fleet — with deployment history and blame.
Peripherals Hardware Firmware Software Snapshots Activity
nvidia-container-toolkit auto-updated 2 robots · 12h ago
baseline PINNED 1.14.3-1
2 VERSIONS 2 OFF-BASELINE · 2 ROBOTS
1.14.3-1 BASELINE · PINNED
5 devices · 5 robots · 71.4% of fleet
Bender Baymax BB-8 Dummy WALL-E
1.15.0-1 OFF-BASELINE · AUTO-UPDATED
2 devices · 2 robots · 28.6% of fleet · arrived from nvidia-archive via unattended-upgrades
Optimus R2-D2
CRASH TRACEBACK · 1.15.0-1
Auto-installed at 5:02 PM yesterday by unattended-upgrades
Source: nvidia-archive security pocket · Diff vs baseline: libnvidia-container.so.1 ABI break
Roll back to 1.14.3-1
This update has caused crashes in
cuda-vision-node
segfault on libcudart.so.12 load
4 crashes
ember-perception
degraded throughput, missed 187 frames
12 events
docker-runtime
container start failures on GPU init
2 events
Affected: Optimus, R2-D2 · First crash 11m after install
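
The baseline comparison in the panel above can be sketched in a few lines. This is an illustration only, not Ember's implementation: the function names (`group_by_version`, `off_baseline`, `fleet_share`) are hypothetical, and the robot names and version strings are copied from the demo data.

```python
from collections import defaultdict

# Installed nvidia-container-toolkit version per robot, copied from the demo panel.
installed = {
    "Bender": "1.14.3-1",
    "Baymax": "1.14.3-1",
    "BB-8": "1.14.3-1",
    "Dummy": "1.14.3-1",
    "WALL-E": "1.14.3-1",
    "Optimus": "1.15.0-1",
    "R2-D2": "1.15.0-1",
}
BASELINE = "1.14.3-1"  # pinned baseline shown in the panel


def group_by_version(installed):
    """Group robot names by the version each one reports."""
    groups = defaultdict(list)
    for robot, version in installed.items():
        groups[version].append(robot)
    return dict(groups)


def off_baseline(installed, baseline):
    """Return the sorted names of robots whose version differs from the baseline."""
    return sorted(r for r, v in installed.items() if v != baseline)


def fleet_share(groups, total):
    """Percentage of the fleet on each version, rounded to one decimal place."""
    return {v: round(100 * len(robots) / total, 1) for v, robots in groups.items()}


groups = group_by_version(installed)
drifted = off_baseline(installed, BASELINE)
shares = fleet_share(groups, len(installed))
```

With the demo data, `drifted` holds Optimus and R2-D2, and `shares` reproduces the 71.4% and 28.6% split shown in the panel.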
04 · Ops & Scheduling

Coordinate shared hardware.

Track availability, usage, and operational status across teams and systems.

Keep operational context attached.

Deployments, failures, testing, and ownership history tied to every system.

Route issues to the right team.

Operational issues linked directly to the affected systems and deployments.

Track operational workflows.

Coordinate testing, maintenance, deployments, and field operations in one place.

ember › schedule
Asset Schedule
Reserve and schedule asset check-outs — half-hour precision
Month Week Day Resource
New Reservation
May 25 — May 31, 2026 This Week
All Asset Types All Assets All Operators
25 Mon 26 Tue 27 Wed 28 Thu 29 Fri 30 Sat 31 Sun
Wednesday, May 27, 2026
7 robots · 7 reservations
HARDWARE
6a 7a 8a 9a 10a 11a 12p 1p 2p 3p 4p 5p 6p 7p 8p
Asset    Type   Status     Reservation                   Time                 Operator        Role
Baymax   Robot  Available  Customer Demo                 11:00 AM – 01:00 PM  Marcus Chen     Solutions Eng
BB-8     Robot  Alert      Maintenance · Pose drift >2°  02:00 PM – 04:00 PM  Diego Martinez  Vision Eng
Bender   Robot  Available  Hardware Validation           01:00 PM – 03:00 PM  Jordan Lee      Firmware Eng
Dummy    Robot  Available  Training Session              12:00 PM – 03:00 PM  Theo Nguyen     ML Eng
Optimus  Robot  Available  Integration Test              11:00 AM – 02:00 PM  Marcus Chen     QA Lead
R2-D2    Robot  In Use     Training Session              09:00 AM – 12:00 PM  Jordan Lee      Robotics Eng
WALL-E   Robot  Alert      Repair · Drive motor stall    05:00 PM – 07:00 PM  Ada Williams    Hardware Tech
Overnight (8pm – 6am) No overnight reservations
Legend Baymax BB-8 Bender Dummy Optimus R2-D2 WALL-E Current time
Request Robot Time
Reserve robot time · Ember Robotics
WALL-E Alert · Drive motor stall
Marcus Chen
Hardware Tech
Inspect drive-motor stall — Alert #482 fired at 09:14 AM, repeated 3× under load
05/27/2026
05:00 PM
07:00 PM
Ada (Hardware Tech) will be notified
Calendar invite sent with the linked alert and recent telemetry — WALL-E's schedule is updated automatically.
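
One way to picture half-hour scheduling with conflict checks is the sketch below. It is an assumption about how such a check could work, not a description of Ember's scheduler: `on_slot`, `overlaps`, and `can_reserve` are hypothetical names, and the only data used is WALL-E's 05:00 PM to 07:00 PM repair window from the schedule above.

```python
from datetime import datetime


def on_slot(t):
    """True when a time falls exactly on a half-hour boundary."""
    return t.minute in (0, 30) and t.second == 0 and t.microsecond == 0


def overlaps(a_start, a_end, b_start, b_end):
    """Half-open intervals [start, end) overlap when each starts before the other ends."""
    return a_start < b_end and b_start < a_end


def can_reserve(existing, start, end):
    """Accept a request only if it is slot-aligned, non-empty, and free of conflicts."""
    if not (on_slot(start) and on_slot(end)) or end <= start:
        return False
    return not any(overlaps(start, end, s, e) for s, e in existing)


day = datetime(2026, 5, 27)
# WALL-E's repair window from the schedule: 05:00 PM to 07:00 PM.
wall_e = [(day.replace(hour=17), day.replace(hour=19))]

# A back-to-back booking starting at 07:00 PM is accepted.
back_to_back = can_reserve(wall_e, day.replace(hour=19), day.replace(hour=20))
# A booking that starts at 06:30 PM collides with the repair window.
clash = can_reserve(wall_e, day.replace(hour=18, minute=30), day.replace(hour=19, minute=30))
```

Treating reservations as half-open intervals lets one booking end at the same moment the next begins without counting as a conflict.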

Investigating failures still takes too long

"The system behaves differently in production."

Not enough to fail immediately, but enough to create intermittent issues nobody can reliably reproduce.

"We know the rollout caused it."

We just can't tell which systems updated, which ones didn't, or when the behavior actually changed.

"The context is scattered everywhere."

Telemetry lives in one tool. Logs in another. Deployment history is buried in CI runs and Slack threads.

For fast-moving hardware teams

Monitor deployed systems

Live system health, processes, logs, and telemetry across devices and environments.

Understand what changed

Track software, firmware, and deployment history across systems.

Search operational failures

Search logs and telemetry across systems from one place.

Coordinate shared hardware

Track ownership, reservations, and operational status across teams.

Capture data offline

Store logs and telemetry locally during outages and replay automatically after reconnect.
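
A store-and-forward buffer of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Ember agent's code: the `OfflineBuffer` class, its methods, and the `send` callback are hypothetical, and the sample records use illustrative values.

```python
import json
import sqlite3


class OfflineBuffer:
    """Queue telemetry records locally and replay them in order once a link is available."""

    def __init__(self, path=":memory:"):
        # A file path would survive restarts; ":memory:" keeps the sketch self-contained.
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS pending ("
            "id INTEGER PRIMARY KEY AUTOINCREMENT, body TEXT NOT NULL)"
        )

    def capture(self, record):
        """Persist one record locally, whether or not the uplink is up."""
        with self.db:
            self.db.execute("INSERT INTO pending (body) VALUES (?)", (json.dumps(record),))

    def pending(self):
        """Number of records still waiting to be delivered."""
        return self.db.execute("SELECT COUNT(*) FROM pending").fetchone()[0]

    def replay(self, send):
        """Send queued records oldest first; stop at the first failure and keep the rest."""
        sent = 0
        rows = self.db.execute("SELECT id, body FROM pending ORDER BY id").fetchall()
        for row_id, body in rows:
            try:
                send(json.loads(body))
            except ConnectionError:
                break  # still offline: leave this record and everything after it queued
            with self.db:
                self.db.execute("DELETE FROM pending WHERE id = ?", (row_id,))
            sent += 1
        return sent


# Usage: capture while offline, then replay after reconnect.
buffer = OfflineBuffer()
buffer.capture({"device": "demo-media-bender", "cpu": 48})
buffer.capture({"device": "demo-media-bender", "cpu": 51})

delivered = []
buffer.replay(delivered.append)  # stand-in for a real uplink call
```

Deleting a record only after `send` returns means a dropped connection mid-replay loses nothing, at the cost of possibly re-sending the record that was in flight.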

Built for enterprise environments

SSO, audit logs, role-based access, and per-device authentication by default.

From R&D to production

Validation and testing

Track software versions, operational data, and release history during testing and sign-off.

Engineering workflows

Centralize logs, telemetry, and deployment history across development and field deployments.

Deployed systems

Investigate issues across systems running in production, offline, and low-connectivity environments.

Industries we serve
Robotics & Autonomy
Aerospace & Defense
Automotive & ADAS
Industrial & Energy
Semiconductor Equipment

Deployment to debugging in minutes

1

Deploy

Install the Ember agent on-device.

2

Capture

Collect operational data locally, even offline.

3

Investigate

Trace failures across systems, releases, and deployments.

10x your team's productivity.

See how teams debug and operate real-world systems with Ember.