Build Your Space AI Agent - 1
Build Your Space AI Agent - 2
Build Your Space AI Agent - 3
Build Your Space AI Agent - 4
Build Your Space AI Agent - 5

Build Your Space AI Agent

Make your AI assistant see faces, remember people, and serve you proactively

Intermediate40minVoice AI
Watcherface_recognitiondisplayVoicexiaozhi

What This Solution Does

Your AI assistant is smart, but it can't "see" — it doesn't know who's talking to it.

Imagine this: You come home from work, and your AI assistant recognizes your face and says "Welcome back, John! You have 3 important emails today." When guests arrive, it can tell the difference between family and strangers. In a showroom, it recognizes VIP customers and provides personalized service.

This solution gives your AI assistant "eyes" and a "big screen," turning it into a true smart space butler.

Core Value

BenefitDetails
Recognize & GreetSay "Remember my face, my name is John," and get greeted automatically next time
Personalized ServiceDifferent people get different responses and actions
Privacy in Your HandsAll face data stays on local device, never uploaded to cloud
Large DisplayCast conversations to TV/monitor for everyone to see
Narrate ModeAI switches background images based on conversation — great for tours & demos

Application Scenarios

ScenarioHow It Works
Smart HomeAuto-greet family on entry, remind kids about homework, check on elderly's health
Guest ReceptionIdentify VIP customers, auto-display personalized service info on big screen
ExhibitionAI interactive display attracts visitors, remembers preferences for next visit
OfficeGreet you by name in the morning, tell you today's schedule

Requirements

Inputs and Outputs

Voice + face video input; voice response + display content + custom actions output

Face Recognition Feature

ConditionDetails
CapacityRemembers up to 20 faces
LightingRequires normal indoor lighting, won't work well in darkness
AngleFrontal face works best, side profiles may be less accurate
DeviceOnly needs SenseCAP Watcher, your existing device works

Display Casting Feature

ConditionDetails
DisplayRequires a TV or monitor with HDMI support
ComputingRequires a computer or Raspberry Pi to run the display service
NetworkWatcher and display device must be on the same WiFi

Two Features Can Be Installed Separately

  • Just want face recognition? Install only that
  • Just want large display? Install only display casting
  • Want both? Install both, they work independently

Deployment Comparison

OptionCore DeviceFeatureBest For
Display CastSenseCAP Watcher + reComputer R1100Cast conversations to TV/monitorExhibition demos, multi-person interaction, home entertainment
Face RecognitionSenseCAP WatcherRecognize & greet, personalized serviceSmart home, VIP reception, office automation

Integration Interfaces

mqtt

Face recognition events

Port: 1883
{"event":"face_recognized","name":"John","confidence":0.92}

Usage Requirements

usb

SenseCAP Watcher connected via USB

Deployment Options

Architecture
voice_assistant
edge_device

Download & Install

Preset: Face Recognition {#face_recognition}

Add face recognition to your Xiaozhi, letting it recognize family and friends.

DevicePurpose
SenseCAP WatcherAI voice assistant with camera
USB-C data cableFor firmware flashing

What you'll get:

  • Automatic greeting when recognized face appears
  • Enroll and manage faces from the web panel (up to 20 people)
  • Voice control to enable, query, and delete faces

Requirements: WiFi network · Xiaozhi App for device binding

Step 1: Flash Xiaozhi Firmware {#face_esp32 type=esp32_usb required=true config=devices/watcher_esp32.yaml}

Write the voice assistant program to the Watcher to enable voice interaction.

Wiring

Connect Device

  1. Connect Watcher to computer via USB-C cable
  2. Select the serial port above (choose one starting with wchusbserial)
  3. Click the Flash button

Troubleshooting

ProblemSolution
Serial port not foundTry a different USB cable or USB port
No serial data receivedHold BOOT button, press RESET, release BOOT, then retry
Flash failedUnplug and reconnect the device

Step 2: Flash Face Recognition Firmware {#face_himax type=himax_usb required=true config=devices/watcher_himax.yaml}

Write the face recognition program to the Watcher's AI chip.

Wiring

Connect Device

  1. Ensure Watcher is connected to computer
  2. Select the serial port above (choose one starting with usbmodem)
  3. Click the Flash button
  4. After clicking Flash, press the reset button on the device to enter flash mode

Troubleshooting

ProblemSolution
Device not respondingUnplug and reconnect the USB cable
Flash stuck or failsPress the reset button and try again
Flash fails repeatedlyUse a different USB cable or port
Flash fails at 99% or restarts mid-flashClose other apps using serial ports, reconnect USB and retry

Step 3: Configure Xiaozhi {#face_configure type=manual required=false}

Connect the Watcher to WiFi and bind it to your account using the mobile app.

Connect to WiFi

Device will prompt for network setup on first boot. Follow voice instructions to connect to WiFi.

Bind Xiaozhi Account

  1. Open Xiaozhi App
  2. Scan the QR code displayed on device
  3. Complete the binding process

Test Voice

Wake up the device by saying "Xiaozhi Xiaozhi" to test voice interaction. If it responds normally, the setup is successful.

Troubleshooting

ProblemSolution
WiFi connection failedEnsure using 2.4GHz network, check password
QR code not showingRestart device, wait for boot complete

Step 4: Face Database Management {#face_enroll type=serial_camera config=devices/face_enroll.yaml required=false}

Manage the face recognition database through the app interface.

How to Use

  1. Click Connect to start the camera preview
  2. You'll see a live feed with face detection boxes
  3. Use the Face Database panel below to manage enrolled faces

Register a New Face

  1. Click Register in the Face Database panel
  2. Enter a name for the person
  3. Click Start Capture — face the camera with good lighting
  4. Wait for the capture to complete (5 seconds)
  5. The new face will appear in the table

Troubleshooting

ProblemSolution
"Please complete Step X first"Go back and select the correct serial port in the indicated step
Camera not showingCheck USB connection, try refreshing ports in Step 2
Enrollment failedEnsure good lighting, face the camera directly, try again

Deployment Complete

Face recognition is ready! Let's try it out.

Step 1: Enroll faces from the panel above

Use the Face Database panel to enroll family or colleagues (see "Register a New Face" above).

Step 2: Enable face recognition by voice

Unplug the USB cable, wake up the device by saying "Xiaozhi Xiaozhi", then say:

"Turn on face recognition"

Once confirmed, Watcher will automatically scan for faces while idle.

Step 3: Experience automatic recognition

Walk in front of the Watcher — it will recognize you and greet you by name!

Voice Commands

Say thisEffect
"Turn on face recognition"Enable recognition (required on first use)
"Turn off face recognition"Disable recognition
"Delete face XXX"Remove an enrolled face
"Who do you know"List all enrolled faces
"Turn on familiar mode"Only alert for strangers, ignore familiar faces
"Turn off familiar mode"Greet everyone

Preset: Display Cast {#display_cast}

Cast Xiaozhi conversations to TV or large display, ideal for exhibition halls, meeting rooms and multi-person scenarios.

DevicePurpose
SenseCAP WatcherAI voice assistant
reComputer R1100Edge computing device, runs display service
HDMI DisplayShows cast content

What you'll get:

  • Real-time conversation display on big screen
  • Fullscreen mode for presentations
  • mDNS auto-discovery - connect by voice command
  • Narrate mode - AI controls background images for storytelling & guided tours

Requirements: All devices on same network

Step 1: Flash Watcher Firmware {#display_watcher type=esp32_usb required=true config=devices/display_watcher.yaml}

Write the voice assistant program to the Watcher for display casting.

Wiring

Connect Device

  1. Connect Watcher to your computer using USB-C cable
  2. Select the serial port above
  3. If not detected, try a different USB port or cable

Troubleshooting

ProblemSolution
Serial port not foundTry a different USB cable or USB port
Flash failedUnplug and reconnect the device

Step 2: Deploy Display Service {#display_service type=docker_deploy required=true config=devices/display_local.yaml}

Start the display service that shows conversations on your screen.

Target: Local Deployment {#display_service_local type=local config=devices/display_local.yaml}

Deploy the display service on your local computer.

Wiring

Architecture

  1. Ensure Docker is installed and running
  2. Set a display name (e.g. "Living Room Display") for mDNS discovery
  3. Click Deploy button to start services

Troubleshooting

ProblemSolution
Docker not foundInstall Docker Desktop
Port 8765 busyStop other services using this port

Target: Remote Deployment {#display_service_remote type=remote config=devices/recomputer.yaml default=true}

Deploy the display service to reComputer R1100.

Wiring

Architecture

  1. Connect reComputer to network and HDMI display
  2. Enter IP address and SSH credentials
  3. Set a display name (e.g. "Meeting Room Display") for mDNS discovery
  4. Click Deploy to install on remote device

Troubleshooting

ProblemSolution
SSH connection failedCheck IP address and credentials
Docker pull failedCheck network connection, retry deployment
Watcher can't find displayEnsure both devices on same network, check firewall

Deployment Complete

Display cast is ready!

Test it:

  1. Open http://<device-ip>:8765 on display browser
  2. Press F for fullscreen mode
  3. Say "Cast to [Display Name]" to start

Voice commands: "Start casting", "Stop casting", "Cast status"

Narrate Mode (New): The display now supports a narrate mode where AI can control background images — ideal for presentations, storytelling, and guided tours.

  1. Click the gear icon (top-left) to open the config panel
  2. Toggle "Enable Narrate Mode"
  3. Enter your Xiaozhi WebSocket MCP URL to enable AI-driven image switching
  4. Add trigger rules: keyword + image URL pairs for automatic background changes
  5. Press N to toggle narrate mode, click the PiP window to resize it
Contact Us
We Are Glad to Be Your Hardware Partner !
Have you used our products before?