AI Lab — Edge AI Model Hub on Rockchip

What This Lab Does

A one-stop sandbox to try popular open-source AI models on your reComputer RK3576 / RK3588. Pick a preset, deploy with one click, and start calling the API — everything runs on the device, no cloud needed.

Preset	Model	What you get	Hardware
Object Detection	YOLO 11 (Nano / Small / Medium)	Detect 80 common objects in images or live video	RK3576 or RK3588
LLM Chat	DeepSeek-R1 (1.5B / 7B, multiple quantizations)	OpenAI-compatible chat API, fully local	RK3576 (8GB recommended for 7B)
Vision Chat	Qwen2.5-VL	Describe images, answer questions about pictures	RK3576 (8GB+)

Output Interfaces

Preset	Endpoint	Method	Description
Object Detection	:8000/api/models/yolo11/predict	POST	Upload image, returns bounding boxes
Object Detection	:8000/api/video_feed	GET	MJPEG stream with detection overlay
LLM Chat	:8001/v1/chat/completions	POST	OpenAI-compatible chat (supports streaming)
Vision Chat	:8002/v1/chat/completions	POST	OpenAI-compatible vision chat (text + image)
Vision Chat	:8002/docs	GET	Interactive API docs (Swagger UI)

Quick start — Object Detection:

curl -X POST http://<device-ip>:8000/api/models/yolo11/predict \
  -F "file=@photo.jpg" -F "conf=0.5"

Quick start — Chat (LLM or VLM):

curl http://<device-ip>:8001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model":"rkllm-model","messages":[{"role":"user","content":"Hello!"}],"max_tokens":256}'

Core Value

One platform, three models — try detection, chat, and vision without setting up three different stacks
Local-only by default — your images and conversations never leave the device
Standard APIs — REST + OpenAI-compatible interfaces, drop-in for existing clients
NPU accelerated — Rockchip NPU runs inference efficiently on low-power hardware

Integration Scenarios

Scenario	Which preset	How it fits
Security camera intrusion detection	Object Detection	Call predict API from NVR for person/vehicle alerts
Air-gapped chatbot or code assistant	LLM Chat	OpenAI-compatible client points to local endpoint
Document screening / image triage	Vision Chat	Send page screenshots, ask questions in natural language
Robot perception + dialog	Object Detection + LLM	Pair detection output with LLM reasoning on the same device

Technical Specs

Spec	Value
Object Detection latency	~30ms/frame (RK3576) · ~20ms (RK3588)
LLM token speed	Depends on variant — 1.5B fastest, 7B strongest
Memory needed	4GB for 1.5B LLM · 8GB+ for 7B LLM and Vision Chat
Disk needed	3–10GB per preset, depending on model
Supported hardware	reComputer RK3576 (all presets) · reComputer RK3588 / ROCK 5T (Object Detection only)

Good to Know

Each preset deploys independently — you can have all three running on the same RK3576 (they use different ports: 8000 / 8001 / 8002)
First startup downloads the Docker image (1–4GB) and loads the model — allow a few minutes
Object Detection works with or without a camera (image upload always works)
LLM and Vision Chat are RK3576-only because they require RKLLM NPU support not yet available on RK3588
All conversations and detections stay on-device — nothing is sent to a cloud

AI Lab — Edge AI Model Hub on Rockchip

What This Lab Does

Output Interfaces

Core Value

Integration Scenarios

Technical Specs

Good to Know

Integration Interfaces

Usage Requirements

Deployment Options

Object Detection (YOLO 11)

LLM Chat (DeepSeek-R1)

Vision Chat (Qwen2.5-VL)