Preset: RK3576 LLM Inference {#rk3576_llm}
Deploy DeepSeek-R1 large language model to your reComputer RK3576 with one click.
| Device | Purpose |
|---|---|
| reComputer RK3576 | Runs DeepSeek-R1 LLM with NPU acceleration |
What you'll get:
- OpenAI-compatible chat API running locally on your device
- A choice of five model variants (1.5B or 7B parameters, in several quantizations)
- No cloud dependency — all inference runs on-device
Requirements: an RK3576 device with SSH access and Docker installed
Step 1: Deploy DeepSeek-R1 {#deploy_llm type=docker_deploy required=true config=devices/rk3576.yaml}
Deploy the LLM container to your RK3576 device.
Target: Remote Deployment {#rk3576_remote type=remote config=devices/rk3576.yaml default=true}
Deploy to your RK3576 over SSH with one click.
Wiring
- Connect RK3576 to the same network as your computer
- Select the model variant you want to run
- Fill in device IP, SSH username, and password
- Click Deploy
Deployment Complete
- The LLM container is running on your RK3576
- Chat API is available at `http://<device-ip>:8001/v1/chat/completions`
- Use any OpenAI-compatible client to connect
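Before wiring up a full client, the endpoint can be exercised with only the Python standard library. The device address below is a placeholder you must replace; the request body follows the OpenAI chat-completions schema the server exposes.

```python
import json
import urllib.request

DEVICE_IP = "192.168.1.100"  # placeholder: replace with your RK3576's address

# Minimal OpenAI-style chat-completions request body
payload = {
    "model": "rkllm-model",
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 256,
}

req = urllib.request.Request(
    f"http://{DEVICE_IP}:8001/v1/chat/completions",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)

try:
    with urllib.request.urlopen(req, timeout=5) as resp:
        answer = json.loads(resp.read())
        print(answer["choices"][0]["message"]["content"])
except OSError as exc:
    # Connection refused usually means the model is still loading
    print(f"API not reachable yet: {exc}")
```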
Troubleshooting
| Issue | Solution |
|---|---|
| SSH connection failed | Verify the device IP address, username, and password |
| NPU not detected | Make sure the device is an RK3576 with the RKNPU kernel module loaded |
| Out of memory (7B model) | 7B variants need 8 GB+ RAM; try a 1.5B variant instead |
| Image pull slow | Check your network connection; the image is 1-4 GB depending on the variant |
Step 2: Try Chat {#verify_llm type=text_chat required=false config=devices/llm_chat.yaml}
Test the LLM by sending a message.
Troubleshooting
| Issue | Solution |
|---|---|
| Connection refused | Wait 30-60 seconds for the model to finish loading |
| Timeout | 7B models respond more slowly; allow up to 2 minutes |
| Empty response | Check the container logs: `docker logs ai_lab_llm` |
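While waiting for the model to load, you can probe the endpoint from your workstation. The IP below is a placeholder; any HTTP status code in the output (even a 404/405 on a bare GET) means the server is up, while a connection error usually means the model is still loading.

```shell
DEVICE_IP=192.168.1.100   # placeholder: replace with your RK3576's address
# Print only the HTTP status code; fall back to a message on connect failure
curl -s --max-time 5 -o /dev/null -w "%{http_code}\n" \
  "http://${DEVICE_IP}:8001/v1/chat/completions" || echo "not reachable yet"
```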
Deployment Complete
DeepSeek-R1 is running on your RK3576 device.
Quick Start
```shell
curl http://<device-ip>:8001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "rkllm-model", "messages": [{"role": "user", "content": "Hello!"}], "max_tokens": 256}'
```
Python Example
```python
import openai

client = openai.OpenAI(base_url="http://<device-ip>:8001/v1", api_key="dummy")
response = client.chat.completions.create(
    model="rkllm-model",
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=256,
)
print(response.choices[0].message.content)
```
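Many OpenAI-compatible servers can also stream tokens as server-sent events (by passing `stream=True` or `"stream": true`). Whether this particular RKLLM build supports streaming is not confirmed here, but if it does, each event line carries a JSON delta in the OpenAI format. A sketch of assembling a reply from such lines, using sample data rather than real device output:

```python
import json

# Sample SSE lines in the shape an OpenAI-compatible server emits
# (illustrative data, not captured from a real device)
sse_lines = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo!"}}]}',
    "data: [DONE]",
]

def assemble(lines):
    """Concatenate the content deltas from OpenAI-style SSE chunks."""
    parts = []
    for line in lines:
        if not line.startswith("data: "):
            continue
        data = line[len("data: "):]
        if data == "[DONE]":  # sentinel marking the end of the stream
            break
        delta = json.loads(data)["choices"][0]["delta"]
        parts.append(delta.get("content", ""))
    return "".join(parts)

print(assemble(sse_lines))  # → Hello!
```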