Home    FPGA Development Board    JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

● Core Platform: Xilinx Kintex UltraScale+ Industrial FPGA + NVIDJetson Orin Heterogeneous AI SoM
● Channel Specs: 4 independent CXP-12 channels, 12.5Gbps per lane, total bandwidth up to 50Gbps
● 40GQSFP+ optical port, 26-pin multi-function I/O
● Edge AI Performance: Up to 157 TOPS INT8 offline local inference, no external 
● Applications:  Industrial AOI inspection, scientific high-speed imaging, automation control, semiconductor test, mobile robot, medical & UAV onboard vision

JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

Xilinx UltraScale+ FPGA + NVIDIA Jetson Heterogeneous Architecture | 4-Channel CXP-12 Edge AI Computing All-in-One | Integrated PoCXP Power Supply & Local Inference

Four Core Advantages

FPGA+Jetson Dual-Core Heterogeneous Computing Architecture

Equipped with Xilinx Kintex UltraScale+ XCKU5P FPGA + NVIDIA Jetson Orin Nano/NX SoM, interconnected via PCIe Gen3 x4 high-speed interface. The underlying FPGA handles image acquisition and preprocessing, while Jetson GPU performs local AI inference, enabling "acquisition and analysis in one" without the need for an industrial computer.

4-Channel CXP-12 Full Bandwidth Acquisition + PoCXP Power Supply

Supports 4 independent CoaXPress v2.1 camera inputs with a maximum speed of 12.5Gbps per channel (total aggregate bandwidth 50Gbps). Built-in PoCXP circuit delivers 17W 24V power per channel, transmitting data and power simultaneously over a single coaxial cable to simplify device wiring.

Complete Embedded Edge AI Software Stack

Pre-installed with Ubuntu 22.04 + NVIDIA JetPack SDK, including CUDA, cuDNN, TensorRT, OpenCV, and ROS support. Natively supports PyTorch/ONNX inference, paired with Aravis/GenTL standard SDKs, and offers dual control modes: local ImgGrab graphical debugging and remote upper computer control.

High Energy Efficiency Edge Computing Capability

Powered by NVIDIA Ampere architecture GPU, covering 34~275 TOPS INT8 inference computing power with a wide power consumption range of 7W~75W. Available in 4GB/8GB/16GB memory versions with modular SoM + carrier board design, easy to integrate into different products for various edge lightweight scenarios.

Product Overview

JetKU-CXP1004 is an embedded 4-channel CoaXPress (CXP) AI computing platform designed for edge imaging AI applications in industrial and scientific research fields. Compliant with CoaXPress V2.1 standard, a single Micro-BNC coaxial cable can synchronously transmit high-speed images and PoCXP camera power, significantly reducing the wiring complexity of the entire system.

Onboard Xilinx Kintex UltraScale+ XCKU5P FPGA serves as the underlying image processing unit, paired with NVIDIA Jetson Orin series GPU modules. The FPGA side features 2GB DDR4 SDRAM  cache for real-time ISP, ROI cropping, and lossless compression. The Jetson side supports running deep learning models (e.g., object detection, semantic segmentation, speech recognition) locally without network dependency. Jetson Orin Nano is available in two versions: 4GB (34TOPS INT8, 7-25W) and 8GB (67TOPS INT8, 7-25W); Jetson Orin NX is available in two versions: 8GB (117TOPS INT8, 10-40W) and 16GB (157TOPS INT8, 10-40W). All models adopt NVIDIA Ampere architecture GPU, enabling all image algorithms, defect detection, and visual SLAM to run offline locally without an external industrial computer, highly suitable for lightweight edge scenarios such as mobile robots, medical devices, semiconductor AOI, and UAV airborne vision.

Core Features

  • Core Architecture: Xilinx Kintex UltraScale+ XCKU5P FPGA + NVIDIA Jetson Orin Nano/NX AI SoM dual-processor heterogeneous computing architecture
  • Acquisition Channels: 4 independent CoaXPress v2.1 inputs (backward compatible with CXP v2.0/v1.1), maximum 12.5Gbps per channel (CXP-12), total bandwidth 50Gbps
  • PoCXP Power Supply: 17W 24V regulated power supply per CXP interface with overcurrent/short-circuit automatic protection, no external camera power supply required
  • Internal Interconnection: PCIe Gen3 x4 connecting FPGA and Jetson, theoretical bandwidth 32Gbps for low-latency large data transmission
  • Onboard Cache: 2GB DDR4 SDRAM  on FPGA side; 4GB/8GB/16GB LPDDR4x on Jetson side (varies by model)
  • AI Computing Power (Ampere Architecture GPU): Orin Nano 4GB (34TOPS,7-25W) / Orin Nano 8GB (67TOPS,7-25W) / Orin NX 8GB (117TOPS,10-40W) / Orin NX 16GB (157TOPS,10-40W)
  • Edge AI Capability: Supports local execution of deep learning models without network dependency
  • Local Storage: Standard M.2 NVMe 256GB SSD, supporting large-capacity image recording and local AI model deployment
  • Industrial I/O: 26pin isolated industrial interface including RS485, differential LVDS, optocoupler isolated input/output, TTL encoder interface
  • Expansion Interfaces: 40Gbps QSFP+ optical port, Gigabit Ethernet, USB3.2, Type-C, DP display output, Mini PCIe wireless slot
  • System Environment: Pre-installed Ubuntu 22.04 LTS with complete NVIDIA JetPack AI software stack, supporting TensorRT accelerated inference
  • Mechanical Specifications: Embedded computing platform with dimensions 186.00mm × 173.30mm, powered by 12V DC single power supply

Complete Technical Specifications

Category Item Detailed Parameters
Core Computing Platform FPGA Unit Xilinx Kintex UltraScale+ XCKU5P, supporting user-defined logic development; 2GB DDR4 SDRAM  large-capacity image cache
AI Processing Unit • Jetson Orin Nano 4GB: 34TOPS INT8, 7-25W, 512-core Ampere GPU, max frequency 1020MHz (Entry-level robots/education/small AI devices)
• Jetson Orin Nano 8GB: 67TOPS INT8, 7-25W, 1024-core Ampere GPU, max frequency 1020MHz (Entry-level robots/education/small AI devices)
• Jetson Orin NX 8GB: 117TOPS INT8, 10-40W, 1024-core Ampere GPU, max frequency 1173MHz (Mid-range robots/Autonomous Mobile Robots (AMR))
• Jetson Orin NX 16GB: 157TOPS INT8, 10-40W, 1024-core Ampere GPU, max frequency 1173MHz (Mid-range robots/Autonomous Mobile Robots (AMR))
CoaXPress Camera Input Interface Standard CoaXPress v2.1 (backward compatible with v2.0 / v1.0 / v1.1); 4 Micro-BNC channels CH0~CH3
Downlink Speed Full range of CXP-1(1.25G)~CXP-12(12.5G), maximum 12.5Gbps per channel, total aggregate 50Gbps
PoCXP Power Supply Maximum 17W 24V DC per channel with overload/short-circuit protection, onboard 12V to 24V DC-DC converter
High-Speed Expansion Interfaces QSFP+ Optical Port 40Gbps high-speed optical fiber expansion, supporting multi-platform cascading and mass data offline export
Gigabit Ethernet Remote SSH debugging, network video stream transmission
USB3.2 Gen1 ×4 Peripheral, dongle, high-speed storage device connection
DP Display Output Local HD interface preview and debugging display
M.2 NVMe SSD 256GB standard configuration for local image storage and AI model deployment
Mini PCIe Slot Reserved for expanding 4G/5G/LTE/Wi-Fi wireless modules (not included in standard configuration)
General Industrial I/O (26pin DB26) Differential Bus 4 bidirectional LINE0-3 (RS485/422), 2 LVDS differential input/output
Isolated Circuits 2 optocoupler isolated inputs, 2 optocoupler isolated outputs, withstand voltage 30V
TTL GPIO 2 bidirectional 3.3V LVTTL, supporting encoder A/B/Z signal acquisition
Trigger Function 2 32-bit hardware timers, event trigger, position trigger, noise filtering
Software & Development Ecosystem Operating System Pre-installed on platform: Ubuntu 22.04 LTS; Upper computer: Windows7/10/11 64bit
AI Development Stack NVIDIA JetPack SDK (including CUDA Toolkit, cuDNN, TensorRT, OpenCV, GStreamer), supporting PyTorch/ONNX Runtime and ROS
Development SDK Aravis SDK, GenTL Producer, ImgGrab graphical client, complete Qt demonstration project
Power & Mechanical Power Input 12V DC input, 5A adapter recommended for powering the whole machine + PoCXP cameras
Dimensions & Heat Dissipation 186mm × 173.3mm with active fan cooling; industrial-grade wide temperature operation

NVIDIA Jetson Product Series Complete Parameter Reference (as of 2026)

Module Model AI Performance (TOPS) Power Consumption Range GPU Specification GPU Max Frequency Application Scenarios
Jetson AGX Orin Developer Kit 275 15W–60W 2048-core Ampere GPU, 64 Tensor Cores 1.3 GHz High-performance robots, autonomous driving, medical imaging
Jetson AGX Orin Industrial Edition 248 15W–75W 2048-core Ampere GPU, 64 Tensor Cores 1.2 GHz
Jetson AGX Orin 32GB 241 15W–60W 1792-core Ampere GPU, 56 Tensor Cores 1.3 GHz
Jetson Orin NX 16GB 157 10W–40W 1024-core Ampere GPU, 32 Tensor Cores 1173 MHz Mid-range robots, Autonomous Mobile Robots (AMR)
Jetson Orin NX 8GB 117 10W–40W 1024-core Ampere GPU, 32 Tensor Cores 1173 MHz
Jetson Orin Nano Super Developer Kit 67 7W–25W 1024-core Ampere GPU, 32 Tensor Cores 1020 MHz Entry-level robots, education, small AI devices
Jetson Orin Nano 8GB 67 7W–25W 1024-core Ampere GPU, 32 Tensor Cores 1020 MHz
Jetson Orin Nano 4GB 34 7W–25W 512-core Ampere GPU, 16 Tensor Cores 1020 MHz

Hardware Interface & Structure Drawings

Figure: Front Hardware Interface Labeled Diagram

Figure: Back Expansion Slot Interface Diagram

Figure: FPGA & Jetson Hardware Interconnection Architecture Diagram

Figure: Overall PCB Dimension Diagram

Software Architecture & Quick Start

Three Working Modes

  • Edge Local Computing Mode (Recommended): Jetson independently completes image acquisition, AI inference, and motion control without external PC; deploy algorithm scripts via SSH terminal and debug graphically with local ImgGrab.
  • Upper Computer Remote Control Mode: LAN PC connects to the computing platform remotely via GenTL/Aravis protocol for real-time image preview and camera parameter configuration, suitable for laboratory debugging.
  • Local GUI Mode: Connect to DP display directly and run ImgGrab on the platform's Linux desktop to debug the entire visual system independently without a computer.

Complete Software List

Category Software Name Usage Description
Underlying Driver HelloFPGA Runtime Driver Package Underlying hardware driver for computing platform, required for Windows upper computer
Upper Computer GUI ImgGrab Client Camera discovery, parameter configuration, image preview, video playback
Development SDK GenTL SDK Standard GenTL Producer cross-platform dynamic library
Aravis SDK Complete Aravis 0.10 development library adapted to computing platform with Qt demonstration project
Assistant Tool Device Manager Device information reading, online firmware upgrade

All software can be downloaded from the official website: https://img-grab.com/jszc

  1. Hardware Wiring: Connect 12V 5A power adapter, CXP camera coaxial cable, and DP display/Gigabit network cable for debugging if needed;
  2. Power On: The computing platform is pre-installed with Ubuntu system, which automatically initializes Jetson and FPGA on first boot;
  3. Debugging Mode Selection: Open ImgGrab on local display directly, or install driver + ImgGrab on same-network PC for remote connection;
  4. AI Algorithm Deployment: Upload models via SSH, complete quantization acceleration with TensorRT, and run visual detection process locally in real time.

Safety Precautions (Must Read)

  • Do NOT hot-plug CXP coaxial cables, FMC subcards, or power interfaces; operate hardware only after power off and electrostatic discharge;
  • Maximum PoCXP output per channel is 17W; total power consumption of four cameras shall not exceed the rated power of the adapter, otherwise overcurrent protection will trigger power-off;
  • Wear anti-static wristband before operation; the high-density FPGA and Jetson modules are extremely sensitive to ESD, which may cause permanent hardware damage;
  • 26pin I/O interface level is 3.3V LVTTL; DO NOT connect 5V/12V external signals directly, otherwise FPGA IO circuits will be burned;
  • Surface temperature of the whole machine can reach over 60℃ under full-load AI inference + 4-channel CXP-12 acquisition; touch and maintain only after power off and cooling;
  • Store in anti-static bag in dry and ventilated environment when idle for a long time; power on regularly to remove moisture and avoid chip damage from condensation.

Typical Edge AI Application Scenarios

AMR/AGV Mobile Robot Visual SLAM Navigation

Semiconductor Wafer Defect Online AOI Inspection

UAV Airborne Electro-Optical Pod Visual Recognition

Portable Ultrasound/Intraoperative CT Medical Edge Imaging Computing

High-Speed Object Appearance Real-Time Sorting on Flexible Production Lines

High-Speed Vision Offline Acquisition Platform for Universities/Research Labs

Purchase Method: