Homepage ꄲ FPGA Development Board ꄲ JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

ꁆ ꁇ

JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

● Core Platform: Xilinx Kintex UltraScale+ Industrial FPGA + NVIDJetson Orin Heterogeneous AI SoM
● Channel Specs: 4 independent CXP-12 channels, 12.5Gbps per lane, total bandwidth up to 50Gbps
● 40GQSFP+ optical port, 26-pin multi-function I/O
● Edge AI Performance: Up to 157 TOPS INT8 offline local inference, no external
● Applications: Industrial AOI inspection, scientific high-speed imaging, automation control, semiconductor test, mobile robot, medical & UAV onboard vision

JetKU-CXP1004 Embedded Jetson CoaXPress AI Computing Platform

Xilinx UltraScale+ FPGA + NVIDIA Jetson Heterogeneous Architecture | 4-Channel CXP-12 Edge AI Computing All-in-One | Integrated PoCXP Power Supply & Local Inference

Document Download

JetKU-CXP1004 4-Channel CoaXPress Frame Grabber User Manual.pdf

Four Core Advantages

FPGA+Jetson Dual-Core Heterogeneous Computing Architecture

Equipped with Xilinx Kintex UltraScale+ XCKU5P FPGA + NVIDIA Jetson Orin Nano/NX SoM, interconnected via PCIe Gen3 x4 high-speed interface. The underlying FPGA handles image acquisition and preprocessing, while Jetson GPU performs local AI inference, enabling "acquisition and analysis in one" without the need for an industrial computer.

4-Channel CXP-12 Full Bandwidth Acquisition + PoCXP Power Supply

Supports 4 independent CoaXPress v2.1 camera inputs with a maximum speed of 12.5Gbps per channel (total aggregate bandwidth 50Gbps). Built-in PoCXP circuit delivers 17W 24V power per channel, transmitting data and power simultaneously over a single coaxial cable to simplify device wiring.

Complete Embedded Edge AI Software Stack

Pre-installed with Ubuntu 22.04 + NVIDIA JetPack SDK, including CUDA, cuDNN, TensorRT, OpenCV, and ROS support. Natively supports PyTorch/ONNX inference, paired with Aravis/GenTL standard SDKs, and offers dual control modes: local ImgGrab graphical debugging and remote upper computer control.

High Energy Efficiency Edge Computing Capability

Powered by NVIDIA Ampere architecture GPU, covering 34~275 TOPS INT8 inference computing power with a wide power consumption range of 7W~75W. Available in 4GB/8GB/16GB memory versions with modular SoM + carrier board design, easy to integrate into different products for various edge lightweight scenarios.

Product Overview

JetKU-CXP1004 is an embedded 4-channel CoaXPress (CXP) AI computing platform designed for edge imaging AI applications in industrial and scientific research fields. Compliant with CoaXPress V2.1 standard, a single Micro-BNC coaxial cable can synchronously transmit high-speed images and PoCXP camera power, significantly reducing the wiring complexity of the entire system.

Onboard Xilinx Kintex UltraScale+ XCKU5P FPGA serves as the underlying image processing unit, paired with NVIDIA Jetson Orin series GPU modules. The FPGA side features 2GB DDR4 SDRAM cache for real-time ISP, ROI cropping, and lossless compression. The Jetson side supports running deep learning models (e.g., object detection, semantic segmentation, speech recognition) locally without network dependency. Jetson Orin Nano is available in two versions: 4GB (34TOPS INT8, 7-25W) and 8GB (67TOPS INT8, 7-25W); Jetson Orin NX is available in two versions: 8GB (117TOPS INT8, 10-40W) and 16GB (157TOPS INT8, 10-40W). All models adopt NVIDIA Ampere architecture GPU, enabling all image algorithms, defect detection, and visual SLAM to run offline locally without an external industrial computer, highly suitable for lightweight edge scenarios such as mobile robots, medical devices, semiconductor AOI, and UAV airborne vision.

Core Features

Core Architecture: Xilinx Kintex UltraScale+ XCKU5P FPGA + NVIDIA Jetson Orin Nano/NX AI SoM dual-processor heterogeneous computing architecture
Acquisition Channels: 4 independent CoaXPress v2.1 inputs (backward compatible with CXP v2.0/v1.1), maximum 12.5Gbps per channel (CXP-12), total bandwidth 50Gbps
PoCXP Power Supply: 17W 24V regulated power supply per CXP interface with overcurrent/short-circuit automatic protection, no external camera power supply required
Internal Interconnection: PCIe Gen3 x4 connecting FPGA and Jetson, theoretical bandwidth 32Gbps for low-latency large data transmission
Onboard Cache: 2GB DDR4 SDRAM on FPGA side; 4GB/8GB/16GB LPDDR4x on Jetson side (varies by model)
AI Computing Power (Ampere Architecture GPU): Orin Nano 4GB (34TOPS,7-25W) / Orin Nano 8GB (67TOPS,7-25W) / Orin NX 8GB (117TOPS,10-40W) / Orin NX 16GB (157TOPS,10-40W)
Edge AI Capability: Supports local execution of deep learning models without network dependency
Local Storage: Standard M.2 NVMe 256GB SSD, supporting large-capacity image recording and local AI model deployment
Industrial I/O: 26pin isolated industrial interface including RS485, differential LVDS, optocoupler isolated input/output, TTL encoder interface
Expansion Interfaces: 40Gbps QSFP+ optical port, Gigabit Ethernet, USB3.2, Type-C, DP display output, Mini PCIe wireless slot
System Environment: Pre-installed Ubuntu 22.04 LTS with complete NVIDIA JetPack AI software stack, supporting TensorRT accelerated inference
Mechanical Specifications: Embedded computing platform with dimensions 186.00mm × 173.30mm, powered by 12V DC single power supply

Complete Technical Specifications

Category	Item	Detailed Parameters
Core Computing Platform	FPGA Unit	Xilinx Kintex UltraScale+ XCKU5P, supporting user-defined logic development; 2GB DDR4 SDRAM large-capacity image cache
Core Computing Platform	AI Processing Unit	• Jetson Orin Nano 4GB: 34TOPS INT8, 7-25W, 512-core Ampere GPU, max frequency 1020MHz (Entry-level robots/education/small AI devices) • Jetson Orin Nano 8GB: 67TOPS INT8, 7-25W, 1024-core Ampere GPU, max frequency 1020MHz (Entry-level robots/education/small AI devices) • Jetson Orin NX 8GB: 117TOPS INT8, 10-40W, 1024-core Ampere GPU, max frequency 1173MHz (Mid-range robots/Autonomous Mobile Robots (AMR)) • Jetson Orin NX 16GB: 157TOPS INT8, 10-40W, 1024-core Ampere GPU, max frequency 1173MHz (Mid-range robots/Autonomous Mobile Robots (AMR))
CoaXPress Camera Input	Interface Standard	CoaXPress v2.1 (backward compatible with v2.0 / v1.0 / v1.1); 4 Micro-BNC channels CH0~CH3
	Downlink Speed	Full range of CXP-1(1.25G)~CXP-12(12.5G), maximum 12.5Gbps per channel, total aggregate 50Gbps
	PoCXP Power Supply	Maximum 17W 24V DC per channel with overload/short-circuit protection, onboard 12V to 24V DC-DC converter
High-Speed Expansion Interfaces	QSFP+ Optical Port	40Gbps high-speed optical fiber expansion, supporting multi-platform cascading and mass data offline export
	Gigabit Ethernet	Remote SSH debugging, network video stream transmission
	USB3.2 Gen1 ×4	Peripheral, dongle, high-speed storage device connection
	DP Display Output	Local HD interface preview and debugging display
	M.2 NVMe SSD	256GB standard configuration for local image storage and AI model deployment
	Mini PCIe Slot	Reserved for expanding 4G/5G/LTE/Wi-Fi wireless modules (not included in standard configuration)
General Industrial I/O (26pin DB26)	Differential Bus	4 bidirectional LINE0-3 (RS485/422), 2 LVDS differential input/output
	Isolated Circuits	2 optocoupler isolated inputs, 2 optocoupler isolated outputs, withstand voltage 30V
	TTL GPIO	2 bidirectional 3.3V LVTTL, supporting encoder A/B/Z signal acquisition
	Trigger Function	2 32-bit hardware timers, event trigger, position trigger, noise filtering
Software & Development Ecosystem	Operating System	Pre-installed on platform: Ubuntu 22.04 LTS; Upper computer: Windows7/10/11 64bit
	AI Development Stack	NVIDIA JetPack SDK (including CUDA Toolkit, cuDNN, TensorRT, OpenCV, GStreamer), supporting PyTorch/ONNX Runtime and ROS
	Development SDK	Aravis SDK, GenTL Producer, ImgGrab graphical client, complete Qt demonstration project
Power & Mechanical	Power Input	12V DC input, 5A adapter recommended for powering the whole machine + PoCXP cameras
Power & Mechanical	Dimensions & Heat Dissipation	186mm × 173.3mm with active fan cooling; industrial-grade wide temperature operation

NVIDIA Jetson Product Series Complete Parameter Reference (as of 2026)

Module Model	AI Performance (TOPS)	Power Consumption Range	GPU Specification	GPU Max Frequency	Application Scenarios
Jetson AGX Orin Developer Kit	275	15W–60W	2048-core Ampere GPU, 64 Tensor Cores	1.3 GHz	High-performance robots, autonomous driving, medical imaging
Jetson AGX Orin Industrial Edition	248	15W–75W	2048-core Ampere GPU, 64 Tensor Cores	1.2 GHz
Jetson AGX Orin 32GB	241	15W–60W	1792-core Ampere GPU, 56 Tensor Cores	1.3 GHz
Jetson Orin NX 16GB	157	10W–40W	1024-core Ampere GPU, 32 Tensor Cores	1173 MHz	Mid-range robots, Autonomous Mobile Robots (AMR)
Jetson Orin NX 8GB	117	10W–40W	1024-core Ampere GPU, 32 Tensor Cores	1173 MHz	Mid-range robots, Autonomous Mobile Robots (AMR)
Jetson Orin Nano Super Developer Kit	67	7W–25W	1024-core Ampere GPU, 32 Tensor Cores	1020 MHz	Entry-level robots, education, small AI devices
Jetson Orin Nano 8GB	67	7W–25W	1024-core Ampere GPU, 32 Tensor Cores	1020 MHz
Jetson Orin Nano 4GB	34	7W–25W	512-core Ampere GPU, 16 Tensor Cores	1020 MHz

Hardware Interface & Structure Drawings

Figure: Front Hardware Interface Labeled Diagram

Figure: Back Expansion Slot Interface Diagram

Figure: FPGA & Jetson Hardware Interconnection Architecture Diagram

Figure: Overall PCB Dimension Diagram

Software Architecture & Quick Start

Three Working Modes

Edge Local Computing Mode (Recommended): Jetson independently completes image acquisition, AI inference, and motion control without external PC; deploy algorithm scripts via SSH terminal and debug graphically with local ImgGrab.
Upper Computer Remote Control Mode: LAN PC connects to the computing platform remotely via GenTL/Aravis protocol for real-time image preview and camera parameter configuration, suitable for laboratory debugging.
Local GUI Mode: Connect to DP display directly and run ImgGrab on the platform's Linux desktop to debug the entire visual system independently without a computer.

Complete Software List

Category	Software Name	Usage Description
Underlying Driver	HelloFPGA Runtime Driver Package	Underlying hardware driver for computing platform, required for Windows upper computer
Upper Computer GUI	ImgGrab Client	Camera discovery, parameter configuration, image preview, video playback
Development SDK	GenTL SDK	Standard GenTL Producer cross-platform dynamic library
Development SDK	Aravis SDK	Complete Aravis 0.10 development library adapted to computing platform with Qt demonstration project
Assistant Tool	Device Manager	Device information reading, online firmware upgrade

All software can be downloaded from the official website: https://img-grab.com/jszc

Hardware Wiring: Connect 12V 5A power adapter, CXP camera coaxial cable, and DP display/Gigabit network cable for debugging if needed;
Power On: The computing platform is pre-installed with Ubuntu system, which automatically initializes Jetson and FPGA on first boot;
Debugging Mode Selection: Open ImgGrab on local display directly, or install driver + ImgGrab on same-network PC for remote connection;
AI Algorithm Deployment: Upload models via SSH, complete quantization acceleration with TensorRT, and run visual detection process locally in real time.

Safety Precautions (Must Read)

Do NOT hot-plug CXP coaxial cables, FMC subcards, or power interfaces; operate hardware only after power off and electrostatic discharge;
Maximum PoCXP output per channel is 17W; total power consumption of four cameras shall not exceed the rated power of the adapter, otherwise overcurrent protection will trigger power-off;
Wear anti-static wristband before operation; the high-density FPGA and Jetson modules are extremely sensitive to ESD, which may cause permanent hardware damage;
26pin I/O interface level is 3.3V LVTTL; DO NOT connect 5V/12V external signals directly, otherwise FPGA IO circuits will be burned;
Surface temperature of the whole machine can reach over 60℃ under full-load AI inference + 4-channel CXP-12 acquisition; touch and maintain only after power off and cooling;
Store in anti-static bag in dry and ventilated environment when idle for a long time; power on regularly to remove moisture and avoid chip damage from condensation.

Typical Edge AI Application Scenarios

AMR/AGV Mobile Robot Visual SLAM Navigation

Semiconductor Wafer Defect Online AOI Inspection

UAV Airborne Electro-Optical Pod Visual Recognition

Portable Ultrasound/Intraoperative CT Medical Edge Imaging Computing

High-Speed Object Appearance Real-Time Sorting on Flexible Production Lines

High-Speed Vision Offline Acquisition Platform for Universities/Research Labs

Purchase Method: