---
tags:
- object-detection
- instance-segmentation
- transformer
- detr
- npu
- qualcomm
- roboflow
- real-time
- edge-deployment
license: cc-by-nc-4.0
library_name: nexa-sdk
---
# RF-DETR-Seg-Preview-NPU

Run RF-DETR-Seg-Preview on Qualcomm NPU with nexaSDK.
## Quickstart

Install nexaSDK and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)

Activate your device with your access token:

```bash
nexa config set license '<access_token>'
```

Run the model locally in one line:

```bash
nexa infer NexaAI/rf-detr-seg-preview-npu
```
## Model Description

RF-DETR-Seg-Preview is a real-time object detection and instance segmentation model developed by Roboflow, built on a Transformer architecture. It is the first real-time model to exceed 60 AP on the COCO dataset, combining high accuracy with efficient inference.

The model pairs a DINOv2 visual backbone with a lightweight DETR design, giving it strong transfer-learning and domain-adaptation capabilities. Its end-to-end architecture eliminates the need for non-maximum suppression (NMS) and anchor boxes, simplifying the detection pipeline.

RF-DETR-Seg-Preview brings state-of-the-art detection and segmentation accuracy to real-time applications, making it well suited to edge deployments where both speed and precision are critical.
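
Because the architecture is end-to-end, post-processing reduces to thresholding per-query confidence scores; there is no NMS pass. The sketch below illustrates that idea in NumPy, assuming DETR-style raw outputs named `logits` with shape `(num_queries, num_classes)` and `boxes` with shape `(num_queries, 4)` in normalized `(cx, cy, w, h)` format. These names, shapes, and the box format are illustrative assumptions, not the exact tensors returned by nexaSDK.

```python
import numpy as np

def decode_detections(logits: np.ndarray, boxes: np.ndarray, score_threshold: float = 0.5):
    """Decode DETR-style set predictions by confidence thresholding alone (no NMS, no anchors).

    Assumed (hypothetical) shapes: logits (num_queries, num_classes),
    boxes (num_queries, 4) as normalized (cx, cy, w, h).
    """
    probs = 1.0 / (1.0 + np.exp(-logits))   # per-class sigmoid scores
    scores = probs.max(axis=-1)             # best score per query
    labels = probs.argmax(axis=-1)          # best class per query
    keep = scores >= score_threshold        # the only filtering step needed

    cx, cy, w, h = boxes[keep].T
    xyxy = np.stack([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2], axis=-1)
    return xyxy, labels[keep], scores[keep]
```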
## Features

- **High-Accuracy Real-Time Detection & Segmentation**: Achieves 60.5 mAP on the COCO dataset while maintaining real-time performance, suitable for applications that need both speed and accuracy.
- **Domain Adaptability**: The DINOv2 backbone enables cross-domain transfer learning, making the model suitable for complex scenarios such as aerial imagery and industrial applications.
- **Dynamic Resolution Support**: Supports multi-resolution training and inference, allowing precision/speed trade-offs without retraining.
- **Efficient Edge Deployment**: Optimized for Qualcomm NPU, providing fast, low-latency inference on resource-constrained edge devices.
- **End-to-End Architecture**: Eliminates the need for NMS and anchor boxes, simplifying the detection and segmentation pipeline.
- **Instance Segmentation**: Provides pixel-level segmentation masks for each detected object, enabling precise object boundary identification (see the sketch below).
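
As a rough illustration of the instance-segmentation output, the sketch below binarizes per-query mask logits and keeps only the queries retained by the confidence filter. The `(num_queries, mask_h, mask_w)` layout, the nearest-neighbour upsampling, and the 0.5 threshold are illustrative assumptions, not the exact nexaSDK output format.

```python
import numpy as np

def decode_masks(mask_logits: np.ndarray, keep: np.ndarray,
                 out_size: tuple[int, int], threshold: float = 0.5) -> np.ndarray:
    """Turn per-query mask logits into boolean instance masks.

    Assumed (hypothetical) layout: mask_logits (num_queries, mask_h, mask_w),
    `keep` a boolean array over queries from the detection-decoding step.
    """
    probs = 1.0 / (1.0 + np.exp(-mask_logits[keep]))      # per-pixel sigmoid
    out_h, out_w = out_size

    # Nearest-neighbour upsampling to the target image size; a real pipeline
    # would typically use bilinear interpolation instead.
    k, mh, mw = probs.shape
    rows = np.arange(out_h) * mh // out_h
    cols = np.arange(out_w) * mw // out_w
    upsampled = probs[:, rows[:, None], cols[None, :]]    # (k, out_h, out_w)

    return upsampled >= threshold                          # boolean masks
```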
## Use Cases

- **Real-Time Video Analysis**: Fast object detection and instance segmentation in image or video streams, suitable for autonomous driving, security monitoring, and surveillance systems.
- **Edge Device Deployment**: Lightweight design enables deployment on mobile devices, embedded systems, and other resource-constrained edge devices.
- **Autonomous Systems**: Detection and segmentation of pedestrians, vehicles, and other objects for autonomous navigation and robotics.
- **Custom Dataset Fine-Tuning**: Supports fine-tuning on custom datasets to meet specific application requirements (see the sketch after this list).
- **Production Environments**: Efficient deployment in production or research environments requiring real-time performance.
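
To make the fine-tuning use case concrete, here is a minimal sketch based on Roboflow's upstream `rfdetr` Python package (`pip install rfdetr`). The `RFDETRBase` class and the `train()` arguments follow the upstream examples; whether a segmentation-preview class is exposed, and the exact argument set, should be checked against the rfdetr documentation. The NPU package on this page is distributed for inference through nexaSDK, not for training.

```python
# Sketch only: fine-tuning with Roboflow's upstream rfdetr package
# (pip install rfdetr). Argument names follow the upstream examples and
# are assumptions here; they are not part of the NPU package itself.
from rfdetr import RFDETRBase  # swap in the segmentation-preview class per the rfdetr docs

model = RFDETRBase()

model.train(
    dataset_dir="path/to/coco_dataset",  # COCO-format dataset with train/valid splits
    epochs=10,
    batch_size=4,
    grad_accum_steps=4,
    lr=1e-4,
)
```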
## Inputs and Outputs

**Input:**

- Image path

**Output:**

- Detection results including object classes, bounding box coordinates, and confidence scores
- Instance segmentation masks for each detected object
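
As an illustration only, a single result record might look like the structure below; the field names, types, and the mask representation are assumptions for readability, not the actual nexaSDK output schema.

```python
from typing import List, TypedDict

class Detection(TypedDict):
    """Hypothetical record mirroring the fields listed above."""
    class_name: str
    score: float                 # confidence in [0, 1]
    bbox_xyxy: List[float]       # [x1, y1, x2, y2] in pixels
    mask: List[List[bool]]       # per-pixel instance mask

example: Detection = {
    "class_name": "person",
    "score": 0.93,
    "bbox_xyxy": [34.0, 50.0, 210.5, 388.0],
    "mask": [[False, True], [True, True]],  # toy 2x2 mask for illustration
}
```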
## License

All NPU-related components of this project are licensed under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.

Commercial licensing or usage rights must be obtained through a separate agreement. For inquiries regarding commercial use, please contact [email protected].