NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. β’ 4 items β’ Updated 15 days ago β’ 3
NVIDIA Jetson AGX Orin Collection Models optimized and bench-marked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. β’ 4 items β’ Updated 15 days ago β’ 2
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. β’ 5 items β’ Updated 19 days ago β’ 1
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. β’ 19 items β’ Updated 19 days ago β’ 1