AIVideoMagus Models

ONNX models used by AIVideoMagus for OCR, AI upscaling, face detection, segmentation, restoration, and enhancement workflows.

Optimized for: CoreML / Apple Neural Engine, ONNX Runtime, TensorRT, and DirectML.
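As a rough illustration of how the backends listed above map onto ONNX Runtime, the sketch below picks execution providers in a fixed preference order. The order itself, and the helper names `pick_providers` and `open_session`, are assumptions made for this example, not something AIVideoMagus is known to do.

```python
# Preference order is an assumption: accelerator-specific providers first, CPU last.
PREFERRED = [
    "TensorrtExecutionProvider",  # TensorRT (NVIDIA)
    "CoreMLExecutionProvider",    # CoreML / Apple Neural Engine
    "DmlExecutionProvider",       # DirectML (Windows)
    "CPUExecutionProvider",       # fallback
]


def pick_providers(available):
    """Return the preferred providers that are actually available, in order."""
    chosen = [p for p in PREFERRED if p in available]
    # Keep a CPU fallback so session creation does not depend on an accelerator.
    if "CPUExecutionProvider" not in chosen:
        chosen.append("CPUExecutionProvider")
    return chosen


def open_session(model_path):
    """Create an ONNX Runtime session for one of the model files."""
    import onnxruntime as ort  # imported lazily; needs the onnxruntime package

    providers = pick_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

Calling `open_session("RealESRGAN_x4_fp16.onnx")` would then use whichever of these providers the installed ONNX Runtime build exposes.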

PP-OCRv5 Mobile Detection
Lightweight OCR text detector optimized for mobile devices.
File: PP-OCRv5_mobile_det_infer.onnx
Best For:
Watermarks, subtitles, UI text, screenshots, signs.
Download
PP-OCRv5 Mobile Recognition
Text recognition model that reads the text regions found by the detection model.
File: PP-OCRv5_mobile_rec_infer.onnx
Tip:
Pair with the detection model for full OCR workflows.
Download
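The detection and recognition models form a two-stage pipeline: find text boxes, then read each cropped box. The sketch below shows only that control flow. `detect` and `recognize` are hypothetical callables standing in for the two models, and their pre- and post-processing is not shown.

```python
def crop(image, box):
    """Crop a row-major image (list of rows) to box = (x0, y0, x1, y1)."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def run_ocr(image, detect, recognize):
    """Two-stage OCR: detect text boxes, then recognize each cropped region.

    `detect` and `recognize` are hypothetical wrappers around the detection
    and recognition models; they are passed in so this sketch stays generic.
    """
    results = []
    for box in detect(image):
        text = recognize(crop(image, box))
        results.append((box, text))
    return results
```

Each result pairs a box with its recognized text, which is what a watermark or subtitle workflow would typically need downstream.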
RealESRGAN x2
Moderate 2x AI upscaler balancing speed and detail recovery.
File: RealESRGAN_x2.onnx
Download
RealESRGAN x2 FP16
FP16-optimized 2x upscaler for faster inference.
File: RealESRGAN_x2_fp16.onnx
Download
RealESRGAN x2plus FP16
Higher-quality 2x restoration with improved texture synthesis.
File: RealESRGAN_x2plus.fp16.onnx
Download
RealESRGAN x4
Standard 4x AI enhancement model.
File: RealESRGAN_x4.onnx
Download
RealESRGAN x4 FP16
FP16-optimized 4x upscaler for faster hardware-accelerated inference.
File: RealESRGAN_x4_fp16.onnx
Download
RealESRGAN x4plus FP16
Premium 4x restoration model with stronger detail synthesis.
File: RealESRGAN_x4plus.fp16.onnx
Download
RealESRGAN x8
Extreme 8x enhancement model for aggressive enlargement.
File: RealESRGAN_x8.onnx
Download
RealESRGAN x8 FP16
FP16-optimized version of the x8 enhancement model.
File: RealESRGAN_x8_fp16.onnx
Download
Real-ESRGAN x4plus
Original Real-ESRGAN x4plus architecture widely used in restoration pipelines.
File: Real-ESRGAN-x4plus.onnx
Download
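To compare the upscalers above, it can help to work out what each one produces for a given frame. The sketch below reads the scale factor and precision from the file name and estimates the output size. It assumes the output tensor uses the same precision as the file name suggests (2 bytes per value for FP16, 4 for FP32), which may not hold for every export. The helper names are illustrative.

```python
import re


def describe_upscaler(filename):
    """Read the scale factor and bytes per value from a model file name."""
    match = re.search(r"[_-]x(\d+)", filename)
    if match is None:
        raise ValueError(f"no scale factor in {filename!r}")
    scale = int(match.group(1))
    # File names containing "fp16" are treated as half precision (assumption).
    bytes_per_value = 2 if "fp16" in filename.lower() else 4
    return scale, bytes_per_value


def output_size(filename, width, height, channels=3):
    """Return (out_width, out_height, out_bytes) for one upscaled frame."""
    scale, bytes_per_value = describe_upscaler(filename)
    out_w, out_h = width * scale, height * scale
    return out_w, out_h, out_w * out_h * channels * bytes_per_value
```

For example, a 1920x1080 frame passed through an x4 model comes out at 7680x4320, and the FP16 variant needs half the bytes of the FP32 one for the same output.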
YOLOv8 Face
Fast face detection model for masking, tracking, enhancement, and preprocessing pipelines.
File: yolov8m-face.onnx
Best For:
Real-time face detection and blur targeting.
Download
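For blur targeting, a detected face box is usually padded a little and clamped to the frame before the blur is applied. The sketch below shows one way to do that. The function name and the default `pad_ratio` are hypothetical, and decoding the model's raw output into boxes is not shown.

```python
def blur_region(box, frame_width, frame_height, pad_ratio=0.25):
    """Expand a face box by pad_ratio on each side and clamp it to the frame.

    box is (x0, y0, x1, y1) in pixels; pad_ratio is a hypothetical default.
    """
    x0, y0, x1, y1 = box
    pad_x = int((x1 - x0) * pad_ratio)
    pad_y = int((y1 - y0) * pad_ratio)
    return (
        max(0, x0 - pad_x),
        max(0, y0 - pad_y),
        min(frame_width, x1 + pad_x),
        min(frame_height, y1 + pad_y),
    )
```

Padding keeps hairlines and chins inside the blurred area, and clamping avoids out-of-range slices for faces near the frame edge.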
YOLOv11 Face Models
Newer-generation face detectors with improved robustness, tracking consistency, and accuracy.
Files:
yolov11n-face.onnx
yolov11s-face.onnx
yolov11m-face.onnx
yolov11l-face.onnx
Variants:
n = Nano / fastest
s = Small
m = Medium
l = Large / highest accuracy
Download: Nano, Small, Medium, Large
RVM MobileNetV3 Models
Real-time video matting models based on Robust Video Matting with a MobileNetV3 backbone for fast background removal, alpha extraction, and green-screen style compositing.
Files:
rvm_mobilenetv3_fp32.onnx
rvm_mobilenetv3_fp16.onnx
Variants:
FP32 = Full precision / highest quality
FP16 = Half precision / faster, lower VRAM usage
Download: FP32, FP16
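Green-screen style compositing with a matting model comes down to the standard alpha blend, out = alpha * foreground + (1 - alpha) * background. The sketch below applies it to plain Python lists so it stays dependency-free. The frame layout (rows of RGB tuples in [0, 1]) is an assumption for illustration, and a real pipeline would work on arrays taken from the model's outputs.

```python
def composite_pixel(fg, bg, alpha):
    """Blend one RGB pixel: out = alpha * fg + (1 - alpha) * bg."""
    return tuple(alpha * f + (1.0 - alpha) * b for f, b in zip(fg, bg))


def composite(fg_frame, bg_frame, alpha_frame):
    """Composite a foreground frame over a background using an alpha matte.

    Frames are lists of rows of RGB tuples with values in [0, 1];
    alpha_frame holds one float per pixel.
    """
    return [
        [composite_pixel(f, b, a) for f, b, a in zip(f_row, b_row, a_row)]
        for f_row, b_row, a_row in zip(fg_frame, bg_frame, alpha_frame)
    ]
```

An alpha of 1 keeps the foreground pixel, 0 keeps the background, and values in between produce the soft edges around hair and motion blur.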
LaMa Inpainting Models
Image inpainting models for object removal, watermark cleanup, hole filling, and scene reconstruction. LaMa uses fast Fourier convolutions, which give it an image-wide receptive field and keep large filled areas consistent with their surroundings.
Files:
lama_fp32.onnx
lama_fp16.onnx
Variants:
FP32 = Full precision / best reconstruction quality
FP16 = Half precision / faster inference, reduced memory usage
Download: FP32, FP16
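Inpainting needs a mask that marks which pixels to fill, for example the boxes around a watermark. The sketch below builds such a mask from pixel boxes. It assumes the common LaMa convention that 1 marks pixels to fill and 0 marks pixels to keep, so check the exported model's expected input before relying on it. `boxes_to_mask` is an illustrative helper name.

```python
def boxes_to_mask(boxes, width, height):
    """Build a binary mask (list of rows) with 1 inside every box.

    Assumes 1 = fill, 0 = keep; boxes are (x0, y0, x1, y1) in pixels and
    are clipped to the image bounds.
    """
    mask = [[0] * width for _ in range(height)]
    for x0, y0, x1, y1 in boxes:
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                mask[y][x] = 1
    return mask
```

Boxes that extend past the frame are clipped instead of raising an error, which is convenient when the boxes come from a padded detection.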