ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

The collection of pre-trained, state-of-the-art AI models.

About ailia SDK

ailia SDK is a self-contained, cross-platform, high-speed inference SDK for AI. It provides a consistent C++ API across Windows, Mac, Linux, iOS, Android, Jetson, and Raspberry Pi platforms. It also supports Unity (C#), Python, Rust, Flutter (Dart), and JNI for efficient AI implementation. The ailia SDK makes extensive use of the GPU through Vulkan and Metal to enable accelerated computing.

How to use

Try now on Google Colaboratory

If you would like to try on your computer:

ailia MODELS tutorial

ailia MODELS tutorial (Japanese version)

Documentation

ailia-models wiki

Supported models

403 models as of March 12, 2026

Latest update

  • 2026.03.12 Add depth_anything_v3, depth_pro

  • 2026.03.06 Add depth_anything_v2

  • 2026.03.04 Add gpt-sovits-v2-pro, bevformer, uniad

  • 2026.03.02 Add g2pw, gpt-sovits-v1, v2, v3 (chinese)

  • 2026.01.16 Add embeddinggemma

  • 2025.12.30 Add demucs, latentsync

  • 2025.12.26 Add sadtalker

  • 2025.12.25 Add samurai, cotracker3 (ailia SDK 1.6.1)

  • 2025.12.21 Add silerovad v5, v6, v6_2

  • 2025.12.17 Add sensevoice, cosyvoice2

  • 2025.12.01 Add glass, mobilevlm, donut

  • More information in our Wiki

Action recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| va-cnn | View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition | Pytorch | 1.2.7 and later | Mar 2017 | |
| st-gcn | ST-GCN | Pytorch | 1.2.5 and later | Jan 2018 | EN JP |
| mars | MARS: Motion-Augmented RGB Stream for Action Recognition | Pytorch | 1.2.4 and later | Nov 2018 | EN JP |
| ax_action_recognition | Realtime-Action-Recognition | Pytorch | 1.2.7 and later | Mar 2019 | |
| driver-action-recognition-adas | driver-action-recognition-adas-0002 | OpenVINO | 1.2.5 and later | Mar 2019 | |
| action_clip | ActionCLIP | Pytorch | 1.2.7 and later | Sep 2021 | |
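The "Supported Ailia Version" column in these tables gives a minimum SDK version, written as a dotted number followed by "and later" (some entries have four components, such as "1.2.14.1 and later"). As a minimal sketch, assuming you already have your installed version as a string, the check below compares the two numerically. The helper names `parse_min_version` and `is_supported` are hypothetical and are not part of the ailia SDK.

```python
import re


def parse_min_version(cell):
    """Extract the leading dotted version from a cell such as '1.2.14.1 and later'."""
    match = re.match(r"\s*(\d+(?:\.\d+)*)", cell)
    if match is None:
        raise ValueError(f"no version number found in {cell!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def is_supported(installed, cell):
    """Return True if the installed version meets the minimum stated in the table cell."""
    required = parse_min_version(cell)
    have = parse_min_version(installed)
    # Pad the shorter tuple with zeros so that 1.2.14 compares as 1.2.14.0.
    width = max(len(required), len(have))
    required += (0,) * (width - len(required))
    have += (0,) * (width - len(have))
    return have >= required


if __name__ == "__main__":
    print(is_supported("1.2.16", "1.2.7 and later"))
    print(is_supported("1.2.14", "1.2.14.1 and later"))
```

Comparing integer tuples rather than raw strings matters here because a plain string comparison would rank "1.2.9" above "1.2.10".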

Anomaly detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| mahalanobisad | MahalanobisAD-pytorch | Pytorch | 1.2.9 and later | May 2020 | |
| spade-pytorch | Sub-Image Anomaly Detection with Deep Pyramid Correspondences | Pytorch | 1.2.6 and later | May 2020 | |
| padim | PaDiM-Anomaly-Detection-Localization-master | Pytorch | 1.2.6 and later | Nov 2020 | EN JP |
| patchcore | PatchCore_anomaly_detection | Pytorch | 1.2.6 and later | Jun 2021 | |
| glass | A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization | Pytorch | 1.2.14 and later | Jul 2024 | |

Audio Language Model

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| qwen_audio | Qwen-Audio | Pytorch | 1.5.0 and later | Nov 2023 | JP |

Audio processing

Audio classification

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| crnn_audio_classification | crnn-audio-classification | Pytorch | 1.2.5 and later | Mar 2019 | EN JP |
| audioset_tagging_cnn | PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition | Pytorch | 1.2.9 and later | Dec 2019 | |
| transformer-cnn-emotion-recognition | Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders | Pytorch | 1.2.5 and later | Oct 2020 | |
| microsoft clap | CLAP | Pytorch | 1.2.11 and later | Jun 2022 | |
| clap | CLAP | Pytorch | 1.2.6 and later | Nov 2022 | JP |

Music enhancement

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| hifigan | HiFi-GAN | Pytorch | 1.2.9 and later | Oct 2020 | |
| deep music enhancer | On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks | Pytorch | 1.2.6 and later | Nov 2020 | |

Music generation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| pytorch_wavenet | pytorch_wavenet | Pytorch | 1.2.14 and later | Sep 2016 | |

Noise reduction

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| rnnoise | rnnoise | Keras | 1.2.15 and later | Sep 2017 | |
| voicefilter | VoiceFilter | Pytorch | 1.2.7 and later | Oct 2018 | EN JP |
| unet_source_separation | source_separation | Pytorch | 1.2.6 and later | Jul 2019 | EN JP |
| demucs | Demucs | Pytorch | 1.4.0 and later | Sep 2019 | |
| dtln | Dual-signal Transformation LSTM Network | Tensorflow | 1.3.0 and later | May 2020 | |
| audiosep | AudioSep | Pytorch | 1.3.0 and later | Aug 2023 | |

Phoneme alignment

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| narabas | narabas: Japanese phoneme forced alignment tool | Pytorch | 1.2.11 and later | Mar 2023 | |

Pitch detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| crepe | torchcrepe | Pytorch | 1.2.10 and later | Feb 2018 | JP |

Speaker diarization

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| pyannote-audio | Pyannote-audio | Pytorch | 1.2.15 and later | Nov 2019 | JP |
| auto_speech | AutoSpeech: Neural Architecture Search for Speaker Recognition | Pytorch | 1.2.5 and later | May 2020 | EN JP |
| wespeaker | WeSpeaker | Onnxruntime | 1.2.9 and later | Oct 2022 | |

Speech to text

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| deepspeech2 | deepspeech.pytorch | Pytorch | 1.2.2 and later | Oct 2017 | EN JP |
| whisper | Whisper | Pytorch | 1.2.10 and later | Dec 2022 | JP |
| reazon_speech | ReazonSpeech | Pytorch | 1.4.0 and later | Jan 2023 | |
| distil-whisper | Hugging Face - Distil-Whisper | Pytorch | 1.2.16 and later | Nov 2023 | |
| sensevoice | SenseVoice | Pytorch | 1.2.13 and later | Jul 2024 | JP |
| reazon_speech2 | ReazonSpeech2 | Pytorch | 1.4.0 and later | Feb 2024 | |
| kotoba-whisper | kotoba-whisper | Pytorch | 1.2.16 and later | Apr 2024 | |

Text to speech

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| pytorch-dc-tts | Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention | Pytorch | 1.2.6 and later | Oct 2017 | EN JP |
| tacotron2 | Tacotron2 | Pytorch | 1.2.15 and later | Feb 2018 | JP |
| vall-e-x | VALL-E-X | Pytorch | 1.2.15 and later | Mar 2023 | JP |
| Bert-VITS2 | Bert-VITS2 | Pytorch | 1.2.16 and later | Aug 2023 | |
| gpt-sovits | GPT-SoVITS | Pytorch | 1.4.0 and later | Feb 2024 | JP |
| gpt-sovits-v2 | GPT-SoVITS | Pytorch | 1.4.0 and later | Aug 2024 | |
| cosyvoice2 | CosyVoice2 | Pytorch | 1.4.0 and later | Dec 2024 | |
| gpt-sovits-v3 | GPT-SoVITS | Pytorch | 1.4.0 and later | Feb 2025 | |
| gpt-sovits-v2-pro | GPT-SoVITS | Pytorch | 1.4.0 and later | Jun 2025 | JP |

Voice activity detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| silero-vad | Silero VAD | Pytorch | 1.2.15 and later | Dec 2020 | JP |

Voice conversion

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| rvc | Retrieval-based-Voice-Conversion-WebUI | Pytorch | 1.2.12 and later | Mar 2023 | JP |

Autonomous driving

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bevformer | BEVFormer | Pytorch | 1.6.1 and later | Mar 2022 | |
| uniad | UniAD: Unified Driving | Pytorch | 1.6.1 and later | Dec 2022 | JP |

Background removal

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| deep-image-matting | Deep Image Matting | Keras | 1.2.3 and later | Mar 2017 | EN JP |
| indexnet | Indices Matter: Learning to Index for Deep Image Matting | Pytorch | 1.2.7 and later | Aug 2019 | |
| U-2-Net | U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Pytorch | 1.2.2 and later | May 2020 | EN JP |
| u2net-portrait-matting | U^2-Net - Portrait matting | Pytorch | 1.2.7 and later | May 2020 | |
| u2net-human-seg | U^2-Net - human segmentation | Pytorch | 1.2.4 and later | May 2020 | |
| cascade_psp | CascadePSP | Pytorch | 1.2.9 and later | May 2020 | |
| rembg | Rembg | Pytorch | 1.2.4 and later | Aug 2020 | |
| gfm | Bridging Composite and Real: Towards End-to-end Deep Image Matting | Pytorch | 1.2.10 and later | Oct 2020 | |
| modnet | MODNet: Trimap-Free Portrait Matting in Real Time | Pytorch | 1.2.7 and later | Nov 2020 | |
| background_matting_v2 | Real-Time High-Resolution Background Matting | Pytorch | 1.2.9 and later | Dec 2020 | |
| dis_seg | Highly Accurate Dichotomous Image Segmentation | Pytorch | 1.2.10 and later | Mar 2022 | |

Crowd counting

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| crowdcount-cascaded-mtl | CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting (Single Image Crowd Counting) | Pytorch | 1.2.1 and later | Jul 2017 | EN JP |
| c-3-framework | Crowd Counting Code Framework (C^3-Framework) | Pytorch | 1.2.5 and later | Jul 2019 | |

Deep fashion

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| fashionai-key-points-detection | A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection | Pytorch | 1.2.5 and later | Jun 2018 | |
| person-attributes-recognition-crossroad | person-attributes-recognition-crossroad-0230 | Pytorch | 1.2.10 and later | Oct 2018 | |
| clothing-detection | Clothing-Detection | Pytorch | 1.2.1 and later | Jun 2019 | EN JP |
| mmfashion | MMFashion | Pytorch | 1.2.5 and later | Nov 2019 | EN JP |
| mmfashion_tryon | MMFashion virtual try-on | Pytorch | 1.2.8 and later | Nov 2019 | |
| mmfashion_retrieval | MMFashion In-Shop Clothes Retrieval | Pytorch | 1.2.5 and later | Nov 2019 | |

Depth estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| fcrn-depthprediction | Deeper Depth Prediction with Fully Convolutional Residual Networks | TensorFlow | 1.2.6 and later | Jun 2016 | |
| monodepth2 | Monocular depth estimation from a single image | Pytorch | 1.2.2 and later | Jun 2018 | |
| fast-depth | ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems" | Pytorch | 1.2.5 and later | Mar 2019 | |
| midas | Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer | Pytorch | 1.2.4 and later | Jul 2019 | EN JP |
| hitnet | ONNX-HITNET-Stereo-Depth-estimation | Pytorch | 1.2.9 and later | Jul 2020 | |
| lap-depth | LapDepth-release | Pytorch | 1.2.9 and later | Jan 2021 | |
| mobilestereonet | MobileStereoNet | Pytorch | 1.2.13 and later | Aug 2021 | |
| crestereo | ONNX-CREStereo-Depth-Estimation | Pytorch | 1.2.13 and later | Mar 2022 | |
| zoe_depth | ZoeDepth | Pytorch | 1.3.0 and later | Feb 2023 | |
| depth_anything | DepthAnything | Pytorch | 1.2.9 and later | Jan 2024 | |
| depth_anything_v2 | Depth Anything V2 | Pytorch | 1.2.16 and later | Jun 2024 | |
| depth_pro | Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | Pytorch | 1.2.12 and later | Oct 2024 | |
| depth_anything_v3 | Depth Anything V3 | Pytorch | 1.2.16 and later | Nov 2025 | |

Diffusion

Text to image

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| latent-diffusion-txt2img | Latent Diffusion - txt2img | Pytorch | 1.2.10 and later | Dec 2021 | |
| stable-diffusion-txt2img | Stable Diffusion | Pytorch | 1.2.14 and later | Aug 2022 | JP |
| anything_v3 | Linaqruf/anything-v3.0 | Pytorch | 1.5.0 and later | Nov 2022 | |
| control_net | ControlNet | Pytorch | 1.2.15 and later | Feb 2023 | |
| latent-consistency-models | latent-consistency-models | Pytorch | 1.2.16 and later | Oct 2023 | |
| sd-turbo | Hugging Face - SD-Turbo | Pytorch | 1.2.16 and later | Nov 2023 | |
| sdxl-turbo | Hugging Face - SDXL-Turbo | Pytorch | 1.2.16 and later | Nov 2023 | |
| depth_anything_controlnet | DepthAnything | Pytorch | 1.2.16 and later | Jan 2024 | |
| latentsync | LatentSync | Pytorch | 1.4.0 and later | Dec 2024 | |

Text to audio

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| riffusion | Riffusion | Pytorch | 1.2.16 and later | Dec 2022 | |

Others

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| latent-diffusion-inpainting | Latent Diffusion - inpainting | Pytorch | 1.2.10 and later | Dec 2021 | |
| latent-diffusion-superresolution | Latent Diffusion - Super-resolution | Pytorch | 1.2.10 and later | Dec 2021 | |
| DA-CLIP | DA-CLIP | Pytorch | 1.2.16 and later | Oct 2023 | |
| marigold | Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Pytorch | 1.2.16 and later | Dec 2023 | |

Face detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| mtcnn | mtcnn | Keras | 1.2.10 and later | Apr 2016 | |
| yolov1-face | YOLO-Face-detection | Darknet | 1.1.0 and later | Mar 2017 | |
| face-detection-adas | face-detection-adas-0001 | OpenVINO | 1.2.5 and later | Oct 2018 | |
| retinaface | RetinaFace: Single-stage Dense Face Localisation in the Wild. | Pytorch | 1.2.5 and later | May 2019 | JP |
| blazeface | BlazeFace-PyTorch | Pytorch | 1.2.1 and later | Jul 2019 | EN JP |
| yolov3-face | Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | |
| face-mask-detection | Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | EN JP |
| dbface | DBFace: real-time, single-stage detector for face detection, with faster speed and higher accuracy | Pytorch | 1.2.2 and later | Mar 2020 | |
| anime-face-detector | Anime Face Detector | Pytorch | 1.2.6 and later | Oct 2021 | |

Face identification

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| facenet_pytorch | Face Recognition Using Pytorch | Pytorch | 1.2.6 and later | Mar 2015 | |
| insightface | InsightFace: 2D and 3D Face Analysis Project | Pytorch | 1.2.5 and later | Sep 2017 | |
| vggface2 | VGGFace2 Dataset for Face Recognition | Caffe | 1.1.0 and later | Oct 2017 | |
| arcface | pytorch implement of arcface | Pytorch | 1.2.1 and later | Jan 2018 | EN JP |
| cosface | Pytorch implementation of CosFace | Pytorch | 1.2.10 and later | Jan 2018 | |

Face recognition

Age gender estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| face_classification | Real-time face detection and emotion/gender classification | Keras | 1.1.0 and later | Oct 2017 | |
| age-gender-recognition-retail | age-gender-recognition-retail-0013 | OpenVINO | 1.2.5 and later | May 2018 | EN JP |
| mivolo | MiVOLO: Multi-input Transformer for Age and Gender Estimation | Pytorch | 1.2.13 and later | Jul 2023 | JP |

Emotion recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| ferplus | FER+ | CNTK | 1.2.2 and later | Aug 2016 | |
| hsemotion | HSEmotion (High-Speed face Emotion recognition) library | Pytorch | 1.2.5 and later | Mar 2021 | |

Gaze estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| gazeml | A deep learning framework based on Tensorflow for the training of high performance gaze estimation | TensorFlow | 1.2.0 and later | May 2018 | |
| mediapipe_iris | irislandmarks.pytorch | Pytorch | 1.2.2 and later | Jun 2020 | EN JP |
| gazelle | gazelle | Pytorch | 1.2.16 and later | Dec 2024 | JP |
| ax_gaze_estimation | ax Gaze Estimation | Pytorch | 1.2.2 and later | | EN JP |

Head pose estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| hopenet | deep-head-pose | Pytorch | 1.2.2 and later | Oct 2017 | EN JP |
| 6d_repnet | 6D Rotation Representation for Unconstrained Head Pose Estimation (Pytorch) | Pytorch | 1.2.6 and later | Feb 2022 | |
| L2CS_Net | L2CS_Net | Pytorch | 1.2.9 and later | Mar 2022 | |
| 6d_repnet_360 | Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation | Pytorch | 1.2.9 and later | Sep 2023 | |

Keypoint detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| face_alignment | 2D and 3D Face alignment library build using pytorch | Pytorch | 1.2.1 and later | Mar 2017 | EN JP |
| prnet | Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | TensorFlow | 1.2.2 and later | Mar 2018 | |
| facemesh | facemesh.pytorch | Pytorch | 1.2.2 and later | Jul 2019 | EN JP |
| facial_feature | kaggle-facial-keypoints | Pytorch | 1.2.0 and later | Oct 2019 | |
| 3ddfa | Towards Fast, Accurate and Stable 3D Dense Face Alignment | Pytorch | 1.2.10 and later | Sep 2020 | |
| facemesh_v2 | MediaPipe Face landmark detection | Pytorch | 1.2.9 and later | May 2023 | JP |

Others

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| face-anti-spoofing | Lightweight Face Anti Spoofing | Pytorch | 1.2.5 and later | Jul 2020 | EN JP |
| ax_facial_features | ax Facial Features | Pytorch | 1.2.5 and later | | EN |

Face restoration

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| gfpgan | GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior | Pytorch | 1.2.10 and later | Jan 2021 | JP |
| codeformer | CodeFormer: Towards Robust Blind Face Restoration with Codebook Lookup Transformer | Pytorch | 1.2.9 and later | Jun 2022 | |

Face swapping

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| deepfacelive | DeepFaceLive | ONNX Runtime | 1.2.10 and later | Dec 2020 | |
| sber-swap | SberSwap | Pytorch | 1.2.12 and later | Feb 2022 | JP |
| facefusion | FaceFusion | ONNX Runtime | 1.2.10 and later | Aug 2023 | |

Frame Interpolation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| cain | Channel Attention Is All You Need for Video Frame Interpolation | Pytorch | 1.2.5 and later | Nov 2019 | |
| rife | Real-Time Intermediate Flow Estimation for Video Frame Interpolation | Pytorch | 1.2.13 and later | Nov 2020 | |
| flavr | FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation | Pytorch | 1.2.7 and later | Dec 2020 | EN JP |
| film | FILM: Frame Interpolation for Large Motion | Tensorflow | 1.2.10 and later | Feb 2022 | |

Generative adversarial networks

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| pytorch-gan | Code repo for the Pytorch GAN Zoo project (used to train this model) | Pytorch | 1.2.4 and later | Oct 2017 | |
| lipgan | LipGAN | Keras | 1.2.15 and later | Oct 2019 | JP |
| council-gan | Council-GAN | Pytorch | 1.2.4 and later | Nov 2019 | |
| sam | Age Transformation Using a Style-Based Regression Model | Pytorch | 1.2.9 and later | Feb 2021 | |
| encoder4editing | Designing an Encoder for StyleGAN Image Manipulation | Pytorch | 1.2.10 and later | Feb 2021 | |
| restyle-encoder | ReStyle | Pytorch | 1.2.9 and later | Apr 2021 | |
| SadTalker | SadTalker | Pytorch | 1.5.0 and later | Nov 2022 | |
| live_portrait | LivePortrait | Pytorch | 1.5.0 and later | Jul 2024 | JP |

Hand detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| hand_detection_pytorch | hand-detection.PyTorch | Pytorch | 1.2.2 and later | Mar 2019 | |
| yolov3-hand | Hand detection branch of Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | |
| blazepalm | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 | |

Hand recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| hand3d | ColorHandPose3D network | TensorFlow | 1.2.5 and later | May 2017 | |
| v2v-posenet | V2V-PoseNet | Pytorch | 1.2.6 and later | Nov 2017 | |
| minimal-hand | Minimal Hand | TensorFlow | 1.2.8 and later | Mar 2020 | |
| blazehand | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 | EN JP |
| hands_segmentation_pytorch | hands-segmentation-pytorch | Pytorch | 1.2.10 and later | Apr 2021 | |

Image captioning

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| illustration2vec | Illustration2Vec | Caffe | 1.2.2 and later | Nov 2015 | |
| image_captioning_pytorch | Image Captioning pytorch | Pytorch | 1.2.5 and later | Dec 2016 | EN JP |
| blip2 | Hugging Face - BLIP-2 | Pytorch | 1.2.16 and later | Jan 2023 | |

Image classification

CNN

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| alexnet | AlexNet PyTorch | Pytorch | 1.2.5 and later | Sep 2012 | |
| vgg16 | Very Deep Convolutional Networks for Large-Scale Image Recognition | Keras | 1.1.0 and later | Sep 2014 | |
| googlenet | Going Deeper with Convolutions | Pytorch | 1.2.0 and later | Sep 2014 | |
| resnet18 | ResNet18 | Pytorch | 1.2.8 and later | Dec 2015 | |
| resnet50 | Deep Residual Learning for Image Recognition | Chainer | 1.2.0 and later | Dec 2015 | |
| inceptionv3 | Rethinking the Inception Architecture for Computer Vision | Pytorch | 1.2.0 and later | Dec 2015 | JP |
| inceptionv4 | Keras Inception-V4 | Keras | 1.2.5 and later | Feb 2016 | |
| wide_resnet50 | Wide Resnet | Pytorch | 1.2.5 and later | May 2016 | |
| mobilenetv2 | PyTorch Implemention of MobileNet V2 | Pytorch | 1.2.0 and later | Jan 2018 | |
| mobilenetv3 | PyTorch Implemention of MobileNet V3 | Pytorch | 1.2.1 and later | May 2019 | |
| efficientnet | A PyTorch implementation of EfficientNet | Pytorch | 1.2.3 and later | May 2019 | |
| efficientnetv2 | EfficientNetV2 | Pytorch | 1.2.4 and later | Apr 2021 | |
| imagenet21k | ImageNet21K | Pytorch | 1.2.11 and later | Apr 2021 | |
| mlp_mixer | MLP-Mixer | Pytorch | 1.2.9 and later | May 2021 | |
| volo | VOLO: Vision Outlooker for Visual Recognition | Pytorch | 1.2.9 and later | Jun 2021 | |
| convnext | A PyTorch implementation of ConvNeXt | Pytorch | 1.2.5 and later | Jan 2022 | |
| mobileone | A PyTorch implementation of MobileOne | Pytorch | 1.2.1 and later | Jun 2022 | |

Transformer

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| vit | Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) | Pytorch | 1.2.7 and later | Oct 2020 | EN JP |
| clip | CLIP | Pytorch | 1.2.9 and later | Feb 2021 | EN JP |
| swin-transformer | Swin Transformer | Pytorch | 1.2.6 and later | Mar 2021 | |
| japanese-clip | Japanese-CLIP | Pytorch | 1.2.15 and later | May 2022 | |
| japanese-stable-clip-vit-l-16 | japanese-stable-clip-vit-l-16 | Pytorch | 1.2.11 and later | Nov 2023 | |
| clip-japanese-base | line-corporation/clip-japanese-base | Pytorch | 1.2.16 and later | Apr 2024 | |
| siglip2 | Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Pytorch | 1.2.16 and later | Feb 2025 | JP |

Specific task

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| weather-prediction-from-image | Weather Prediction From Image - (Warmth Of Image) | Keras | 1.2.5 and later | Oct 2017 | |
| partialconv | Partial Convolution Layer for Padding and Image Inpainting | Pytorch | 1.2.0 and later | Nov 2018 | |

Image inpainting

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| inpainting-with-partial-conv | pytorch-inpainting-with-partial-conv | PyTorch | 1.2.6 and later | Apr 2018 | EN JP |
| deepfillv2 | Free-Form Image Inpainting with Gated Convolution | Pytorch | 1.2.9 and later | Jun 2018 | |
| inpainting_gmcnn | Image Inpainting via Generative Multi-column Convolutional Neural Networks | TensorFlow | 1.2.6 and later | Oct 2018 | |
| 3d-photo-inpainting | 3D Photography using Context-aware Layered Depth Inpainting | Pytorch | 1.2.7 and later | Apr 2020 | |
| lama | LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions | Pytorch | 1.2.13 and later | Sep 2021 | |

Image manipulation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| colorization | Colorful Image Colorization | Pytorch | 1.2.2 and later | Mar 2016 | EN JP |
| cnngeometric_pytorch | CNNGeometric PyTorch implementation | Pytorch | 1.2.7 and later | Mar 2017 | |
| style2paints | Style2Paints | TensorFlow | 1.2.6 and later | Jun 2017 | |
| deblur_gan | DeblurGAN | Pytorch | 1.2.6 and later | Nov 2017 | |
| pytorch-superpoint | pytorch-superpoint: Self-Supervised Interest Point Detection and Description | Pytorch | 1.2.6 and later | Dec 2017 | |
| noise2noise | Learning Image Restoration without Clean Data | Pytorch | 1.2.0 and later | Mar 2018 | |
| dfe | Deep Fundamental Matrix Estimation | Pytorch | 1.2.6 and later | Oct 2018 | |
| illnet | Document Rectification and Illumination Correction using a Patch-based CNN | Pytorch | 1.2.2 and later | Sep 2019 | |
| dewarpnet | DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks | Pytorch | 1.2.1 and later | Oct 2019 | |
| deep_white_balance | Deep White-Balance Editing, CVPR 2020 (Oral) | PyTorch | 1.2.6 and later | Apr 2020 | |
| u2net_portrait | U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Pytorch | 1.2.2 and later | May 2020 | |
| invertible_denoising_network | Invertible Image Denoising | Pytorch | 1.2.8 and later | Apr 2021 | |
| dfm | Deep Feature Matching | Pytorch | 1.2.6 and later | Jun 2021 | |
| fbcnn | Towards Flexible Blind JPEG Artifacts Removal | Pytorch | 1.2.9 and later | Sep 2021 | |
| dehamer | Image Dehazing Transformer with Transmission-Aware 3D Position Embedding | Pytorch | 1.2.13 and later | Jun 2022 | |
| lightglue | LightGlue-ONNX | Pytorch | 1.2.15 and later | Jun 2023 | |
| docshadow | DocShadow-ONNX-TensorRT | Pytorch | 1.2.10 and later | Aug 2023 | |

Image restoration

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| nafnet | NAFNet: Nonlinear Activation Free Network for Image Restoration | Pytorch | 1.2.10 and later | Mar 2022 | JP |

Image segmentation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| pytorch-fcn | pytorch-fcn | Pytorch | 1.3.0 and later | Nov 2014 | |
| pytorch-enet | PyTorch-ENet | Pytorch | 1.2.8 and later | Jun 2016 | |
| tusimple-DUC | TuSimple-DUC | Pytorch | 1.2.10 and later | Feb 2017 | |
| pytorch-unet | Pytorch-Unet | Pytorch | 1.2.5 and later | Aug 2017 | |
| deeplabv3 | Xception65 for backbone network of DeepLab v3+ | Chainer | 1.2.0 and later | Feb 2018 | |
| pspnet-hair-segmentation | pytorch-hair-segmentation | Pytorch | 1.2.2 and later | Nov 2018 | |
| swiftnet | SwiftNet | Pytorch | 1.2.6 and later | Mar 2019 | |
| hrnet_segmentation | High-resolution networks (HRNets) for Semantic Segmentation | Pytorch | 1.2.1 and later | Apr 2019 | |
| hair_segmentation | hair segmentation in mobile device | Keras | 1.2.1 and later | Jul 2019 | |
| paddleseg | PaddleSeg | Pytorch | 1.2.7 and later | Aug 2019 | EN JP |
| human_part_segmentation | Self Correction for Human Parsing | Pytorch | 1.2.4 and later | Oct 2019 | EN JP |
| semantic-segmentation-mobilenet-v3 | Semantic segmentation with MobileNetV3 | TensorFlow | 1.2.5 and later | Nov 2019 | |
| suim | SUIM | Keras | 1.2.6 and later | Apr 2020 | |
| yet-another-anime-segmenter | Yet Another Anime Segmenter | Pytorch | 1.2.6 and later | Oct 2020 | |
| dense_prediction_transformers | Vision Transformers for Dense Prediction | Pytorch | 1.2.7 and later | Mar 2021 | EN JP |
| group_vit | GroupViT | Pytorch | 1.2.10 and later | Feb 2022 | |
| pp_liteseg | PP-LiteSeg | Pytorch | 1.2.10 and later | Apr 2022 | |
| anime-segmentation | Anime Segmentation | Pytorch | 1.2.9 and later | Aug 2022 | |
| yolov8-seg | YOLOv8 | Pytorch | 1.2.14.1 and later | Jan 2023 | |
| segment-anything | Segment Anything | Pytorch | 1.2.16 and later | Apr 2023 | |
| grounded_sam | Grounded-SAM | Pytorch | 1.2.16 and later | Apr 2023 | |
| fast_sam | FastSAM | Pytorch | 1.2.14 and later | Jun 2023 | |
| mobile_sam | MobileSAM | Pytorch | 1.6.0 and later | Jun 2023 | |
| edge_sam | EdgeSAM | Pytorch | 1.2.10 and later | Dec 2023 | |
| segment-anything-2 | Segment Anything 2 | Pytorch | 1.2.16 and later | Jul 2024 | |
| yolov11-seg | Ultralytics YOLO11 | Pytorch | 1.2.14.1 and later | Sep 2024 | |

Landmark classification

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| places365 | Release of Places365-CNNs | Pytorch | 1.2.5 and later | Oct 2016 | |
| landmarks_classifier_asia | Landmarks classifier_asia_V1.1 | TensorFlow Hub | 1.2.4 and later | Apr 2020 | EN JP |

Line segment detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| dexined | DexiNed: Dense Extreme Inception Network for Edge Detection | Pytorch | 1.2.5 and later | Sep 2019 | |
| mlsd | M-LSD: Towards Light-weight and Real-time Line Segment Detection | TensorFlow | 1.2.8 and later | Jun 2021 | EN JP |

Low Light Image Enhancement

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| agllnet | AGLLNet: Attention Guided Low-light Image Enhancement (IJCV 2021) | Pytorch | 1.2.9 and later | Aug 2019 | EN JP |
| drbn_skf | DRBN SKF | Pytorch | 1.2.14 and later | Apr 2023 | |

Natural language processing

Bert

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert | pytorch-pretrained-bert | Pytorch | 1.2.2 and later | Oct 2018 | EN JP |
| bert_maskedlm | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| bert_question_answering | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |

Embedding

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| sentence_transformers_japanese | sentence transformers | Pytorch | 1.2.7 and later | Aug 2019 | JP |
| multilingual-e5 | multilingual-e5-base | Pytorch | 1.2.15 and later | Dec 2022 | JP |
| glucose | GLuCoSE (General Luke-based Contrastive Sentence Embedding)-base-Japanese | Pytorch | 1.2.15 and later | Jul 2023 | |
| ruri-v3 | ruri-v3-310m | Pytorch | 1.2.13 and later | Apr 2025 | |
| embeddinggemma | EmbeddingGemma | Pytorch | 1.2.14 and later | Sep 2025 | JP |

Error corrector

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert_insert_punctuation | bert-japanese | Pytorch | 1.2.15 and later | Nov 2019 | |
| bertjsc | bertjsc | Pytorch | 1.2.15 and later | Mar 2023 | |
| t5_whisper_medical | error correction of medical terms using t5 | Pytorch | 1.2.13 and later | | |

Grapheme to phoneme

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| g2p_en | g2p_en | Pytorch | 1.2.14 and later | Jan 2019 | JP |
| g2pw | g2pW | Pytorch | 1.2.9 and later | Mar 2022 | |
| soundchoice-g2p | Hugging Face - speechbrain/soundchoice-g2p | Pytorch | 1.2.16 and later | Jul 2022 | |

Named entity recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert_ner | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| t5_base_japanese_ner | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 | |
| bert_ner_japanese | jurabi/bert-ner-japanese | Pytorch | 1.2.10 and later | Mar 2023 | |

Reranker

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| cross_encoder_mmarco | jeffwan/mmarco-mMiniLMv2-L12-H384-v | Pytorch | 1.2.10 and later | Sep 2022 | JP |
| japanese-reranker-cross-encoder | hotchpotch/japanese-reranker-cross-encoder-large-v1 | Pytorch | 1.2.16 and later | Apr 2024 | |

Sentence generation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| gpt2 | GPT-2 | Pytorch | 1.2.7 and later | Feb 2019 | |
| rinna_gpt2 | japanese-pretrained-models | Pytorch | 1.2.7 and later | Apr 2021 | |

Sentiment analysis

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert_sentiment_analysis | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| bert_tweets_sentiment | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |

Summarize

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert_sum_ext | BERTSUMEXT | Pytorch | 1.2.7 and later | May 2019 | |
| presumm | PreSumm | Pytorch | 1.2.8 and later | Aug 2019 | |
| t5_base_japanese_title_generation | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 | JP |
| t5_base_summarization | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 | |

Translation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| fugumt-en-ja | Fugu-Machine Translator | Pytorch | 1.2.9 and later | Nov 2020 | JP |
| fugumt-ja-en | Fugu-Machine Translator | Pytorch | 1.2.10 and later | Nov 2020 | |

Zero shot classification

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert_zero_shot_classification | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| multilingual-minilmv2 | MoritzLaurer/multilingual-MiniLMv2-L12-mnli-xnli | Pytorch | 1.2.10 and later | Jun 2022 | |

Network intrusion detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| bert-network-packet-flow-header-payload | bert-network-packet-flow-header-payload | Pytorch | 1.2.10 and later | Sep 2023 | |
| falcon-adapter-network-packet | falcon-adapter-network-packet | Pytorch | 1.2.10 and later | Sep 2023 | |

Neural Rendering

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| nerf | NeRF: Neural Radiance Fields | Tensorflow | 1.2.10 and later | Mar 2020 | EN JP |
| TripoSR | TripoSR | Pytorch | 1.2.6 and later | Mar 2024 | |

NSFW detector

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| clip-based-nsfw-detector | CLIP-based-NSFW-Detector | Keras | 1.2.10 and later | Mar 2022 | JP |

Object detection

CNN

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| yolov1-tiny | YOLO: Real-Time Object Detection | Darknet | 1.1.0 and later | Jun 2015 | JP |
| yolov2 | YOLO: Real-Time Object Detection | Pytorch | 1.2.0 and later | Dec 2016 | |
| yolov2-tiny | YOLO: Real-Time Object Detection | Pytorch | 1.2.6 and later | Dec 2016 | |
| maskrcnn | Mask R-CNN: real-time neural network for object instance segmentation | Pytorch | 1.2.3 and later | Mar 2017 | |
| yolov3 | YOLO: Real-Time Object Detection | ONNX Runtime | 1.2.1 and later | Apr 2018 | EN JP |
| yolov3-tiny | YOLO: Real-Time Object Detection | ONNX Runtime | 1.2.1 and later | Apr 2018 | |
| mobilenet_ssd | MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch | Pytorch | 1.2.1 and later | Aug 2018 | EN JP |
| m2det | M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network | Pytorch | 1.2.3 and later | Nov 2018 | EN JP |
| centernet | CenterNet: Objects as Points | Pytorch | 1.2.1 and later | Apr 2019 | EN JP |
| yolact | You Only Look At CoefficienTs | Pytorch | 1.2.6 and later | Apr 2019 | |
| efficientdet | EfficientDet: Scalable and Efficient Object Detection, in PyTorch | Pytorch | 1.2.6 and later | Nov 2019 | |
| pedestrian_detection | Pedestrian-Detection-on-YOLOv3_Research-and-APP | Keras | 1.2.1 and later | Mar 2020 | |
| crowd_det | Detection in Crowded Scenes | Pytorch | 1.2.13 and later | Mar 2020 | |
| yolov4 | Pytorch-YOLOv4 | Pytorch | 1.2.4 and later | Apr 2020 | EN JP |
| yolov4-tiny | Pytorch-YOLOv4 | Pytorch | 1.2.5 and later | Apr 2020 | |
| yolov5 | yolov5 | Pytorch | 1.2.5 and later | May 2020 | EN JP |
| poly_yolo | Poly YOLO | Keras | 1.2.6 and later | May 2020 | |
| nanodet | NanoDet | Pytorch | 1.2.6 and later | Nov 2020 | |
| yolor | yolor | Pytorch | 1.2.5 and later | May 2021 | |
| yolox | YOLOX | Pytorch | 1.2.6 and later | Jul 2021 | EN JP |
| picodet | PP-PicoDet | Pytorch | 1.2.10 and later | Nov 2021 | |
| yolox-ti-lite | edgeai-yolox | Pytorch | 1.2.9 and later | Dec 2021 | |
| yolov7 | YOLOv7 | Pytorch | 1.2.7 and later | Jul 2022 | |
| fastest-det | FastestDet | Pytorch | 1.2.5 and later | Jul 2022 | |
| yolov | YOLOV | Pytorch | 1.2.10 and later | Aug 2022 | |
| yolov6 | YOLOV6 | Pytorch | 1.2.10 and later | Sep 2022 | |
| damo_yolo | DAMO-YOLO | Pytorch | 1.2.9 and later | Nov 2022 | |
| yolov8 | YOLOv8 | Pytorch | 1.2.14.1 and later | Jan 2023 | |
| yolox_body_head_hand_face | YOLOX-Body-Head-Hand-Face | Pytorch | 1.2.15 and later | Jan 2024 | |
| yolov9 | YOLOv9 | Pytorch | 1.2.10 and later | Feb 2024 | |
| yolov10 | YOLOv10 | Pytorch | 1.2.11 and later | May 2024 | |
| yolov11 | YOLOv11 | Pytorch | 1.2.14 and later | Sep 2024 | |
| yolov12 | YOLOv12 | Pytorch | 1.2.14 and later | Feb 2025 | |

Transformer

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| glip | GLIP | Pytorch | 1.2.13 and later | Dec 2021 | |
| dab-detr | DAB-DETR | Pytorch | 1.2.12 and later | Jan 2022 | |
| detic | Detecting Twenty-thousand Classes using Image-level Supervision | Pytorch | 1.2.10 and later | Jan 2022 | EN JP |
| groundingdino | Grounding DINO | Pytorch | 1.2.16 and later | Mar 2023 | JP |
| rt-detr-v2 | RT-DETR | Pytorch | 1.2.13 and later | Jul 2024 | JP |

Specific target

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| traffic-sign-detection | Traffic Sign Detection | Tensorflow | 1.2.10 and later | Aug 2018 | EN JP |
| sku110k-densedet | SKU110K-DenseDet | Pytorch | 1.2.9 and later | Apr 2019 | EN JP |
| footandball | FootAndBall: Integrated player and ball detector | Pytorch | 1.2.0 and later | Dec 2019 | |
| qrcode_wechatqrcode | qrcode_wechatqrcode | Caffe | 1.2.15 and later | Mar 2021 | |
| mobile_object_localizer | mobile_object_localizer_v1 | TensorFlow Hub | 1.2.6 and later | Jun 2021 | EN JP |
| layout_parsing | unstructured-inference | Pytorch | 1.2.9 and later | Dec 2022 | |

Object detection 3d

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| 3d_bbox | 3D Bounding Box Estimation Using Deep Learning and Geometry | Pytorch | 1.2.6 and later | Dec 2016 | |
| d4lcn | D4LCN | Pytorch | 1.2.9 and later | Dec 2019 | |
| egonet | EgoNet | Pytorch | 1.2.9 and later | Nov 2020 | |
| mediapipe_objectron | MediaPipe Objectron | TensorFlow Lite | 1.2.5 and later | Dec 2020 | |
| 3d-object-detection.pytorch | 3d-object-detection.pytorch | Pytorch | 1.2.8 and later | Feb 2021 | EN JP |
| did_m3d | DID M3D | Pytorch | 1.2.11 and later | Jul 2022 | |

Object tracking

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
| --- | --- | --- | --- | --- | --- |
| deepsort | Deep Sort with PyTorch | Pytorch | 1.2.3 and later | Mar 2017 | EN JP |
| person_reid_baseline_pytorch | UTS-Person-reID-Practical | Pytorch | 1.2.6 and later | Mar 2019 | |
| abd_net | Attentive but Diverse Person Re-Identification | Pytorch | 1.2.7 and later | Aug 2019 | |
| deepsort_vehicle | Multi-Camera Live Object Tracking | Pytorch | 1.2.9 and later | May 2020 | |
| qd-3dt | Monocular Quasi-Dense 3D Object Tracking | Pytorch | 1.2.11 and later | Mar 2021 | |
| centroids-reid | On the Unreasonable Effectiveness of Centroids in Image Retrieval | Pytorch | 1.2.9 and later | Apr 2021 | |
| siam-mot | SiamMOT | Pytorch | 1.2.9 and later | May 2021 | |
| bytetrack | ByteTrack | Pytorch | 1.2.5 and later | Oct 2021 | EN JP |
| strong_sort | StrongSORT | Pytorch | 1.2.15 and later | Feb 2022 | |
| samurai | SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Pytorch | 1.6.1 and later | Nov 2024 | |

Optical Flow Estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| raft | RAFT: Recurrent All Pairs Field Transforms for Optical Flow | Pytorch | 1.2.6 and later | Mar 2020 | EN JP |
| cotracker3 | CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Pytorch | 1.6.1 and later | Oct 2024 | |

Point segmentation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| pointnet_pytorch | PointNet.pytorch | Pytorch | 1.2.6 and later | Dec 2016 | |

Pose estimation

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| openpose | Code repo for realtime multi-person pose estimation in CVPR'17 (Oral) | Caffe | 1.2.1 and later | Nov 2016 | |
| posenet | PoseNet Pytorch | Pytorch | 1.2.10 and later | Jan 2017 | |
| pose_resnet | Simple Baselines for Human Pose Estimation and Tracking | Pytorch | 1.2.1 and later | Apr 2018 | EN JP |
| lightweight-human-pose-estimation | Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper. | Pytorch | 1.2.1 and later | Nov 2018 | EN JP |
| animalpose | MMPose - 2D animal pose estimation | Pytorch | 1.2.7 and later | Aug 2019 | EN JP |
| efficientpose | Code repo for EfficientPose | TensorFlow | 1.2.6 and later | Apr 2020 | |
| blazepose | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 | |
| mediapipe_holistic | MediaPipe Holistic | TensorFlow | 1.2.9 and later | Dec 2020 | |
| movenet | Code repo for movenet | TensorFlow | 1.2.8 and later | May 2021 | EN JP |
| ap-10k | AP-10K | Pytorch | 1.2.4 and later | Aug 2021 | |
| e2pose | E2Pose | Tensorflow | 1.2.5 and later | Oct 2022 | |

Pose estimation 3d

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| pose-hg-3d | Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach | Pytorch | 1.2.6 and later | Apr 2017 | |
| 3d-pose-baseline | A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17. | TensorFlow | 1.2.3 and later | May 2017 | |
| lightweight-human-pose-estimation-3d | Real-time 3D multi-person pose estimation demo in PyTorch. OpenVINO backend can be used for fast inference on CPU. | Pytorch | 1.2.1 and later | Dec 2017 | |
| 3dmppe_posenet | PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image" | Pytorch | 1.2.6 and later | Jul 2019 | |
| gast | A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net) | Pytorch | 1.2.7 and later | Mar 2020 | EN JP |
| blazepose-fullbody | MediaPipe | TensorFlow Lite | 1.2.5 and later | Jun 2020 | EN JP |
| mediapipe_pose_world_landmarks | MediaPipe Pose real-world 3D coordinates | TensorFlow Lite | 1.2.10 and later | Jun 2022 | |

Road detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| road-segmentation-adas | road-segmentation-adas-0001 | OpenVINO | 1.2.5 and later | Sep 2018 | |
| codes-for-lane-detection | Codes-for-Lane-Detection | Pytorch | 1.2.6 and later | Aug 2019 | EN JP |
| ultra-fast-lane-detection | Ultra-Fast-Lane-Detection | Pytorch | 1.2.6 and later | Apr 2020 | |
| polylanenet | PolyLaneNet | Pytorch | 1.2.9 and later | Apr 2020 | |
| roneld | RONELD-Lane-Detection | Pytorch | 1.2.6 and later | Oct 2020 | |
| lstr | LSTR | Pytorch | 1.2.8 and later | Nov 2020 | |
| yolop | YOLOP | Pytorch | 1.2.6 and later | Aug 2021 | |
| cdnet | CDNet | Pytorch | 1.2.5 and later | Feb 2022 | |
| hybridnets | HybridNets | Pytorch | 1.2.6 and later | Mar 2022 | |

Rotation prediction

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| rotnet | CNNs for predicting the rotation angle of an image to correct its orientation | Keras | 1.2.1 and later | Mar 2018 | |

Style transfer

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| adain | Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | Pytorch | 1.2.1 and later | Mar 2017 | EN JP |
| pix2pixHD | pix2pixHD: High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Pytorch | 1.2.6 and later | Nov 2017 | |
| beauty_gan | BeautyGAN | Pytorch | 1.2.7 and later | Jul 2018 | |
| psgan | PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer | Pytorch | 1.2.7 and later | Sep 2019 | |
| animeganv2 | PyTorch Implementation of AnimeGANv2 | Pytorch | 1.2.5 and later | Nov 2020 | |
| EleGANt | EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer | Pytorch | 1.2.15 and later | Jul 2022 | |

Super resolution

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| srresnet | Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | Pytorch | 1.2.0 and later | Sep 2016 | EN JP |
| edsr | Enhanced Deep Residual Networks for Single Image Super-Resolution | Pytorch | 1.2.6 and later | Jul 2017 | EN JP |
| han | Single Image Super-Resolution via a Holistic Attention Network | Pytorch | 1.2.6 and later | Aug 2020 | |
| real-esrgan | Real-ESRGAN | Pytorch | 1.2.9 and later | Jul 2021 | JP |
| swinir | SwinIR: Image Restoration Using Swin Transformer | Pytorch | 1.2.12 and later | Aug 2021 | |
| rcan-it | Revisiting RCAN: Improved Training for Image Super-Resolution | Pytorch | 1.2.10 and later | Jan 2022 | |
| Hat | Hat | Pytorch | 1.2.6 and later | May 2022 | |
| SPAN | SPAN | Pytorch | 1.2.14 and later | Nov 2023 | JP |

Text detection

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| east | EAST: An Efficient and Accurate Scene Text Detector | TensorFlow | 1.2.6 and later | Apr 2017 | |
| pixel_link | Pixel-Link | TensorFlow | 1.2.6 and later | Jan 2018 | |
| craft_pytorch | CRAFT: Character-Region Awareness For Text detection | Pytorch | 1.2.2 and later | Apr 2019 | |

Text recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| etl | Japanese Character Classification | Keras | 1.1.0 and later | 1973 | JP |
| crnn.pytorch | Convolutional Recurrent Neural Network | Pytorch | 1.2.6 and later | Jul 2015 | |
| deep-text-recognition-benchmark | deep-text-recognition-benchmark | Pytorch | 1.2.6 and later | Apr 2019 | |
| easyocr | Ready-to-use OCR with 80+ supported languages | Pytorch | 1.2.6 and later | Apr 2020 | |
| paddleocr | PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle | Pytorch | 1.2.6 and later | Sep 2020 | EN JP |
| donut | Donut | Pytorch | 1.2.16 and later | Nov 2021 | |
| ndlocr_text_recognition | NDL OCR | Pytorch | 1.2.5 and later | Apr 2022 | |
| paddleocr_v3 | PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle | Pytorch | 1.2.17 and later | Jun 2022 | JP |

Time-Series Forecasting

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| informer2020 | Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting (AAAI'21 Best Paper) | Pytorch | 1.2.10 and later | Dec 2020 | |
| timesfm | TimesFM | Pytorch | 1.2.16 and later | Oct 2023 | JP |

Vehicle recognition

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| vehicle-attributes-recognition-barrier | vehicle-attributes-recognition-barrier-0042 | OpenVINO | 1.2.5 and later | May 2018 | EN JP |
| vehicle-license-plate-detection-barrier | vehicle-license-plate-detection-barrier-0106 | OpenVINO | 1.2.5 and later | May 2018 | |

Vision Language Model

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| llava | LLaVA | Pytorch | 1.2.16 and later | Apr 2023 | JP |
| florence2 | Hugging Face - microsoft/Florence-2-base | Pytorch | 1.2.16 and later | Nov 2023 | JP |
| mobilevlm | MobileVLM | Pytorch | 1.5.0 and later | Dec 2023 | |
| llava-jp | LLaVA-JP | Pytorch | 1.5.0 and later | Jan 2024 | |
| qwen2_vl | Qwen2-VL | Pytorch | 1.5.0 and later | Sep 2024 | JP |

Commercial model

| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| acculus-pose | Acculus, Inc. | Caffe | 1.2.3 and later | May 2018 | |

Other languages