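Collection of pre-trained, state-of-the-art AI models.

## About ailia SDK

ailia SDK is a self-contained, cross-platform, high-speed inference SDK for AI. It provides a consistent C++ API on Windows, Mac, Linux, iOS, Android, Jetson, and Raspberry Pi platforms. It also supports Unity (C#), Python, Rust, Flutter (Dart), and JNI for efficient AI implementation. ailia SDK makes extensive use of the GPU through Vulkan and Metal to enable accelerated computing.

## How to use

NEW - ailia SDK can now be installed with `pip3 install ailia`!

ailia MODELS tutorial

ailia MODELS tutorial (Japanese version)

## Supported models

340 models as of August 9th, 2024

## Latest update

- 2024.08.09 Add mahalanobis-ad, t5_base_japanese_ner
- 2024.08.08 Add sdxl-turbo, sd-turbo
- 2024.08.05 Migrate from Transformers to ailia Tokenizer 1.3
- 2024.07.16 Add grounded_sam
- 2024.07.12 Add llava
- 2024.07.09 Add GroundingDINO
- More information in our Wiki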

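## Action recognition

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| mars | MARS: Motion-Augmented RGB Stream for Action Recognition | Pytorch | 1.2.4 and later | EN JP |
| st-gcn | ST-GCN | Pytorch | 1.2.5 and later | EN JP |
| ax_action_recognition | Realtime-Action-Recognition | Pytorch | 1.2.7 and later | |
| va-cnn | View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition | Pytorch | 1.2.7 and later | |
| driver-action-recognition-adas | driver-action-recognition-adas-0002 | OpenVINO | 1.2.5 and later | |
| action_clip | ActionCLIP | Pytorch | 1.2.7 and later | |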
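
The "Supported Ailia Version" column in each table gives a minimum SDK version. A plain string comparison would rank `1.2.9` above `1.2.10`, so a numeric comparison is safer. The following is a minimal sketch, assuming versions are dotted integers such as `1.2.14.1`; the helper names `parse_version` and `meets_minimum` are hypothetical and are not part of ailia SDK.

```python
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a dotted version such as '1.2.14.1' into a tuple of integers."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """Return True if `installed` is at least `minimum` (hypothetical helper)."""
    a, b = parse_version(installed), parse_version(minimum)
    # Pad the shorter tuple with zeros so '1.3' and '1.3.0' compare as equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


if __name__ == "__main__":
    # 1.2.9 is older than 1.2.10 when compared numerically.
    print(meets_minimum("1.2.9", "1.2.10"))
    # Four-part minimums such as 1.2.14.1 are handled by zero padding.
    print(meets_minimum("1.2.16", "1.2.14.1"))
```

The installed version string would have to come from your own environment (for example from `pip3 show ailia`); the sketch only covers the comparison step.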

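## Anomaly detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| padim | PaDiM-Anomaly-Detection-Localization-master | Pytorch | 1.2.6 and later | EN JP |
| spade-pytorch | Sub-Image Anomaly Detection with Deep Pyramid Correspondences | Pytorch | 1.2.6 and later | |
| patchcore | PatchCore_anomaly_detection | Pytorch | 1.2.6 and later | |
| mahalanobisad | MahalanobisAD-pytorch | Pytorch | 1.2.9 and later | |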

## Audio processing

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| crnn_audio_classification | crnn-audio-classification | Pytorch | 1.2.5 and later | EN JP |
| deepspeech2 | deepspeech.pytorch | Pytorch | 1.2.2 and later | EN JP |
| pytorch-dc-tts | Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention | Pytorch | 1.2.6 and later | EN JP |
| unet_source_separation | source_separation | Pytorch | 1.2.6 and later | EN JP |
| transformer-cnn-emotion-recognition | Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders | Pytorch | 1.2.5 and later | |
| auto_speech | AutoSpeech: Neural Architecture Search for Speaker Recognition | Pytorch | 1.2.5 and later | EN JP |
| voicefilter | VoiceFilter | Pytorch | 1.2.7 and later | EN JP |
| whisper | Whisper | Pytorch | 1.2.10 and later | JP |
| clap | CLAP | Pytorch | 1.2.6 and later | |
| wespeaker | WeSpeaker | Onnxruntime | 1.2.9 and later | |
| tacotron2 | Tacotron2 | Pytorch | 1.2.15 and later | JP |
| silero-vad | Silero VAD | Pytorch | 1.2.15 and later | JP |
| rvc | Retrieval-based-Voice-Conversion-WebUI | Pytorch | 1.2.12 and later | JP |
| crepe | torchcrepe | Pytorch | 1.2.10 and later | JP |
| vall-e-x | VALL-E-X | Pytorch | 1.2.15 and later | |
| hifigan | HiFi-GAN | Pytorch | 1.2.9 and later | |
| distil-whisper | Hugging Face - Distil-Whisper | Pytorch | 1.2.16 and later | |
| microsoft clap | CLAP | Pytorch | 1.2.11 and later | |
| narabas | narabas: Japanese phoneme forced alignment tool | Pytorch | 1.2.11 and later | |
| rnnoise | rnnoise | Keras | 1.2.15 and later | |
| audioset_tagging_cnn | PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition | Pytorch | 1.2.9 and later | |
| deep music enhancer | On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks | Pytorch | 1.2.6 and later | |
| pyannote-audio | Pyannote-audio | Pytorch | 1.2.15 and later | JP |
| kotoba-whisper | kotoba-whisper | Pytorch | 1.2.16 and later | |
| reazon_speech | ReazonSpeech | Pytorch | 1.4.0 and later | |
| reazon_speech2 | ReazonSpeech2 | Pytorch | 1.4.0 and later | |
| gpt-sovits | GPT-SoVITS | Pytorch | 1.4.0 and later | JP |

## Background removal

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| U-2-Net | U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Pytorch | 1.2.2 and later | EN JP |
| u2net-portrait-matting | U^2-Net - Portrait matting | Pytorch | 1.2.7 and later | |
| u2net-human-seg | U^2-Net - human segmentation | Pytorch | 1.2.4 and later | |
| deep-image-matting | Deep Image Matting | Keras | 1.2.3 and later | EN JP |
| indexnet | Indices Matter: Learning to Index for Deep Image Matting | Pytorch | 1.2.7 and later | |
| modnet | MODNet: Trimap-Free Portrait Matting in Real Time | Pytorch | 1.2.7 and later | |
| background_matting_v2 | Real-Time High-Resolution Background Matting | Pytorch | 1.2.9 and later | |
| cascade_psp | CascadePSP | Pytorch | 1.2.9 and later | |
| rembg | Rembg | Pytorch | 1.2.4 and later | |
| dis_seg | Highly Accurate Dichotomous Image Segmentation | Pytorch | 1.2.10 and later | |
| gfm | Bridging Composite and Real: Towards End-to-end Deep Image Matting | Pytorch | 1.2.10 and later | |

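## Crowd counting

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| crowdcount-cascaded-mtl | CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting (Single Image Crowd Counting) | Pytorch | 1.2.1 and later | EN JP |
| c-3-framework | Crowd Counting Code Framework (C^3-Framework) | Pytorch | 1.2.5 and later | |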

## Deep fashion

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| clothing-detection | Clothing-Detection | Pytorch | 1.2.1 and later | EN JP |
| mmfashion | MMFashion | Pytorch | 1.2.5 and later | EN JP |
| mmfashion_tryon | MMFashion virtual try-on | Pytorch | 1.2.8 and later | |
| mmfashion_retrieval | MMFashion In-Shop Clothes Retrieval | Pytorch | 1.2.5 and later | |
| fashionai-key-points-detection | A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection | Pytorch | 1.2.5 and later | |
| person-attributes-recognition-crossroad | person-attributes-recognition-crossroad-0230 | Pytorch | 1.2.10 and later | |

## Depth estimation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| monodepth2 | Monocular depth estimation from a single image | Pytorch | 1.2.2 and later | |
| midas | Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer | Pytorch | 1.2.4 and later | EN JP |
| fcrn-depthprediction | Deeper Depth Prediction with Fully Convolutional Residual Networks | TensorFlow | 1.2.6 and later | |
| fast-depth | ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems" | Pytorch | 1.2.5 and later | |
| lap-depth | LapDepth-release | Pytorch | 1.2.9 and later | |
| hitnet | ONNX-HITNET-Stereo-Depth-estimation | Pytorch | 1.2.9 and later | |
| crestereo | ONNX-CREStereo-Depth-Estimation | Pytorch | 1.2.13 and later | |
| mobilestereonet | MobileStereoNet | Pytorch | 1.2.13 and later | |
| zoe_depth | ZoeDepth | Pytorch | 1.3.0 and later | |

## Diffusion

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| latent-diffusion-txt2img | Latent Diffusion - txt2img | Pytorch | 1.2.10 and later | |
| latent-diffusion-inpainting | Latent Diffusion - inpainting | Pytorch | 1.2.10 and later | |
| latent-diffusion-superresolution | Latent Diffusion - Super-resolution | Pytorch | 1.2.10 and later | |
| stable-diffusion-txt2img | Stable Diffusion | Pytorch | 1.2.14 and later | JP |
| control_net | ControlNet | Pytorch | 1.2.15 and later | |
| DA-CLIP | DA-CLIP | Pytorch | 1.2.16 and later | |
| riffusion | Riffusion | Pytorch | 1.2.16 and later | |
| marigold | Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Pytorch | 1.2.16 and later | |
| sdxl-turbo | Hugging Face - SDXL-Turbo | Pytorch | 1.2.16 and later | |
| sd-turbo | Hugging Face - SD-Turbo | Pytorch | 1.2.16 and later | |

## Face detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| yolov1-face | YOLO-Face-detection | Darknet | 1.1.0 and later | |
| yolov3-face | Face detection using keras-yolov3 | Keras | 1.2.1 and later | |
| blazeface | BlazeFace-PyTorch | Pytorch | 1.2.1 and later | EN JP |
| face-mask-detection | Face detection using keras-yolov3 | Keras | 1.2.1 and later | EN JP |
| dbface | DBFace : real-time, single-stage detector for face detection, with faster speed and higher accuracy | Pytorch | 1.2.2 and later | |
| retinaface | RetinaFace: Single-stage Dense Face Localisation in the Wild. | Pytorch | 1.2.5 and later | JP |
| anime-face-detector | Anime Face Detector | Pytorch | 1.2.6 and later | |
| face-detection-adas | face-detection-adas-0001 | OpenVINO | 1.2.5 and later | |
| mtcnn | mtcnn | Keras | 1.2.10 and later | |
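
## Face identification

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| vggface2 | VGGFace2 Dataset for Face Recognition | Caffe | 1.1.0 and later | |
| arcface | PyTorch implementation of ArcFace | Pytorch | 1.2.1 and later | EN JP |
| insightface | InsightFace: 2D and 3D Face Analysis Project | Pytorch | 1.2.5 and later | |
| cosface | PyTorch implementation of CosFace | Pytorch | 1.2.10 and later | |
| facenet_pytorch | Face Recognition Using Pytorch | Pytorch | 1.2.6 and later | |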

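## Face recognition

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| face_classification | Real-time face detection and emotion/gender classification | Keras | 1.1.0 and later | |
| facial_feature | kaggle-facial-keypoints | Pytorch | 1.2.0 and later | |
| face_alignment | 2D and 3D face alignment library built using Pytorch | Pytorch | 1.2.1 and later | EN JP |
| prnet | Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | TensorFlow | 1.2.2 and later | |
| gazeml | A deep learning framework based on Tensorflow for the training of high performance gaze estimation | TensorFlow | 1.2.0 and later | |
| facemesh | facemesh.pytorch | Pytorch | 1.2.2 and later | EN JP |
| mediapipe_iris | irislandmarks.pytorch | Pytorch | 1.2.2 and later | EN JP |
| hopenet | deep-head-pose | Pytorch | 1.2.2 and later | EN JP |
| ax_gaze_estimation | ax Gaze Estimation | Pytorch | 1.2.2 and later | EN JP |
| age-gender-recognition-retail | age-gender-recognition-retail-0013 | OpenVINO | 1.2.5 and later | EN JP |
| ferplus | FER+ | CNTK | 1.2.2 and later | |
| face-anti-spoofing | Lightweight Face Anti Spoofing | Pytorch | 1.2.5 and later | EN JP |
| ax_facial_features | ax Facial Features | Pytorch | 1.2.5 and later | EN |
| 6d_repnet | 6D Rotation Representation for Unconstrained Head Pose Estimation (Pytorch) | Pytorch | 1.2.6 and later | |
| hsemotion | HSEmotion (High-Speed face Emotion recognition) library | Pytorch | 1.2.5 and later | |
| facemesh_v2 | MediaPipe Face landmark detection | Pytorch | 1.2.9 and later | JP |
| 3ddfa | Towards Fast, Accurate and Stable 3D Dense Face Alignment | Pytorch | 1.2.10 and later | |
| mivolo | MiVOLO: Multi-input Transformer for Age and Gender Estimation | Pytorch | 1.2.13 and later | |
| L2CS_Net | L2CS_Net | Pytorch | 1.2.9 and later | |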

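## Face swapping

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| facefusion | FaceFusion | ONNXRuntime | 1.2.10 and later | |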

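## Frame interpolation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| flavr | FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation | Pytorch | 1.2.7 and later | EN JP |
| cain | Channel Attention Is All You Need for Video Frame Interpolation | Pytorch | 1.2.5 and later | |
| film | FILM: Frame Interpolation for Large Motion | Tensorflow | 1.2.10 and later | |
| rife | Real-Time Intermediate Flow Estimation for Video Frame Interpolation | Pytorch | 1.2.13 and later | |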

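## Generative adversarial networks

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| pytorch-gan | Code repo for the Pytorch GAN Zoo project (used to train this model) | Pytorch | 1.2.4 and later | |
| council-gan | Council-GAN | Pytorch | 1.2.4 and later | |
| restyle-encoder | ReStyle | Pytorch | 1.2.9 and later | |
| sam | Age Transformation Using a Style-Based Regression Model | Pytorch | 1.2.9 and later | |
| gfpgan | GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior | Pytorch | 1.2.10 and later | JP |
| sber-swap | SberSwap | Pytorch | 1.2.12 and later | |
| encoder4editing | Designing an Encoder for StyleGAN Image Manipulation | Pytorch | 1.2.10 and later | |
| lipgan | LipGAN | Keras | 1.2.15 and later | JP |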

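## Hand detection

| Model | Reference | Exported From | Supported Ailia Version |
|------------:|:------------:|:------------:|:------------:|
| yolov3-hand | Hand detection branch of Face detection using keras-yolov3 | Keras | 1.2.1 and later |
| hand_detection_pytorch | hand-detection.PyTorch | Pytorch | 1.2.2 and later |
| blazepalm | MediaPipePyTorch | Pytorch | 1.2.5 and later |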

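## Hand recognition

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| blazehand | MediaPipePyTorch | Pytorch | 1.2.5 and later | EN JP |
| hand3d | ColorHandPose3D network | TensorFlow | 1.2.5 and later | |
| minimal-hand | Minimal Hand | TensorFlow | 1.2.8 and later | |
| v2v-posenet | V2V-PoseNet | Pytorch | 1.2.6 and later | |
| hands_segmentation_pytorch | hands-segmentation-pytorch | Pytorch | 1.2.10 and later | |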

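## Image captioning

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| illustration2vec | Illustration2Vec | Caffe | 1.2.2 and later | |
| image_captioning_pytorch | Image Captioning pytorch | Pytorch | 1.2.5 and later | EN JP |
| blip2 | Hugging Face - BLIP-2 | Pytorch | 1.2.16 and later | |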

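## Image classification

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| vgg16 | Very Deep Convolutional Networks for Large-Scale Image Recognition | Keras | 1.1.0 and later | |
| googlenet | Going Deeper with Convolutions | Pytorch | 1.2.0 and later | |
| resnet50 | Deep Residual Learning for Image Recognition | Chainer | 1.2.0 and later | |
| inceptionv3 | Rethinking the Inception Architecture for Computer Vision | Pytorch | 1.2.0 and later | JP |
| inceptionv4 | Keras Inception-V4 | Keras | 1.2.5 and later | |
| mobilenetv2 | PyTorch implementation of MobileNet V2 | Pytorch | 1.2.0 and later | |
| mobilenetv3 | PyTorch implementation of MobileNet V3 | Pytorch | 1.2.1 and later | |
| partialconv | Partial Convolution Layer for Padding and Image Inpainting | Pytorch | 1.2.0 and later | |
| efficientnet | PyTorch implementation of EfficientNet | Pytorch | 1.2.3 and later | |
| efficientnetv2 | EfficientNetV2 | Pytorch | 1.2.4 and later | |
| vit | PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) | Pytorch | 1.2.7 and later | EN JP |
| wide_resnet50 | Wide Residual Networks | Pytorch | 1.2.5 and later | |
| resnet18 | ResNet18 | Pytorch | 1.2.8 and later | |
| mlp_mixer | MLP-Mixer | Pytorch | 1.2.9 and later | |
| alexnet | AlexNet PyTorch | Pytorch | 1.2.5 and later | |
| clip | CLIP | Pytorch | 1.2.9 and later | EN JP |
| japanese-clip | Japanese-CLIP | Pytorch | 1.2.15 and later | |
| weather-prediction-from-image | Weather Prediction From Image - (Warmth Of Image) | Keras | 1.2.5 and later | |
| swin-transformer | Swin Transformer | Pytorch | 1.2.6 and later | |
| convnext | PyTorch implementation of ConvNeXt | Pytorch | 1.2.5 and later | |
| mobileone | PyTorch implementation of MobileOne | Pytorch | 1.2.1 and later | |
| imagenet21k | ImageNet21K | Pytorch | 1.2.11 and later | |
| japanese-stable-clip-vit-l-16 | japanese-stable-clip-vit-l-16 | Pytorch | 1.2.11 and later | |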

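## Image inpainting

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| inpainting-with-partial-conv | pytorch-inpainting-with-partial-conv | PyTorch | 1.2.6 and later | EN JP |
| inpainting_gmcnn | Image Inpainting via Generative Multi-column Convolutional Neural Networks | TensorFlow | 1.2.6 and later | |
| 3d-photo-inpainting | 3D Photography using Context-aware Layered Depth Inpainting | Pytorch | 1.2.7 and later | |
| deepfillv2 | Free-Form Image Inpainting with Gated Convolution | Pytorch | 1.2.9 and later | |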

## Image manipulation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| noise2noise | Learning Image Restoration without Clean Data | Pytorch | 1.2.0 and later | |
| dewarpnet | DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks | Pytorch | 1.2.1 and later | |
| illnet | Document Rectification and Illumination Correction using a Patch-based CNN | Pytorch | 1.2.2 and later | |
| colorization | Colorful Image Colorization | Pytorch | 1.2.2 and later | EN JP |
| u2net_portrait | U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Pytorch | 1.2.2 and later | |
| style2paints | Style2Paints | TensorFlow | 1.2.6 and later | |
| deep_white_balance | Deep White-Balance Editing, CVPR 2020 (Oral) | PyTorch | 1.2.6 and later | |
| deblur_gan | DeblurGAN | Pytorch | 1.2.6 and later | |
| invertible_denoising_network | Invertible Image Denoising | Pytorch | 1.2.8 and later | |
| dfm | Deep Feature Matching | Pytorch | 1.2.6 and later | |
| dfe | Deep Fundamental Matrix Estimation | Pytorch | 1.2.6 and later | |
| dehamer | Image Dehazing Transformer with Transmission-Aware 3D Position Embedding | Pytorch | 1.2.13 and later | |
| pytorch-superpoint | pytorch-superpoint : Self-Supervised Interest Point Detection and Description | Pytorch | 1.2.6 and later | |
| cnngeometric_pytorch | CNNGeometric PyTorch implementation | Pytorch | 1.2.7 and later | |
| lightglue | LightGlue-ONNX | Pytorch | 1.2.15 and later | |
| docshadow | DocShadow-ONNX-TensorRT | Pytorch | 1.2.10 and later | |

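## Image restoration

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| nafnet | NAFNet: Nonlinear Activation Free Network for Image Restoration | Pytorch | 1.2.10 and later | |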

## Image segmentation

| | Model | Reference | Exported From | Supported Ailia Version | Blog |
|:-----------|------------:|:------------:|:------------:|:------------:|:------------:|
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/86482ba2-87bc-4775-b500-084913324d00.png" width=128px>](image_segmentation/deeplabv3/) | [deeplabv3](/image_segmentation/deeplabv3/) | [Xception65 for backbone network of DeepLab v3+](https://github.com/tensorflow/models/tree/master/research/deeplab) | Chainer | 1.2.0 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/40d6b198-22e0-4a76-b050-ebc2515a97ee.png" width=128px>](image_segmentation/hrnet_segmentation/) | [hrnet_segmentation](/image_segmentation/hrnet_segmentation/) | [High-resolution networks (HRNets) for Semantic Segmentation](https://github.com/HRNet/HRNet-Semantic-Segmentation) | Pytorch | 1.2.1 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/1e320bcc-53eb-406d-aa8a-515fcd22eb59.png" width=128px>](image_segmentation/hair_segmentation/) | [hair_segmentation](/image_segmentation/hair_segmentation/) | [Hair segmentation on mobile devices](https://github.com/thangtran480/hair-segmentation) | Keras | 1.2.1 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/04966dfb-1ece-4e10-81e3-3889bb017322.png" width=128px>](image_segmentation/pspnet-hair-segmentation/) | [pspnet-hair-segmentation](/image_segmentation/pspnet-hair-segmentation/) | [pytorch-hair-segmentation](https://github.com/YBIGTA/pytorch-hair-segmentation) | Pytorch | 1.2.2 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/25de63d3-8862-4074-bca6-d44c10cc2650.png" width=128px>](image_segmentation/human_part_segmentation/) | [human_part_segmentation](/image_segmentation/human_part_segmentation/) | [Self Correction for Human Parsing](https://github.com/PeikeLi/Self-Correction-Human-Parsing) | Pytorch | 1.2.4 and later | [EN](https://medium.com/axinc-ai/humanpartsegmentation-a-machine-learning-model-for-segmenting-human-parts-cd7e39480714) [JP](https://medium.com/axinc/humanpartsegmentation-%E5%8B%95%E7%94%BB%E3%81%8B%E3%82%89%E4%BD%93%E3%81%AE%E9%83%A8%E4%BD%8D%E3%82%92%E3%82%BB%E3%82%B0%E3%83%A1%E3%83%B3%E3%83%86%E3%83%BC%E3%82%B7%E3%83%A7%E3%83%B3%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-e8a0e405255) |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/86449775-771e-413d-928f-49cf76ba9ffd.png" width=128px>](image_segmentation/semantic-segmentation-mobilenet-v3/) | [semantic-segmentation-mobilenet-v3](/image_segmentation/semantic-segmentation-mobilenet-v3) | [Semantic segmentation with MobileNetV3](https://github.com/OniroAI/Semantic-segmentation-with-MobileNetV3) | TensorFlow | 1.2.5 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/0e4dd8bc-92a9-4125-bb4c-d30f23e1213b.jpg" width=128px>](image_segmentation/pytorch-unet/) | [pytorch-unet](/image_segmentation/pytorch-unet/) | [Pytorch-Unet](https://github.com/milesial/Pytorch-UNet) | Pytorch | 1.2.5 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/c181f4bc-2cfa-4804-aeeb-60267951122f.png" width=128px>](image_segmentation/pytorch-enet/) | [pytorch-enet](/image_segmentation/pytorch-enet/) | [PyTorch-ENet](https://github.com/davidtvs/PyTorch-ENet) | Pytorch | 1.2.8 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/2662f1e7-4ee3-4091-b743-4000ae4335c2.png" width=128px>](image_segmentation/yet-another-anime-segmenter/) | [yet-another-anime-segmenter](/image_segmentation/yet-another-anime-segmenter/) | [Yet Another Anime Segmenter](https://github.com/zymk9/Yet-Another-Anime-Segmenter) | Pytorch | 1.2.6 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/6dc9b9f1-6ac0-492d-82b7-1317637fd6b6.png" width=128px>](image_segmentation/swiftnet/) | [swiftnet](/image_segmentation/swiftnet/) | [SwiftNet](https://github.com/orsic/swiftnet) | Pytorch | 1.2.6 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/926da1b5-88a0-4188-bbd4-9043c5afc654.png" width=128px>](image_segmentation/dense_prediction_transformers/) | [dense_prediction_transformers](/image_segmentation/dense_prediction_transformers/) | [Vision Transformers for Dense Prediction](https://github.com/intel-isl/DPT) | Pytorch | 1.2.7 and later | [EN](https://medium.com/axinc-ai/dpt-segmentation-model-using-vision-transformer-b479f3027468) [JP](https://medium.com/axinc/dpt-vision-transformer%E3%82%92%E4%BD%BF%E7%94%A8%E3%81%97%E3%81%9F%E3%82%BB%E3%82%B0%E3%83%A1%E3%83%B3%E3%83%86%E3%83%BC%E3%82%B7%E3%83%A7%E3%83%B3%E3%83%A2%E3%83%87%E3%83%AB-88db4842b4a7) |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/5d7ebfb6-aa18-4f59-b800-60637ae534d8.png" width=128px>](image_segmentation/paddleseg/) | [paddleseg](/image_segmentation/paddleseg/) | [PaddleSeg](https://github.com/PaddlePaddle/PaddleSeg/tree/release/2.3/contrib/CityscapesSOTA) | Pytorch | 1.2.7 and later | [EN](https://medium.com/axinc-ai/paddleseg-highly-accurate-segmentation-model-using-hierarchical-attention-18e69363dc2a) [JP](https://medium.com/axinc/paddleseg-%E9%9A%8E%E5%B1%A4%E7%9A%84%E3%81%AA%E3%82%A2%E3%83%86%E3%83%B3%E3%82%B7%E3%83%A7%E3%83%B3%E3%82%92%E4%BD%BF%E7%94%A8%E3%81%97%E3%81%9F%E9%AB%98%E7%B2%BE%E5%BA%A6%E3%81%AA%E3%82%BB%E3%82%B0%E3%83%A1%E3%83%B3%E3%83%86%E3%83%BC%E3%82%B7%E3%83%A7%E3%83%B3%E3%83%A2%E3%83%87%E3%83%AB-acc89bf50423) |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/49c8de46-512a-46e2-bee5-6965a9cb12ee.png" width=128px>](image_segmentation/pp_liteseg/) | [pp_liteseg](/image_segmentation/pp_liteseg/) | [PP-LiteSeg](https://github.com/PaddlePaddle/PaddleSeg/tree/develop/configs/pp_liteseg) | Pytorch | 1.2.10 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/dabc6212-5b12-46da-8f1b-a0660e55b638.jpg" width=128px>](image_segmentation/suim/) | [suim](/image_segmentation/suim/) | [SUIM](https://github.com/IRVLab/SUIM) | Keras | 1.2.6 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/724e5404-a272-4eba-afa8-eb7dbf0f3472.png" width=128px>](image_segmentation/group_vit/) | [group_vit](/image_segmentation/group_vit/) | [GroupViT](https://github.com/NVlabs/GroupViT) | Pytorch | 1.2.10 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/de9b5887-9c5a-4f46-ba5c-519b575b8409.png" width=128px>](image_segmentation/anime-segmentation/) | [anime-segmentation](/image_segmentation/anime-segmentation/) | [Anime Segmentation](https://github.com/SkyTNT/anime-segmentation) | Pytorch | 1.2.9 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/00b27b55-6b7d-45f0-9401-4fa29c4d3b86.png" width=128px>](image_segmentation/segment-anything/) | [segment-anything](/image_segmentation/segment-anything/) | [Segment Anything](https://github.com/facebookresearch/segment-anything) | Pytorch | 1.2.16 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/e3a704dc-e733-40d5-bf3b-047adb2218e3.png" width=128px>](image_segmentation/tusimple-DUC/) | [tusimple-DUC](/image_segmentation/tusimple-DUC/) | [TuSimple-DUC](https://github.com/TuSimple/TuSimple-DUC) | Pytorch | 1.2.10 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/2e9421ff-ba09-4528-a194-6c68bf5b6ac5.jpg" width=128px>](image_segmentation/pytorch-fcn/) | [pytorch-fcn](/image_segmentation/pytorch-fcn/) | [pytorch-fcn](https://github.com/wkentaro/pytorch-fcn) | Pytorch | 1.3.0 and later | |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/fc795dec-3e8f-4a08-b243-75d1b0a7a240.png" width=128px>](image_segmentation/grounded_sam/) | [grounded_sam](/image_segmentation/grounded_sam/) | [Grounded-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main) | Pytorch | 1.2.16 and later | |

## Large language model

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
|[llava](/large_language_model/llava) | [LLaVA](https://github.com/haotian-liu/LLaVA) | Pytorch | 1.2.16 and later | |

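## Landmark classification

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| landmarks_classifier_asia | Landmarks classifier_asia_V1.1 | TensorFlow Hub | 1.2.4 and later | EN JP |
| places365 | Release of Places365-CNNs | Pytorch | 1.2.5 and later | |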

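## Line segment detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| mlsd | M-LSD: Towards Light-weight and Real-time Line Segment Detection | TensorFlow | 1.2.8 and later | EN JP |
| dexined | DexiNed: Dense Extreme Inception Network for Edge Detection | Pytorch | 1.2.5 and later | |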

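## Low-light image enhancement

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| agllnet | AGLLNet: Attention Guided Low-light Image Enhancement (IJCV 2021) | Pytorch | 1.2.9 and later | EN JP |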

## Natural language processing

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
|[bert](/natural_language_processing/bert) | [pytorch-pretrained-bert](https://pypi.org/project/pytorch-pretrained-bert/) | Pytorch | 1.2.2 and later | [EN](https://medium.com/axinc-ai/bert-a-machine-learning-model-for-efficient-natural-language-processing-aef3081c24e8) [JP](https://medium.com/axinc/bert-%E8%87%AA%E7%84%B6%E8%A8%80%E8%AA%9E%E5%87%A6%E7%90%86%E3%82%92%E5%8A%B9%E7%8E%87%E7%9A%84%E3%81%AB%E5%AD%A6%E7%BF%92%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-3a9c27d78cf8) |
|[bert_maskedlm](/natural_language_processing/bert_maskedlm) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[bert_ner](/natural_language_processing/bert_ner) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[bert_question_answering](/natural_language_processing/bert_question_answering) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[bert_sentiment_analysis](/natural_language_processing/bert_sentiment_analysis) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[bert_zero_shot_classification](/natural_language_processing/bert_zero_shot_classification) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[bert_tweets_sentiment](/natural_language_processing/bert_tweets_sentiment) | [huggingface/transformers](https://github.com/huggingface/transformers) | Pytorch | 1.2.5 and later | |
|[gpt2](/natural_language_processing/gpt2) | [GPT-2](https://github.com/onnx/models/blob/master/text/machine_comprehension/gpt-2/README.md) | Pytorch | 1.2.7 and later | |
|[rinna_gpt2](/natural_language_processing/rinna_gpt2) | [japanese-pretrained-models](https://github.com/rinnakk/japanese-pretrained-models) | Pytorch | 1.2.7 and later | |
|[fugumt-en-ja](/natural_language_processing/fugumt-en-ja) | [Fugu-Machine Translator](https://github.com/s-taka/fugumt) | Pytorch | 1.2.9 and later | [JP](https://medium.com/axinc/fugumt-%E8%8B%B1%E8%AA%9E%E3%81%8B%E3%82%89%E6%97%A5%E6%9C%AC%E8%AA%9E%E3%81%B8%E3%81%AE%E7%BF%BB%E8%A8%B3%E3%82%92%E8%A1%8C%E3%81%86%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-46b839c1b4ae) |
|[fugumt-ja-en](/natural_language_processing/fugumt-ja-en) | [Fugu-Machine Translator](https://github.com/s-taka/fugumt) | Pytorch | 1.2.10 and later | |
|[bert_sum_ext](/natural_language_processing/bert_sum_ext) | [BERTSUMEXT](https://github.com/dmmiller612/bert-extractive-summarizer) | Pytorch | 1.2.7 and later | |
|[sentence_transformers_japanese](/natural_language_processing/sentence_transformers_japanese) | [sentence transformers](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | Pytorch | 1.2.7 and later | [JP](https://medium.com/axinc/sentencetransformer-%E3%83%86%E3%82%AD%E3%82%B9%E3%83%88%E3%81%8B%E3%82%89embedding%E3%82%92%E5%8F%96%E5%BE%97%E3%81%99%E3%82%8B%E8%A8%80%E8%AA%9E%E5%87%A6%E7%90%86%E3%83%A2%E3%83%87%E3%83%AB-b7d2a9bb2c31) |
|[presumm](/natural_language_processing/presumm) | [PreSumm](https://github.com/nlpyang/PreSumm) | Pytorch | 1.2.8 and later | |
|[t5_base_japanese_title_generation](/natural_language_processing/t5_base_japanese_title_generation) | [t5-japanese](https://github.com/sonoisa/t5-japanese) | Pytorch | 1.2.13 and later | [JP](https://medium.com/axinc/t5-%E3%83%86%E3%82%AD%E3%82%B9%E3%83%88%E3%81%8B%E3%82%89%E3%83%86%E3%82%AD%E3%82%B9%E3%83%88%E3%82%92%E7%94%9F%E6%88%90%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-602830bdc5b4) |
|[bertjsc](/natural_language_processing/bertjsc) | [bertjsc](https://github.com/er-ri/bertjsc) | Pytorch | 1.2.15 and later | |
|[multilingual-e5](/natural_language_processing/multilingual-e5) | [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) | Pytorch | 1.2.15 and later | [JP](https://medium.com/axinc/multilingual-e5-%E5%A4%9A%E8%A8%80%E8%AA%9E%E3%81%AE%E3%83%86%E3%82%AD%E3%82%B9%E3%83%88%E3%82%92embedding%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-71f1dec7c4f0) |
|[bert_insert_punctuation](/natural_language_processing/bert_insert_punctuation) | [bert-japanese](https://github.com/cl-tohoku/bert-japanese) | Pytorch | 1.2.15 and later | |
|[t5_whisper_medical](/natural_language_processing/t5_whisper_medical) | Error correction of medical terms using t5 | Pytorch | 1.2.13 and later | |
|[t5_base_summarization](/natural_language_processing/t5_base_japanese_summarization) | [t5-japanese](https://github.com/sonoisa/t5-japanese) | Pytorch | 1.2.13 and later | |
|[glucose](/natural_language_processing/glucose) | [GLuCoSE (General Luke-based Contrastive Sentence Embedding)-base-Japanese](https://huggingface.co/pkshatech/GLuCoSE-base-ja) | Pytorch | 1.2.15 and later | |
|[cross_encoder_mmarco](/natural_language_processing/cross_encoder_mmarco) | [jeffwan/mmarco-mMiniLMv2-L12-H384-v](https://huggingface.co/jeffwan/mmarco-mMiniLMv2-L12-H384-v1) | Pytorch | 1.2.10 and later | [JP](https://medium.com/axinc/crossencodermmarco-%E8%B3%AA%E5%95%8F%E6%96%87%E3%81%A8%E5%9B%9E%E7%AD%94%E6%96%87%E3%81%AE%E9%A1%9E%E4%BC%BC%E5%BA%A6%E3%82%92%E8%A8%88%E7%AE%97%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-c90b35e9fc09) |
|[soundchoice-g2p](/natural_language_processing/soundchoice-g2p) | [Hugging Face - speechbrain/soundchoice-g2p](https://huggingface.co/speechbrain/soundchoice-g2p) | Pytorch | 1.2.16 and later | |
|[g2p_en](/natural_language_processing/g2p_en) | [g2p_en](https://github.com/Kyubyong/g2p) | Pytorch | 1.2.14 and later | |
|[t5_base_japanese_ner](/natural_language_processing/t5_base_japanese_ner) | [t5-japanese](https://github.com/sonoisa/t5-japanese) | Pytorch | 1.2.13 and later | |
|[japanese-reranker-cross-encoder](/natural_language_processing/japanese-reranker-cross-encoder) | [hotchpotch/japanese-reranker-cross-encoder-large-v1](https://huggingface.co/hotchpotch/japanese-reranker-cross-encoder-large-v1) | Pytorch | 1.2.16 and later | |

## Network intrusion detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| [bert-network-packet-flow-header-payload](/network_intrusion_detection/bert-network-packet-flow-header-payload/) | [bert-network-packet-flow-header-payload](https://huggingface.co/rdpahalavan/bert-network-packet-flow-header-payload)| Pytorch | 1.2.10 and later | |
| [falcon-adapter-network-packet](/network_intrusion_detection/falcon-adapter-network-packet/) | [falcon-adapter-network-packet](https://huggingface.co/rdpahalavan/falcon-adapter-network-packet)| Pytorch | 1.2.10 and later | |

## Neural rendering

| | Model | Reference | Exported From | Supported Ailia Version | Blog |
|:-----------|------------:|:------------:|:------------:|:------------:|:------------:|
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/0f455f74-1f46-4bb6-a0b4-d2abb0164621.png" width=128px>](neural_rendering/nerf/) | [nerf](/neural_rendering/nerf/) | [NeRF: Neural Radiance Fields](https://github.com/bmild/nerf) | Tensorflow | 1.2.10 and later | [EN](https://medium.com/axinc-ai/nerf-machine-learning-model-to-generate-and-render-3d-models-from-multiple-viewpoint-images-599631dc2075) [JP](https://medium.com/axinc/nerf-%E8%A4%87%E6%95%B0%E3%81%AE%E8%A6%96%E7%82%B9%E3%81%AE%E7%94%BB%E5%83%8F%E3%81%8B%E3%82%893d%E3%83%A2%E3%83%87%E3%83%AB%E3%82%92%E7%94%9F%E6%88%90%E3%81%97%E3%81%A6%E3%83%AC%E3%83%B3%E3%83%80%E3%83%AA%E3%83%B3%E3%82%B0%E3%81%99%E3%82%8B%E6%A9%9F%E6%A2%B0%E5%AD%A6%E7%BF%92%E3%83%A2%E3%83%87%E3%83%AB-2d6bee7ff22f) |
| [<img src="https://yellow-cdn.veclightyear.com/35dd4d3f/e34e7237-6308-4e49-b2a9-ba04fbf94180.gif" width=128px>](neural_rendering/tripo_sr/) | [TripoSR](/neural_rendering/tripo_sr/) | [TripoSR](https://github.com/VAST-AI-Research/TripoSR) | Pytorch | 1.2.6 and later | |

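## NSFW detector

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| clip-based-nsfw-detector | CLIP-based-NSFW-Detector | Keras | 1.2.10 and later | JP |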

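## Object detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| yolov1-tiny | YOLO: Real-Time Object Detection | Darknet | 1.1.0 and later | JP |
| yolov2 | YOLO: Real-Time Object Detection | Pytorch | 1.2.0 and later | |
| yolov2-tiny | YOLO: Real-Time Object Detection | Pytorch | 1.2.6 and later | |
| yolov3 | YOLO: Real-Time Object Detection | ONNX Runtime | 1.2.1 and later | EN JP |
| yolov3-tiny | YOLO: Real-Time Object Detection | ONNX Runtime | 1.2.1 and later | |
| yolov4 | Pytorch-YOLOv4 | Pytorch | 1.2.4 and later | EN JP |
| yolov4-tiny | Pytorch-YOLOv4 | Pytorch | 1.2.5 and later | |
| yolov5 | yolov5 | Pytorch | 1.2.5 and later | EN JP |
| yolov6 | YOLOV6 | Pytorch | 1.2.10 and later | |
| yolov7 | YOLOv7 | Pytorch | 1.2.7 and later | |
| yolov8 | YOLOv8 | Pytorch | 1.2.14.1 and later | |
| yolov8-seg | YOLOv8 | Pytorch | 1.2.14.1 and later | |
| yolov9 | YOLOv9 | Pytorch | 1.2.10 and later | |
| yolor | yolor | Pytorch | 1.2.5 and later | |
| yolox | YOLOX | Pytorch | 1.2.6 and later | EN JP |
| yolox-ti-lite | edgeai-yolox | Pytorch | 1.2.9 and later | |
| yolov | YOLOV | Pytorch | 1.2.10 and later | |
| mobilenet_ssd | MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch | Pytorch | 1.2.1 and later | EN JP |
| maskrcnn | Mask R-CNN: real-time neural network for object instance segmentation | Pytorch | 1.2.3 and later | |
| m2det | M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network | Pytorch | 1.2.3 and later | EN JP |
| centernet | CenterNet: Objects as Points | Pytorch | 1.2.1 and later | EN JP |
| pedestrian_detection | Pedestrian-Detection-on-YOLOv3_Research-and-APP | Keras | 1.2.1 and later | |
| efficientdet | EfficientDet: Scalable and Efficient Object Detection, in PyTorch | Pytorch | 1.2.6 and later | |
| nanodet | NanoDet | Pytorch | 1.2.6 and later | |
| mobile_object_localizer | mobile_object_localizer_v1 | TensorFlow Hub | 1.2.6 and later | EN JP |
| sku110k-densedet | SKU110K-DenseDet | Pytorch | 1.2.9 and later | EN JP |
| traffic-sign-detection | Traffic Sign Detection | Tensorflow | 1.2.10 and later | EN JP |
| detic | Detecting Twenty-thousand Classes using Image-level Supervision | Pytorch | 1.2.10 and later | EN JP |
| picodet | PP-PicoDet | Pytorch | 1.2.10 and later | |
| yolact | You Only Look At CoefficienTs | Pytorch | 1.2.6 and later | |
| fastest-det | FastestDet | Pytorch | 1.2.5 and later | |
| dab-detr | DAB-DETR | Pytorch | 1.2.12 and later | |
| poly_yolo | Poly YOLO | Keras | 1.2.6 and later | |
| glip | GLIP | Pytorch | 1.2.13 and later | |
| crowd_det | Detection in Crowded Scenes | Pytorch | 1.2.13 and later | |
| footandball | FootAndBall: Integrated player and ball detector | Pytorch | 1.2.0 and later | |
| qrcode_wechatqrcode | qrcode_wechatqrcode | Caffe | 1.2.15 and later | |
| layout_parsing | unstructured-inference | Pytorch | 1.2.9 and later | |
| damo_yolo | DAMO-YOLO | Pytorch | 1.2.9 and later | |
| groundingdino | Grounding DINO | Pytorch | 1.2.16 and later | |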

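## Object detection 3D

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| 3d_bbox | 3D Bounding Box Estimation Using Deep Learning and Geometry | Pytorch | 1.2.6 and later | |
| 3d-object-detection.pytorch | 3d-object-detection.pytorch | Pytorch | 1.2.8 and later | EN JP |
| mediapipe_objectron | MediaPipe Objectron | TensorFlow Lite | 1.2.5 and later | |
| egonet | EgoNet | Pytorch | 1.2.9 and later | |
| d4lcn | D4LCN | Pytorch | 1.2.9 and later | |
| did_m3d | DID M3D | Pytorch | 1.2.11 and later | |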

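## Object tracking

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| deepsort | Deep Sort with PyTorch | Pytorch | 1.2.3 and later | EN JP |
| person_reid_baseline_pytorch | UTS-Person-reID-Practical | Pytorch | 1.2.6 and later | |
| abd_net | Attentive but Diverse Person Re-Identification | Pytorch | 1.2.7 and later | |
| siam-mot | SiamMOT | Pytorch | 1.2.9 and later | |
| bytetrack | ByteTrack | Pytorch | 1.2.5 and later | EN JP |
| qd-3dt | Monocular Quasi-Dense 3D Object Tracking | Pytorch | 1.2.11 and later | |
| strong_sort | StrongSORT | Pytorch | 1.2.15 and later | |
| centroids-reid | On the Unreasonable Effectiveness of Centroids in Image Retrieval | Pytorch | 1.2.9 and later | |
| deepsort_vehicle | Multi-Camera Live Object Tracking | Pytorch | 1.2.9 and later | |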

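## Optical flow estimation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| raft | RAFT: Recurrent All Pairs Field Transforms for Optical Flow | Pytorch | 1.2.6 and later | EN JP |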

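## Point segmentation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| pointnet_pytorch | PointNet.pytorch | Pytorch | 1.2.6 and later | |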

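## Pose estimation

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| openpose | Code repo for realtime multi-person pose estimation in CVPR'17 (Oral) | Caffe | 1.2.1 and later | |
| lightweight-human-pose-estimation | Fast and accurate human pose estimation in PyTorch. Contains implementation of the "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper. | Pytorch | 1.2.1 and later | EN JP |
| pose_resnet | Simple Baselines for Human Pose Estimation and Tracking | Pytorch | 1.2.1 and later | EN JP |
| blazepose | MediaPipePyTorch | Pytorch | 1.2.5 and later | |
| efficientpose | Code repo for EfficientPose | TensorFlow | 1.2.6 and later | |
| movenet | Code repo for movenet | TensorFlow | 1.2.8 and later | EN JP |
| animalpose | MMPose - 2D animal pose estimation | Pytorch | 1.2.7 and later | EN JP |
| mediapipe_holistic | MediaPipe Holistic | TensorFlow | 1.2.9 and later | |
| ap-10k | AP-10K | Pytorch | 1.2.4 and later | |
| posenet | PoseNet Pytorch | Pytorch | 1.2.10 and later | |
| e2pose | E2Pose | Tensorflow | 1.2.5 and later | |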

## Pose estimation 3D

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| lightweight-human-pose-estimation-3d | Real-time 3D multi-person pose estimation demo in PyTorch. OpenVINO backend can be used for fast inference on CPU. | Pytorch | 1.2.1 and later | |
| 3d-pose-baseline | A simple baseline for 3D human pose estimation in TensorFlow. Presented at ICCV 17. | TensorFlow | 1.2.3 and later | |
| pose-hg-3d | Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach | Pytorch | 1.2.6 and later | |
| blazepose-fullbody | MediaPipe | TensorFlow Lite | 1.2.5 and later | EN JP |
| 3dmppe_posenet | PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image" | Pytorch | 1.2.6 and later | |
| gast | A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video (GAST-Net) | Pytorch | 1.2.7 and later | EN JP |
| mediapipe_pose_world_landmarks | MediaPipe Pose real-world 3D coordinates | TensorFlow Lite | 1.2.10 and later | |

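## Road detection

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| codes-for-lane-detection | Codes-for-Lane-Detection | Pytorch | 1.2.6 and later | EN JP |
| roneld | RONELD-Lane-Detection | Pytorch | 1.2.6 and later | |
| road-segmentation-adas | road-segmentation-adas-0001 | OpenVINO | 1.2.5 and later | |
| cdnet | CDNet | Pytorch | 1.2.5 and later | |
| lstr | LSTR | Pytorch | 1.2.8 and later | |
| ultra-fast-lane-detection | Ultra-Fast-Lane-Detection | Pytorch | 1.2.6 and later | |
| yolop | YOLOP | Pytorch | 1.2.6 and later | |
| hybridnets | HybridNets | Pytorch | 1.2.6 and later | |
| polylanenet | PolyLaneNet | Pytorch | 1.2.9 and later | |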

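## Rotation prediction

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| rotnet | CNNs for predicting the rotation angle of an image to correct its orientation | Keras | 1.2.1 and later | |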

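## Style transfer

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| adain | Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | Pytorch | 1.2.1 and later | EN JP |
| psgan | PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer | Pytorch | 1.2.7 and later | |
| beauty_gan | BeautyGAN | Pytorch | 1.2.7 and later | |
| animeganv2 | PyTorch implementation of AnimeGANv2 | Pytorch | 1.2.5 and later | |
| pix2pixHD | pix2pixHD: High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Pytorch | 1.2.6 and later | |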

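## Super resolution

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| srresnet | Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | Pytorch | 1.2.0 and later | EN JP |
| edsr | Enhanced Deep Residual Networks for Single Image Super-Resolution | Pytorch | 1.2.6 and later | EN JP |
| han | Single Image Super-Resolution via a Holistic Attention Network | Pytorch | 1.2.6 and later | |
| real-esrgan | Real-ESRGAN | Pytorch | 1.2.9 and later | |
| rcan-it | Revisiting RCAN: Improved Training for Image Super-Resolution | Pytorch | 1.2.10 and later | |
| swinir | SwinIR: Image Restoration Using Swin Transformer | Pytorch | 1.2.12 and later | |
| Hat | Hat | Pytorch | 1.2.6 and later | |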

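## Text detection

| Model | Reference | Exported From | Supported Ailia Version |
|------------:|:------------:|:------------:|:------------:|
| craft_pytorch | CRAFT: Character-Region Awareness For Text detection | Pytorch | 1.2.2 and later |
| pixel_link | Pixel-Link | TensorFlow | 1.2.6 and later |
| east | EAST: An Efficient and Accurate Scene Text Detector | TensorFlow | 1.2.6 and later |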

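## Text recognition

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| etl | Japanese Character Classification | Keras | 1.1.0 and later | JP |
| deep-text-recognition-benchmark | deep-text-recognition-benchmark | Pytorch | 1.2.6 and later | |
| crnn.pytorch | Convolutional Recurrent Neural Network | Pytorch | 1.2.6 and later | |
| paddleocr | PaddleOCR: multilingual OCR toolkit based on PaddlePaddle | Pytorch | 1.2.6 and later | EN JP |
| easyocr | Ready-to-use OCR with 80+ supported languages | Pytorch | 1.2.6 and later | |
| ndlocr_text_recognition | NDL OCR | Pytorch | 1.2.5 and later | |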

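## Time-series forecasting

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| informer2020 | Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting (AAAI'21 Best Paper) | Pytorch | 1.2.10 and later | |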

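## Vehicle recognition

| Model | Reference | Exported From | Supported Ailia Version | Blog |
|------------:|:------------:|:------------:|:------------:|:------------:|
| vehicle-attributes-recognition-barrier | vehicle-attributes-recognition-barrier-0042 | OpenVINO | 1.2.5 and later | EN JP |
| vehicle-license-plate-detection-barrier | vehicle-license-plate-detection-barrier-0106 | OpenVINO | 1.2.5 and later | |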