Deep Learning Examples

Introduction

This repository provides NVIDIA's state-of-the-art deep learning examples, which are easy to train and deploy and achieve the best reproducible accuracy and performance with the NVIDIA CUDA-X software stack running on NVIDIA Volta, Turing, and Ampere GPUs.

DeepOps Deployment Options

Once the cluster and a local Docker registry are accessible by all nodes (the default registry address is registry.local:31500), use Helm to deploy a JupyterLab session, which is exposed via a NodePort (the default port is 30888).

helm install <DEPLOYMENT_EXAMPLE_NAME> workloads/examples/k8s/deep-learning-examples --set exampleName=<DEPLOYMENT_EXAMPLE_NAME>

To customize a deployment, pass overrides with the --set flag or edit values.yaml directly. See the values.yaml file for details on the available configuration.
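As a concrete illustration of the command above, the following sketch fills in the placeholder with one name from the Deployment Example Name Options list and prints the resulting command rather than running it. The variable names (`EXAMPLE`, `CHART`, `NODE_PORT`, `HELM_CMD`, `JUPYTER_URL`), the `<NODE_IP>` placeholder, and the use of plain `http` are assumptions made for illustration; the chart path and port are the values stated in this document.

```shell
#!/bin/sh
# Illustrative sketch: build and print the helm command for one example.
# Nothing is deployed here; the command is only echoed.

# One of the names from "Deployment Example Name Options".
EXAMPLE="pytorch-classification-convnets"
# Chart path taken from the helm command in this document.
CHART="workloads/examples/k8s/deep-learning-examples"
# Default NodePort stated in this document.
NODE_PORT=30888

# The release name and exampleName use the same value, as in the original command.
HELM_CMD="helm install ${EXAMPLE} ${CHART} --set exampleName=${EXAMPLE}"
echo "${HELM_CMD}"

# Assumed URL shape for the Jupyter session once deployed.
# <NODE_IP> is a placeholder for the address of any cluster node.
JUPYTER_URL="http://<NODE_IP>:${NODE_PORT}"
echo "${JUPYTER_URL}"
```

To perform the deployment, run the printed command from the repository root. `helm show values "$CHART"` lists the values the chart exposes, and `kubectl get svc` shows the NodePort that was actually assigned.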

Deployment Example Name Options:

- cuda-optimized-fastspeech
- dglpytorch-drugdiscovery-se3transformer
- mxnet-classification
- pytorch-classification-convnets
- pytorch-detection-efficientdet
- pytorch-detection-ssd
- pytorch-forecasting-tft
- pytorch-languagemodeling-bart
- pytorch-languagemodeling-bert
- pytorch-languagemodeling-transformer-xl
- pytorch-recommendation-dlrm
- pytorch-recommendation-ncf
- pytorch-segmentation-maskrcnn
- pytorch-segmentation-nnunet
- pytorch-speechrecognition-jasper
- pytorch-speechrecognition-quartznet
- pytorch-speechsynthesis-fastpitch
- pytorch-speechsynthesis-tacotron2
- pytorch-translation-gnmt
- pytorch-translation-transformer
- tensorflow-classification-convnets
- tensorflow-detection-ssd
- tensorflow-languagemodeling-bert
- tensorflow-languagemodeling-transformerxl
- tensorflow-recommendation-ncf
- tensorflow-recommendation-vaecf
- tensorflow-recommendation-wideanddeep
- tensorflow-segmentation-unet3dmedical
- tensorflow-segmentation-unetindustrial
- tensorflow-segmentation-unetmedical
- tensorflow-segmentation-vnet
- tensorflow-translation-gnmt
- tensorflow2-efficientnet
- tensorflow2-languagemodeling-bert
- tensorflow2-languagemodeling-electra
- tensorflow2-recommendation-dlrm
- tensorflow2-recommendation-wideanddeep
- tensorflow2-segmentation-maskrcnn
- tensorflow2-segmentation-unet-medical
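
Because the example name is passed to Helm as free text, a small guard can catch typos before anything is deployed. The sketch below is one possible approach, not part of the chart: the function names `is_known_example` and `check_example` are hypothetical, and only a few names from the list above are copied in for brevity.

```shell
#!/bin/sh
# Illustrative guard: accept only names that appear in the list above.
# Only a handful of names are included here; extend the pattern as needed.
is_known_example() {
  case "$1" in
    pytorch-classification-convnets|pytorch-detection-ssd|tensorflow2-efficientnet|mxnet-classification)
      return 0 ;;
    *)
      return 1 ;;
  esac
}

# Print a short verdict for a candidate name.
check_example() {
  if is_known_example "$1"; then
    echo "ok: $1"
  else
    echo "unknown example name: $1"
  fi
}

check_example "pytorch-detection-ssd"
check_example "pytorch-detection-sdd"   # deliberate typo, reported as unknown
```

A wrapper script could call `is_known_example` first and run `helm install` only when it returns success.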

Performance

Computer Vision

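| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ResNet-50 | PyTorch | Yes | Yes | Yes | - | Yes | - | Yes | Yes | - |
| ResNeXt-101 | PyTorch | Yes | Yes | Yes | - | Yes | - | Yes | Yes | - |
| SE-ResNeXt-101 | PyTorch | Yes | Yes | Yes | - | Yes | - | Yes | Yes | - |
| EfficientNet-B0 | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| EfficientNet-B4 | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| EfficientNet-WideSE-B0 | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| EfficientNet-WideSE-B4 | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| Mask R-CNN | PyTorch | Yes | Yes | Yes | - | - | - | - | - | Yes |
| nnUNet | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| SSD | PyTorch | Yes | Yes | Yes | - | - | - | - | - | Yes |
| ResNet-50 | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| ResNeXt101 | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| SE-ResNeXt-101 | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| Mask R-CNN | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| SSD | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | Yes |
| U-Net Ind | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | Yes |
| U-Net Med | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| U-Net 3D | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| V-Net Med | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| U-Net Med | TensorFlow2 | Yes | Yes | Yes | - | - | - | - | Yes | - |
| Mask R-CNN | TensorFlow2 | Yes | Yes | Yes | - | - | - | - | Yes | - |
| EfficientNet | TensorFlow2 | Yes | Yes | Yes | Yes | - | - | - | Yes | - |
| ResNet-50 | MXNet | - | Yes | Yes | - | - | - | - | - | - |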

Natural Language Processing

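| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| BERT | PyTorch | Yes | Yes | Yes | Yes | - | - | Yes | Yes | - |
| TransformerXL | PyTorch | Yes | Yes | Yes | Yes | - | - | - | Yes | - |
| GNMT | PyTorch | Yes | Yes | Yes | - | - | - | - | - | - |
| Transformer | PyTorch | Yes | Yes | Yes | - | - | - | - | - | - |
| ELECTRA | TensorFlow2 | Yes | Yes | Yes | Yes | - | - | - | Yes | - |
| BERT | TensorFlow | Yes | Yes | Yes | Yes | Yes | - | Yes | Yes | Yes |
| BERT | TensorFlow2 | Yes | Yes | Yes | Yes | - | - | - | Yes | - |
| BioBert | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | Yes |
| TransformerXL | TensorFlow | Yes | Yes | Yes | - | - | - | - | - | - |
| GNMT | TensorFlow | Yes | Yes | Yes | - | - | - | - | - | - |
| Faster Transformer | TensorFlow | - | - | - | - | Yes | - | - | - | - |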

Recommender Systems

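| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DLRM | PyTorch | Yes | Yes | Yes | - | - | Yes | Yes | Yes | Yes |
| DLRM | TensorFlow2 | Yes | Yes | Yes | Yes | - | - | - | Yes | - |
| NCF | PyTorch | Yes | Yes | Yes | - | - | - | - | - | - |
| Wide&Deep | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| Wide&Deep | TensorFlow2 | Yes | Yes | Yes | - | - | - | - | Yes | - |
| NCF | TensorFlow | Yes | Yes | Yes | - | - | - | - | Yes | - |
| VAE-CF | TensorFlow | Yes | Yes | Yes | - | - | - | - | - | - |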

Speech to Text

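| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Jasper | PyTorch | Yes | Yes | Yes | - | Yes | Yes | Yes | Yes | Yes |
| Hidden Markov Model | Kaldi | - | - | Yes | - | - | - | Yes | - | - |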

Text to Speech

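| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FastPitch | PyTorch | Yes | Yes | Yes | - | - | - | - | Yes | - |
| FastSpeech | PyTorch | - | Yes | Yes | - | Yes | - | - | - | - |
| Tacotron 2 and WaveGlow | PyTorch | Yes | Yes | Yes | - | Yes | Yes | Yes | Yes | - |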

Graph Neural Networks

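| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | DLC | NB |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| SE(3)-Transformer | PyTorch | Yes | Yes | Yes | - | - | - | - | - | - |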