Models, Datasets and Transformations for Images


[Up] [Top]

Documentation for package ‘torchvision’ version 0.8.0

Help Pages

B C D E F G I K L M N O P Q R T V W misc

-- B --

base_loader Base loader
batched_nms Batched Non-maximum Suppression (NMS)
box_area Box Area
box_convert Box Convert
box_cxcywh_to_xyxy box_cxcywh_to_xyxy
box_iou Box IoU
box_xywh_to_xyxy box_xywh_to_xyxy
box_xyxy_to_cxcywh box_xyxy_to_cxcywh
box_xyxy_to_xywh box_xyxy_to_xywh

-- C --

caltech101_dataset Caltech Datasets
caltech256_dataset Caltech Datasets
caltech_dataset Caltech Datasets
cifar100_dataset CIFAR datasets
cifar10_dataset CIFAR datasets
clip_boxes_to_image Clip Boxes to Image
coco_caption_dataset COCO Caption Dataset
coco_detection_dataset COCO Detection Dataset

-- D --

draw_bounding_boxes Draws bounding boxes on image.
draw_bounding_boxes.default Draws bounding boxes on image.
draw_bounding_boxes.image_with_bounding_box Draws bounding boxes on image.
draw_bounding_boxes.torch_tensor Draws bounding boxes on image.
draw_keypoints Draws Keypoints
draw_segmentation_masks Draw segmentation masks
draw_segmentation_masks.default Draw segmentation masks
draw_segmentation_masks.image_with_segmentation_mask Draw segmentation masks
draw_segmentation_masks.torch_tensor Draw segmentation masks

-- E --

emnist_collection MNIST and Derived Datasets
emnist_dataset MNIST and Derived Datasets
eurosat100_dataset EuroSAT datasets
eurosat_all_bands_dataset EuroSAT datasets
eurosat_dataset EuroSAT datasets

-- F --

fashion_mnist_dataset MNIST and Derived Datasets
fer_dataset FER-2013 Facial Expression Dataset
fgvc_aircraft_dataset FGVC Aircraft Dataset
flickr30k_caption_dataset Flickr Caption Datasets
flickr8k_caption_dataset Flickr Caption Datasets
flickr_caption_dataset Flickr Caption Datasets
flowers102_dataset Oxford Flowers 102 Dataset

-- G --

generalized_box_iou Generalized Box IoU

-- I --

imagenet_classes ImageNet Class Labels
imagenet_label ImageNet Class Labels
image_folder_dataset Create an image folder dataset

-- K --

kmnist_dataset MNIST and Derived Datasets

-- L --

lfw_dataset LFW Datasets
lfw_pairs_dataset LFW Datasets
lfw_people_dataset LFW Datasets

-- M --

magick_loader Load an Image using ImageMagick
mnist_dataset MNIST and Derived Datasets
model_alexnet AlexNet Model Architecture
model_convnext ConvNeXt Implementation
model_convnext_base_1k ConvNeXt Implementation
model_convnext_base_22k ConvNeXt Implementation
model_convnext_large_1k ConvNeXt Implementation
model_convnext_large_22k ConvNeXt Implementation
model_convnext_small_22k ConvNeXt Implementation
model_convnext_small_22k1k ConvNeXt Implementation
model_convnext_tiny_1k ConvNeXt Implementation
model_convnext_tiny_22k ConvNeXt Implementation
model_deeplabv3 DeepLabV3 Models
model_deeplabv3_resnet101 DeepLabV3 Models
model_deeplabv3_resnet50 DeepLabV3 Models
model_efficientnet EfficientNet Models
model_efficientnet_b0 EfficientNet Models
model_efficientnet_b1 EfficientNet Models
model_efficientnet_b2 EfficientNet Models
model_efficientnet_b3 EfficientNet Models
model_efficientnet_b4 EfficientNet Models
model_efficientnet_b5 EfficientNet Models
model_efficientnet_b6 EfficientNet Models
model_efficientnet_b7 EfficientNet Models
model_efficientnet_v2 EfficientNetV2 Models
model_efficientnet_v2_l EfficientNetV2 Models
model_efficientnet_v2_m EfficientNetV2 Models
model_efficientnet_v2_s EfficientNetV2 Models
model_facenet MTCNN Face Detection Networks
model_facenet_inception_resnet_v1 MTCNN Face Detection Networks
model_facenet_onet MTCNN Face Detection Networks
model_facenet_pnet MTCNN Face Detection Networks
model_facenet_rnet MTCNN Face Detection Networks
model_fasterrcnn Faster R-CNN Models
model_fasterrcnn_mobilenet_v3_large_320_fpn Faster R-CNN Models
model_fasterrcnn_mobilenet_v3_large_fpn Faster R-CNN Models
model_fasterrcnn_resnet50_fpn Faster R-CNN Models
model_fasterrcnn_resnet50_fpn_v2 Faster R-CNN Models
model_fcn_resnet Fully Convolutional Network for Semantic Segmentation
model_fcn_resnet101 Fully Convolutional Network for Semantic Segmentation
model_fcn_resnet50 Fully Convolutional Network for Semantic Segmentation
model_inception_v3 Inception v3 model
model_maxvit MaxViT Model
model_mobilenet_v2 MobileNetV2 Model
model_mobilenet_v3 MobileNetV3 Model
model_mobilenet_v3_large MobileNetV3 Model
model_mobilenet_v3_large_quantized MobileNetV3 Model
model_mobilenet_v3_small MobileNetV3 Model
model_mtcnn MTCNN Face Detection Networks
model_resnet ResNet implementation
model_resnet101 ResNet implementation
model_resnet152 ResNet implementation
model_resnet18 ResNet implementation
model_resnet34 ResNet implementation
model_resnet50 ResNet implementation
model_resnext101_32x8d ResNet implementation
model_resnext50_32x4d ResNet implementation
model_vgg VGG implementation
model_vgg11 VGG implementation
model_vgg11_bn VGG implementation
model_vgg13 VGG implementation
model_vgg13_bn VGG implementation
model_vgg16 VGG implementation
model_vgg16_bn VGG implementation
model_vgg19 VGG implementation
model_vgg19_bn VGG implementation
model_vit Vision Transformer Implementation
model_vit_b_16 Vision Transformer Implementation
model_vit_b_32 Vision Transformer Implementation
model_vit_h_14 Vision Transformer Implementation
model_vit_l_16 Vision Transformer Implementation
model_vit_l_32 Vision Transformer Implementation
model_wide_resnet101_2 ResNet implementation
model_wide_resnet50_2 ResNet implementation

-- N --

nms Non-maximum Suppression (NMS)

-- O --

oxfordiiitpet_binary_dataset Oxford-IIIT Pet Classification Datasets
oxfordiiitpet_dataset Oxford-IIIT Pet Classification Datasets
oxfordiiitpet_segmentation_dataset Oxford-IIIT Pet Segmentation Dataset

-- P --

pascal_detection_dataset Pascal VOC Datasets
pascal_segmentation_dataset Pascal VOC Datasets
pascal_voc_datasets Pascal VOC Datasets
places365_dataset Places365 Dataset
places365_dataset_large Places365 Dataset

-- Q --

qmnist_dataset MNIST and Derived Datasets

-- R --

remove_small_boxes Remove Small Boxes
rf100_biology_collection RoboFlow 100 Biology dataset Collection
rf100_damage_collection RoboFlow 100 Damages dataset Collection
rf100_document_collection RF100 Document Collection Datasets
rf100_infrared_collection RoboFlow 100 Infrared dataset Collection
rf100_medical_collection RoboFlow 100 Medical dataset Collection
rf100_peixos_segmentation_dataset RF100 Peixos Segmentation Dataset
rf100_underwater_collection RoboFlow 100 Underwater dataset Collection

-- T --

tensor_image_browse Display image tensor
tensor_image_display Display image tensor
tiny_imagenet_dataset Tiny ImageNet dataset
transform_adjust_brightness Adjust the brightness of an image
transform_adjust_contrast Adjust the contrast of an image
transform_adjust_gamma Adjust the gamma of an RGB image
transform_adjust_hue Adjust the hue of an image
transform_adjust_saturation Adjust the color saturation of an image
transform_affine Apply affine transformation on an image keeping image center invariant
transform_center_crop Crops the given image at the center
transform_color_jitter Randomly change the brightness, contrast and saturation of an image
transform_convert_image_dtype Convert a tensor image to the given 'dtype' and scale the values accordingly
transform_crop Crop the given image at specified location and output size
transform_five_crop Crop image into four corners and a central crop
transform_grayscale Convert image to grayscale
transform_hflip Horizontally flip a PIL Image or Tensor
transform_linear_transformation Transform a tensor image with a square transformation matrix and a mean_vector computed offline
transform_normalize Normalize a tensor image with mean and standard deviation
transform_pad Pad the given image on all sides with the given "pad" value
transform_perspective Perspective transformation of an image
transform_random_affine Random affine transformation of the image keeping center invariant
transform_random_apply Apply a list of transformations randomly with a given probability
transform_random_choice Apply single transformation randomly picked from a list
transform_random_crop Crop the given image at a random location
transform_random_erasing Randomly selects a rectangular region in an image and erases its pixel values
transform_random_grayscale Randomly convert image to grayscale with a given probability
transform_random_horizontal_flip Horizontally flip an image randomly with a given probability
transform_random_order Apply a list of transformations in a random order
transform_random_perspective Random perspective transformation of an image with a given probability
transform_random_resized_crop Crop image to random size and aspect ratio
transform_random_rotation Rotate the image by angle
transform_random_vertical_flip Vertically flip an image randomly with a given probability
transform_resize Resize the input image to the given size
transform_resized_crop Crop an image and resize it to a desired size
transform_rgb_to_grayscale Convert RGB Image Tensor to Grayscale
transform_rotate Angular rotation of an image
transform_ten_crop Crop an image and the flipped image each into four corners and a central crop
transform_to_tensor Convert an image to a tensor
transform_vflip Vertically flip a PIL Image or Tensor

-- V --

vision_make_grid A simplified version of torchvision.utils.make_grid

-- W --

whoi_plankton_dataset WHOI Plankton Datasets
whoi_small_coralnet_dataset Coralnet Dataset
whoi_small_plankton_dataset WHOI Plankton Datasets

-- misc --

.rename_mobilenet_v3_state_dict Rename state_dict keys to match current MobileNetV3 model structure