Image Matching Datasets

This post summarized the most used benchmark datasets for the Image Matching Task.

Dataset name	Size (pairs)	Class	Source datasets	Annotations	Characteristics
Caltech-101	1,515	101	Caltech-101	object segmentation	tightly cropped images of objects, little background
PASCAL-PARTS	3,884	20	PASCAL-PARTS PASCAL3D+	keypoints (0~12), azimuth, elevation, cyclo-rotation, body part segmentation	tightly cropped images of objects, little background, part and 3D information
Animal-parts	≈7,000	100	ILSVRC 2012	keypoints (1~6)	keypoints limited to eyes and feet of animals
CUB-200-2011	120k	200	CUB-200-2011	15 part locations, 312 binary attributes, bbox	tightly cropped images of object, only bird images
TSS	400	9	FG3DCar, JODS, PASCAL	object segmentation, flow, vectors	cropped images of objects, moderate background
PF-WILLOW	900	5	PASCAL VOC 2007, Caltech-256	keypoints (10)	center-aligned images, pairs with the same viewpoint
PF-PASCAL	1,300	20	PASCAL VOC 2007	keypoints (4~17), bbox	pairs with the same viewpoint
SPair-71k	70,958	18	PASCAL3D+, PASCAL VOC 2012	keypoints (3~30), azimuth, viewpoint diff., scale diff., trunc. diff., occl. diff., object seg., bbox	large-scale data with diverse variations, rich annotations, clear dataset splits

CUB-200-2011

Paper: C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, “The Caltech-UCSD birds-200–2011 dataset,” Tech. Rep., 2011.

Contains 11,788 images of 200 bird categories, with 15 parts annotated

Download: http://www.vision.caltech.edu/visipedia/CUB-200-2011.html

TSS dataset

Paper: T. Taniai, S. N. Sinha, and Y. Sato, “Joint recovery of dense correspondence and cosegmentation in two images,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 4246–4255.

Consists of 400 image pairs divided into three groups: FG3DCar, JODS, and PASCAL

Download: https://taniai.space/projects/cvpr16_dccs/

PF-WILLOW dataset

Paper: B. Ham, M. Cho, C. Schmid, and J. Ponce, “Proposal ﬂow,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 3475–3484.
Includes 10 object sub-classes with 10 keypoint annotations for each image.
Download: https://www.di.ens.fr/willow/research/proposalflow/

PF-PASCAL

Paper: B. Ham, M. Cho, C. Schmid, and J. Ponce, “Proposal ﬂow,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 3475–3484.

Contains 1,351 image pairs for 20 object categories with PASCAL keypoint annotations
Download: https://www.di.ens.fr/willow/research/proposalflow/

SPair-71k

Paper: J. Min, J. Lee, J. Ponce, and M. Cho, “Hyperpixel ﬂow: Semantic correspondence with multi-layer neural features,” in Proc. IEEE Int. Conf. Comput. Vis., 2019, pp. 3395–3404.

Provides 70,958 image pairs of 18 object categories with ground-truth annotations for object bounding boxes, segmentation masks, and keypoints. The image pairs feature various changes in viewpoint, scale, truncation, and occlusion.

Leave a Reply Cancel reply