Top Cited Papers: CVPR 2017
Best Paper Award
“Densely Connected Convolutional Networks”
G. Huang, Z. Liu, L. van der Maaten, K. Q. Weinberger

Honorable Mentions
“Annotating Object Instances with a Polygon-RNN”
L. Castrejon, K. Kundu, R. Urtasun, S. Fidler

“YOLO9000: Better, Faster, Stronger”
J. Redmon, A. Farhadi

Best Student Paper Award
“Computational Imaging on the Electric Grid”
M. Sheinin, Y. Y. Schechner, K. N. Kutulakos

Longuet-Higgins Prize (Test of Time)
“Accurate, Dense, and Robust Multi-View Stereopsis”
Y. Furukawa, J. Ponce

“Object Retrieval with Large Vocabularies and Fast Spatial Matching”
J. Philbin, O. Chum, M. Isard, J. Sivic, A. Zisserman
Curated Papers:
Densely Connected Convolutional Networks
Recent work has shown that convolutional networks can be substantially deeper, more accurate, and more efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we embrace this observation and introduce the Dense Convolutional Network (DenseNet), which connects each layer to every other layer in a feed-forward fashion. Whereas traditional convolutional networks with L layers have L connections (one between each layer and its subsequent layer), our network has L(L+1)/2 direct connections. For each layer, the feature maps of all preceding layers are used as inputs, and its own feature maps are used as inputs to all subsequent layers. DenseNets have several compelling advantages: they alleviate the vanishing-gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters. We evaluate our proposed architecture on four highly competitive object recognition benchmarks (CIFAR-10, CIFAR-100, SVHN, and ImageNet). DenseNets obtain significant improvements over the state of the art on most of them, while requiring less memory and computation to achieve high performance. Code and pre-trained models are available at https://github.com/liuzhuang13/DenseNet.
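The core mechanic of the abstract above is concatenation: each layer consumes the stacked feature maps of every layer before it, so channel count grows by a fixed "growth rate" per layer. Here is a toy NumPy sketch of that connectivity pattern (random linear maps stand in for the real convolution + batch-norm layers; the layout and names are illustrative, not the paper's code):

```python
import numpy as np

def dense_block(x, num_layers, growth_rate, rng):
    """Toy dense block: each 'layer' maps the concatenation of all
    previous feature maps to growth_rate new channels, and its output
    is appended to the running list of features."""
    features = [x]  # every feature map produced so far, input included
    for _ in range(num_layers):
        inp = np.concatenate(features, axis=0)      # (channels, H*W) layout
        w = rng.standard_normal((growth_rate, inp.shape[0]))
        out = np.maximum(w @ inp, 0.0)              # linear map + ReLU
        features.append(out)
    return np.concatenate(features, axis=0)

rng = np.random.default_rng(0)
x = rng.standard_normal((16, 64))                   # 16 input channels
y = dense_block(x, num_layers=4, growth_rate=12, rng=rng)
print(y.shape)                                      # channels: 16 + 4*12 = 64
```

With L layers the block contains L(L+1)/2 direct connections, exactly the count quoted in the abstract, because layer i receives i incoming feature maps.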
Image-to-Image Translation with Conditional Adversarial Networks
We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Moreover, since the release of the pix2pix software associated with this paper, hundreds of Twitter users have posted their own artistic experiments using our system. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without hand-engineering our loss functions either.
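The "learned loss" in the abstract above is the adversarial term; pix2pix combines it with a weighted L1 term that ties the output to the paired ground truth. A minimal NumPy sketch of the generator objective, assuming a non-saturating log loss on the discriminator's score for the generated image (function names and the scalar-score interface are illustrative):

```python
import numpy as np

def pix2pix_generator_loss(d_fake, fake, target, lam=100.0):
    """Generator objective sketch: an adversarial term that rewards
    fooling the discriminator, plus a lambda-weighted L1 term that
    keeps the output close to the paired ground-truth image."""
    eps = 1e-12                                    # numerical safety for log
    adv = -np.mean(np.log(d_fake + eps))           # fool the discriminator
    l1 = np.mean(np.abs(fake - target))            # stay close to the target
    return adv + lam * l1

# An output identical to the target, shown to a fooled discriminator,
# drives the loss toward zero.
fake = target = np.ones((3, 8, 8))
print(pix2pix_generator_loss(np.array([0.99]), fake, target))
```

The large L1 weight (the paper reports λ = 100) is what keeps the generator from drifting into plausible-but-wrong outputs that the adversarial term alone would tolerate.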
Feature Pyramid Networks for Object Detection
Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.
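The top-down pathway described above can be sketched in a few lines: project each backbone level to a common channel depth with a 1x1 lateral connection, then, starting from the coarsest level, upsample by 2x and add into the next finer lateral. This NumPy version uses random projection matrices and nearest-neighbour upsampling, and omits the 3x3 smoothing convolutions FPN applies to each merged map, so treat it as a shape-level illustration only:

```python
import numpy as np

def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a (C, H, W) feature map."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def fpn_top_down(features, d=256, rng=None):
    """Build an FPN-style pyramid from backbone maps ordered fine -> coarse."""
    if rng is None:
        rng = np.random.default_rng(0)
    laterals = []
    for f in features:                       # 1x1 lateral: project to d channels
        w = rng.standard_normal((d, f.shape[0]))
        c, h, width = f.shape
        laterals.append((w @ f.reshape(c, -1)).reshape(d, h, width))
    merged = [laterals[-1]]                  # start from the coarsest level
    for lat in reversed(laterals[:-1]):      # walk back toward the finest
        merged.append(lat + upsample2x(merged[-1]))
    return list(reversed(merged))            # return fine -> coarse again

rng = np.random.default_rng(0)
feats = [rng.standard_normal((c, s, s)) for c, s in [(64, 32), (128, 16), (256, 8)]]
pyramid = fpn_top_down(feats, d=256, rng=rng)
print([p.shape for p in pyramid])
```

Every output level ends up with the same channel depth, which is what lets a single detection head run unchanged over all scales.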
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. João Carreira, Andrew Zisserman. CVPR 2017, pp. 4724–4733. Cited by: 631 papers, 1 patent.
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang. CVPR 2017, pp. 5835–5843. Cited by: 347 papers.
SphereFace: Deep Hypersphere Embedding for Face Recognition. Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song. CVPR 2017, pp. 6738–6746. Cited by: 339 papers.
End-to-End Representation Learning for Correlation Filter Based Tracking. Jack Valmadre, Luca Bertinetto, João Henriques, Andrea Vedaldi, Philip H. S. Torr. CVPR 2017, pp. 5000–5008. Cited by: 301 papers, 1 patent.
Image Super-Resolution via Deep Recursive Residual Network. Ying Tai, Jian Yang, Xiaoming Liu. CVPR 2017, pp. 2790–2798. Cited by: 298 papers, 1 patent.
Learning from Simulated and Unsupervised Images through Adversarial Training. Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb. CVPR 2017, pp. 2242–2251. Cited by: 276 papers, 6 patents.
Learning Deep CNN Denoiser Prior for Image Restoration. Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang. CVPR 2017, pp. 2808–2817. Cited by: 275 papers.
Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification. Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang. CVPR 2017, pp. 1320–1329. Cited by: 226 papers, 1 patent.
EAST: An Efficient and Accurate Scene Text Detector. Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang. CVPR 2017, pp. 2642–2651. Cited by: 226 papers.
Re-ranking Person Re-identification with k-Reciprocal Encoding. Zhun Zhong, Liang Zheng, Donglin Cao, Shaozi Li. CVPR 2017, pp. 3652–3661. Cited by: 226 papers, 1 patent.
OctNet: Learning Deep 3D Representations at High Resolutions. Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger. CVPR 2017, pp. 6620–6629. Cited by: 215 papers, 1 patent.
Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network. Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun. CVPR 2017, pp. 1743–1751. Cited by: 208 papers, 3 patents.
Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion. Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang. CVPR 2017, pp. 907–915. Cited by: 198 papers, 1 patent.
A Point Set Generation Network for 3D Object Reconstruction from a Single Image. Haoqiang Fan, Hao Su, Leonidas Guibas. CVPR 2017, pp. 2463–2471. Cited by: 195 papers.
Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring. Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee. CVPR 2017, pp. 257–265. Cited by: 189 papers.
Semantic Scene Completion from a Single Depth Image. Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser. CVPR 2017, pp. 190–198. Cited by: 177 papers.
Learning Deep Context-Aware Features over Body and Latent Parts for Person Re-identification. Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang. CVPR 2017, pp. 7398–7407. Cited by: 171 papers.
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition. Luan Tran, Xi Yin, Xiaoming Liu. CVPR 2017, pp. 1283–1292. Cited by: 167 papers.
Semantic Image Inpainting with Deep Generative Models. Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do. CVPR 2017, pp. 6882–6890. Cited by: 161 papers.
Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition. Jianlong Fu, Heliang Zheng, Tao Mei. CVPR 2017, pp. 4476–4484. Cited by: 159 papers, 2 patents.
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. R. Qi Charles, Hao Su, Mo Kaichun, Leonidas J. Guibas. CVPR 2017, pp. 77–85. Cited by: 157 papers, 1 patent.
ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases. Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers. CVPR 2017, pp. 3462–3471. Cited by: 156 papers, 2 patents.