| Video-to-Video Synthesis | NIPS | code | 5578 |
| Deep Image Prior | CVPR | code | 3736 |
| StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR | code | 3405 |
| Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | ECCV | code | 2434 |
| Learning to See in the Dark | CVPR | code | 2326 |
| Glow: Generative Flow with Invertible 1x1 Convolutions | NIPS | code | 2088 |
| Squeeze-and-Excitation Networks | CVPR | code | 1477 |
| Efficient Neural Architecture Search via Parameters Sharing | ICML | code | 1382 |
| Multimodal Unsupervised Image-to-image Translation | ECCV | code | 1296 |
| Non-Local Neural Networks | CVPR | code | 992 |
| Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? | CVPR | code | 924 |
| Single-Shot Refinement Neural Network for Object Detection | CVPR | code | 875 |
| Image Generation From Scene Graphs | CVPR | code | 851 |
| GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV | code | 772 |
| Simple Baselines for Human Pose Estimation and Tracking | ECCV | code | 752 |
| Visualizing the Loss Landscape of Neural Nets | NIPS | code | 724 |
| Detect-and-Track: Efficient Pose Estimation in Videos | CVPR | code | 650 |
| Relation Networks for Object Detection | CVPR | code | 635 |
| Generative Image Inpainting With Contextual Attention | CVPR | code | 609 |
| PointCNN | NIPS | code | 607 |
| Look at Boundary: A Boundary-Aware Face Alignment Algorithm | CVPR | code | 575 |
| Pelee: A Real-Time Object Detection System on Mobile Devices | NIPS | code | 548 |
| Distractor-aware Siamese Networks for Visual Object Tracking | ECCV | code | 545 |
| Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples | ICML | code | 535 |
| Which Training Methods for GANs do actually Converge? | ICML | code | 520 |
| End-to-End Recovery of Human Shape and Pose | CVPR | code | 502 |
| Taskonomy: Disentangling Task Transfer Learning | CVPR | code | 502 |
| Cascaded Pyramid Network for Multi-Person Pose Estimation | CVPR | code | 497 |
| Neural 3D Mesh Renderer | CVPR | code | 489 |
| Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs | CVPR | code | 489 |
| In-Place Activated BatchNorm for Memory-Optimized Training of DNNs | CVPR | code | 485 |
| The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | CVPR | code | 447 |
| Frustum PointNets for 3D Object Detection From RGB-D Data | CVPR | code | 434 |
| The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks | CVPR | code | 416 |
| ICNet for Real-Time Semantic Segmentation on High-Resolution Images | ECCV | code | 415 |
| PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume | CVPR | code | 398 |
| Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++ | CVPR | code | 397 |
| Gibson Env: Real-World Perception for Embodied Agents | CVPR | code | 385 |
| Acquisition of Localization Confidence for Accurate Object Detection | ECCV | code | 384 |
| Noise2Noise: Learning Image Restoration without Clean Data | ICML | code | 370 |
| GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation | CVPR | code | 359 |
| GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose | CVPR | code | 359 |
| A Style-Aware Content Loss for Real-time HD Style Transfer | ECCV | code | 349 |
| Soccer on Your Tabletop | CVPR | code | 338 |
| Pyramid Stereo Matching Network | CVPR | code | 335 |
| Neural Baby Talk | CVPR | code | 332 |
| License Plate Detection and Recognition in Unconstrained Scenarios | ECCV | code | 326 |
| Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors | CVPR | code | 326 |
| Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images | ECCV | code | 323 |
| Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning | CVPR | code | 317 |
| Fast End-to-End Trainable Guided Filter | CVPR | code | 312 |
| Deep Clustering for Unsupervised Learning of Visual Features | ECCV | code | 302 |
| Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs | CVPR | code | 294 |
| Neural Relational Inference for Interacting Systems | ICML | code | 289 |
| Adversarially Regularized Autoencoders | ICML | code | 282 |
| Learning to Adapt Structured Output Space for Semantic Segmentation | CVPR | code | 280 |
| Convolutional Neural Networks With Alternately Updated Clique | CVPR | code | 272 |
| Learning to Segment Every Thing | CVPR | code | 269 |
| Supervising Unsupervised Learning | NIPS | code | 262 |
| LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation | CVPR | code | 261 |
| Bilinear Attention Networks | NIPS | code | 258 |
| ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation | ECCV | code | 254 |
| An intriguing failing of convolutional neural networks and the CoordConv solution | NIPS | code | 249 |
| End-to-End Learning of Motion Representation for Video Understanding | CVPR | code | 238 |
| Image Super-Resolution Using Very Deep Residual Channel Attention Networks | ECCV | code | 234 |
| Iterative Visual Reasoning Beyond Convolutions | CVPR | code | 228 |
| Semi-Parametric Image Synthesis | CVPR | code | 226 |
| Compressed Video Action Recognition | CVPR | code | 225 |
| Style Aggregated Network for Facial Landmark Detection | CVPR | code | 223 |
| Pose-Robust Face Recognition via Deep Residual Equivariant Mapping | CVPR | code | 220 |
| Multi-Content GAN for Few-Shot Font Style Transfer | CVPR | code | 218 |
| GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models | ICML | code | 214 |
| Referring Relationships | CVPR | code | 210 |
| MoCoGAN: Decomposing Motion and Content for Video Generation | CVPR | code | 205 |
| Latent Alignment and Variational Attention | NIPS | code | 204 |
| LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image | CVPR | code | 202 |
| Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs | CVPR | code | 197 |
| An End-to-End TextSpotter With Explicit Alignment and Attention | CVPR | code | 195 |
| DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks | CVPR | code | 189 |
| SPLATNet: Sparse Lattice Networks for Point Cloud Processing | CVPR | code | 188 |
| Attentive Generative Adversarial Network for Raindrop Removal From a Single Image | CVPR | code | 186 |
| Single View Stereo Matching | CVPR | code | 182 |
| MegaDepth: Learning Single-View Depth Prediction From Internet Photos | CVPR | code | 181 |
| ECO: Efficient Convolutional Network for Online Video Understanding | ECCV | code | 180 |
| Unsupervised Feature Learning via Non-Parametric Instance Discrimination | CVPR | code | 180 |
| ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing | CVPR | code | 179 |
| Video Based Reconstruction of 3D People Models | CVPR | code | 179 |
| Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks | CVPR | code | 178 |
| Learning Category-Specific Mesh Reconstruction from Image Collections | ECCV | code | 176 |
| Realistic Evaluation of Deep Semi-Supervised Learning Algorithms | NIPS | code | 175 |
| BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | ECCV | code | 175 |
| Group Normalization | ECCV | code | 175 |
| Real-Time Seamless Single Shot 6D Object Pose Prediction | CVPR | code | 174 |
| MVSNet: Depth Inference for Unstructured Multi-view Stereo | ECCV | code | 174 |
| Neural Motifs: Scene Graph Parsing With Global Context | CVPR | code | 171 |
| Learning a Single Convolutional Super-Resolution Network for Multiple Degradations | CVPR | code | 169 |
| Optimizing Video Object Detection via a Scale-Time Lattice | CVPR | code | 168 |
| MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network | ECCV | code | 167 |
| Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns | CVPR | code | 166 |
| Weakly Supervised Instance Segmentation Using Class Peak Response | CVPR | code | 166 |
| PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image | CVPR | code | 164 |
| Residual Dense Network for Image Super-Resolution | CVPR | code | 163 |
| Embodied Question Answering | CVPR | code | 162 |
| Evolved Policy Gradients | NIPS | code | 160 |
| Camera Style Adaptation for Person Re-Identification | CVPR | code | 159 |
| Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer | CVPR | code | 159 |
| Scale-Recurrent Network for Deep Image Deblurring | CVPR | code | 159 |
| Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction | CVPR | code | 158 |
| Relational recurrent neural networks | NIPS | code | 157 |
| Densely Connected Pyramid Dehazing Network | CVPR | code | 155 |
| Image Inpainting for Irregular Holes Using Partial Convolutions | ECCV | code | 153 |
| SO-Net: Self-Organizing Network for Point Cloud Analysis | CVPR | code | 152 |
| Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling | CVPR | code | 152 |
| ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices | CVPR | code | 152 |
| DenseASPP for Semantic Segmentation in Street Scenes | CVPR | code | 151 |
| Facelet-Bank for Fast Portrait Manipulation | CVPR | code | 150 |
| Self-Imitation Learning | ICML | code | 145 |
| Graph R-CNN for Scene Graph Generation | ECCV | code | 144 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | CVPR | code | 143 |
| Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation | CVPR | code | 143 |
| Quantized Densely Connected U-Nets for Efficient Landmark Localization | ECCV | code | 143 |
| Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining | ECCV | code | 142 |
| Two-Stream Convolutional Networks for Dynamic Texture Synthesis | CVPR | code | 141 |
| Integral Human Pose Regression | ECCV | code | 141 |
| Adaptive Affinity Fields for Semantic Segmentation | ECCV | code | 141 |
| LSTM Pose Machines | CVPR | code | 141 |
| Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships | CVPR | code | 140 |
| Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform | CVPR | code | 139 |
| Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification | CVPR | code | 137 |
| Learning to Compare: Relation Network for Few-Shot Learning | CVPR | code | 135 |
| CosFace: Large Margin Cosine Loss for Deep Face Recognition | CVPR | code | 135 |
| Deep Depth Completion of a Single RGB-D Image | CVPR | code | 134 |
| Deep Back-Projection Networks for Super-Resolution | CVPR | code | 132 |
| Context Embedding Networks | CVPR | code | 131 |
| Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics | CVPR | code | 131 |
| Perturbative Neural Networks | CVPR | code | 130 |
| Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis | ICML | code | 129 |
| Fast and Accurate Online Video Object Segmentation via Tracking Parts | CVPR | code | 129 |
| Nonlinear 3D Face Morphable Model | CVPR | code | 128 |
| BodyNet: Volumetric Inference of 3D Human Body Shapes | ECCV | code | 126 |
| 3D-CODED: 3D Correspondences by Deep Deformation | ECCV | code | 125 |
| DeepMVS: Learning Multi-View Stereopsis | CVPR | code | 125 |
| Hierarchical Imitation and Reinforcement Learning | ICML | code | 124 |
| Domain Adaptive Faster R-CNN for Object Detection in the Wild | CVPR | code | 123 |
| L4: Practical loss-based stepsize adaptation for deep learning | NIPS | code | 123 |
| A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts | CVPR | code | 122 |
| Recurrent Relational Networks | NIPS | code | 121 |
| Gated Path Planning Networks | ICML | code | 121 |
| PSANet: Point-wise Spatial Attention Network for Scene Parsing | ECCV | code | 121 |
| Rethinking Feature Distribution for Loss Functions in Image Classification | CVPR | code | 120 |
| Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network | CVPR | code | 118 |
| FOTS: Fast Oriented Text Spotting With a Unified Network | CVPR | code | 118 |
| ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes | ECCV | code | 117 |
| PU-Net: Point Cloud Upsampling Network | CVPR | code | 117 |
| PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning | CVPR | code | 117 |
| Long-term Tracking in the Wild: a Benchmark | ECCV | code | 116 |
| Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene | CVPR | code | 114 |
| Repulsion Loss: Detecting Pedestrians in a Crowd | CVPR | code | 113 |
| Unsupervised Attention-guided Image-to-Image Translation | NIPS | code | 110 |
| Attention-based Deep Multiple Instance Learning | ICML | code | 109 |
| Learning Blind Video Temporal Consistency | ECCV | code | 109 |
| Noisy Natural Gradient as Variational Inference | ICML | code | 108 |
| End-to-End Weakly-Supervised Semantic Alignment | CVPR | code | 106 |
| Decoupled Networks | CVPR | code | 105 |
| LiDAR-Video Driving Dataset: Learning Driving Policies Effectively | CVPR | code | 104 |
| MAttNet: Modular Attention Network for Referring Expression Comprehension | CVPR | code | 104 |
| LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks | ECCV | code | 103 |
| FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors | CVPR | code | 100 |
| Deep Mutual Learning | CVPR | code | 100 |
| Macro-Micro Adversarial Network for Human Parsing | ECCV | code | 98 |
| ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans | CVPR | code | 97 |
| Learning Depth From Monocular Videos Using Direct Methods | CVPR | code | 97 |
| VITON: An Image-Based Virtual Try-On Network | CVPR | code | 95 |
| Cascade R-CNN: Delving Into High Quality Object Detection | CVPR | code | 93 |
| Learning Human-Object Interactions by Graph Parsing Neural Networks | ECCV | code | 93 |
| Future Frame Prediction for Anomaly Detection – A New Baseline | CVPR | code | 92 |
| Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence | ECCV | code | 92 |
| Tell Me Where to Look: Guided Attention Inference Network | CVPR | code | 91 |
| Neural Kinematic Networks for Unsupervised Motion Retargetting | CVPR | code | 90 |
| Learning SO(3) Equivariant Representations with Spherical CNNs | ECCV | code | 89 |
| One-Shot Unsupervised Cross Domain Translation | NIPS | code | 89 |
| Synthesizing Images of Humans in Unseen Poses | CVPR | code | 88 |
| Depth-aware CNN for RGB-D Segmentation | ECCV | code | 88 |
| Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights | ECCV | code | 88 |
| Knowledge Aided Consistency for Weakly Supervised Phrase Grounding | CVPR | code | 87 |
| CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes | CVPR | code | 87 |
| Neural Arithmetic Logic Units | NIPS | code | 87 |
| A PID Controller Approach for Stochastic Optimization of Deep Networks | CVPR | code | 87 |
| VITAL: VIsual Tracking via Adversarial Learning | CVPR | code | 86 |
| Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking | CVPR | code | 86 |
| Recurrent Pixel Embedding for Instance Grouping | CVPR | code | 85 |
| SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation | CVPR | code | 84 |
| Multi-Scale Location-Aware Kernel Representation for Object Detection | CVPR | code | 84 |
| Repeatability Is Not Enough: Learning Affine Regions via Discriminability | ECCV | code | 84 |
| “Zero-Shot” Super-Resolution Using Deep Internal Learning | CVPR | code | 84 |
| DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency | ECCV | code | 82 |
| Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction | CVPR | code | 80 |
| Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation | ECCV | code | 78 |
| Generalizing A Person Retrieval Model Hetero- and Homogeneously | ECCV | code | 78 |
| Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning | CVPR | code | 77 |
| Pairwise Confusion for Fine-Grained Visual Classification | ECCV | code | 77 |
| Learning to Reweight Examples for Robust Deep Learning | ICML | code | 76 |
| Improving Generalization via Scalable Neighborhood Component Analysis | ECCV | code | 76 |
| SparseMAP: Differentiable Sparse Structured Inference | ICML | code | 75 |
| PDE-Net: Learning PDEs from Data | ICML | code | 75 |
| Pose-Normalized Image Generation for Person Re-identification | ECCV | code | 75 |
| Disentangled Person Image Generation | CVPR | code | 75 |
| Learning to Navigate for Fine-grained Classification | ECCV | code | 74 |
| Superpixel Sampling Networks | ECCV | code | 74 |
| Shift-Net: Image Inpainting via Deep Feature Rearrangement | ECCV | code | 74 |
| 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation | ECCV | code | 74 |
| Ordinal Depth Supervision for 3D Human Pose Estimation | CVPR | code | 74 |
| Path-Level Network Transformation for Efficient Architecture Search | ICML | code | 73 |
| Diverse Image-to-Image Translation via Disentangled Representations | ECCV | code | 72 |
| Visual Feature Attribution Using Wasserstein GANs | CVPR | code | 72 |
| Real-World Anomaly Detection in Surveillance Videos | CVPR | code | 72 |
| Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval | CVPR | code | 72 |
| Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image | ECCV | code | 72 |
| Learning to Find Good Correspondences | CVPR | code | 72 |
| Learning Less Is More - 6D Camera Localization via 3D Surface Regression | CVPR | code | 72 |
| Object Level Visual Reasoning in Videos | ECCV | code | 71 |
| Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing | CVPR | code | 71 |
| Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration | CVPR | code | 71 |
| Fast and Accurate Single Image Super-Resolution via Information Distillation Network | CVPR | code | 71 |
| Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present | CVPR | code | 70 |
| Multi-Shot Pedestrian Re-Identification via Sequential Decision Making | CVPR | code | 70 |
| PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition | CVPR | code | 69 |
| Progressive Neural Architecture Search | ECCV | code | 68 |
| Generative Neural Machine Translation | NIPS | code | 68 |
| Learning Latent Super-Events to Detect Multiple Activities in Videos | CVPR | code | 67 |
| Generate to Adapt: Aligning Domains Using Generative Adversarial Networks | CVPR | code | 67 |
| Adversarial Feature Augmentation for Unsupervised Domain Adaptation | CVPR | code | 67 |
| Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking | CVPR | code | 67 |
| Pointwise Convolutional Neural Networks | CVPR | code | 67 |
| Optimizing the Latent Space of Generative Networks | ICML | code | 66 |
| Part-Aligned Bilinear Representations for Person Re-Identification | ECCV | code | 64 |
| Geometry-Aware Learning of Maps for Camera Localization | CVPR | code | 63 |
| Fighting Fake News: Image Splice Detection via Learned Self-Consistency | ECCV | code | 62 |
| Isolating Sources of Disentanglement in Variational Autoencoders | NIPS | code | 62 |
| Neural Program Synthesis from Diverse Demonstration Videos | ICML | code | 62 |
| Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation | ECCV | code | 61 |
| Rotation-Sensitive Regression for Oriented Scene Text Detection | CVPR | code | 61 |
| Human Semantic Parsing for Person Re-Identification | CVPR | code | 61 |
| Unsupervised Discovery of Object Landmarks as Structural Representations | CVPR | code | 61 |
| IQA: Visual Question Answering in Interactive Environments | CVPR | code | 60 |
| Hierarchical Long-term Video Prediction without Supervision | ICML | code | 60 |
| Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency | ECCV | code | 60 |
| Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning | CVPR | code | 59 |
| Neural Style Transfer via Meta Networks | CVPR | code | 59 |
| Frame-Recurrent Video Super-Resolution | CVPR | code | 58 |
| PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction | ECCV | code | 57 |
| CBAM: Convolutional Block Attention Module | ECCV | code | 57 |
| Decorrelated Batch Normalization | CVPR | code | 57 |
| Learning Conditioned Graph Structures for Interpretable Visual Question Answering | NIPS | code | 57 |
| Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition | ECCV | code | 57 |
| Leveraging Unlabeled Data for Crowd Counting by Learning to Rank | CVPR | code | 56 |
| Deep Marching Cubes: Learning Explicit Surface Representations | CVPR | code | 56 |
| Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation | CVPR | code | 56 |
| LF-Net: Learning Local Features from Images | NIPS | code | 55 |
| Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model | ECCV | code | 55 |
| Discriminability Objective for Training Descriptive Captions | CVPR | code | 54 |
| BlockDrop: Dynamic Inference Paths in Residual Networks | CVPR | code | 54 |
| Conditional Probability Models for Deep Image Compression | CVPR | code | 54 |
| Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation | CVPR | code | 54 |
| Learning towards Minimum Hyperspherical Energy | NIPS | code | 54 |
| DeepVS: A Deep Learning Based Video Saliency Prediction Approach | ECCV | code | 53 |
| Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting | ECCV | code | 52 |
| Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation | CVPR | code | 52 |
| Wasserstein Introspective Neural Networks | CVPR | code | 51 |
| SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis | CVPR | code | 51 |
| Self-produced Guidance for Weakly-supervised Object Localization | ECCV | code | 51 |
| Measuring abstract reasoning in neural networks | ICML | code | 51 |
| A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation | NIPS | code | 51 |
| RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials | CVPR | code | 51 |
| Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation | ECCV | code | 50 |
| Efficient end-to-end learning for quantizable representations | ICML | code | 50 |
| Visual Question Generation as Dual Task of Visual Question Answering | CVPR | code | 50 |
| Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam | ICML | code | 49 |
| Surface Networks | CVPR | code | 48 |
| Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions | ICML | code | 48 |
| Stacked Cross Attention for Image-Text Matching | ECCV | code | 48 |
| Actor and Observer: Joint Modeling of First and Third-Person Videos | CVPR | code | 48 |
| Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation | CVPR | code | 47 |
| Learning-based Video Motion Magnification | ECCV | code | 47 |
| Pose Partition Networks for Multi-Person Pose Estimation | ECCV | code | 47 |
| Neural Autoregressive Flows | ICML | code | 47 |
| Weakly- and Semi-Supervised Panoptic Segmentation | ECCV | code | 46 |
| Video Re-localization | ECCV | code | 46 |
| Real-time ‘Actor-Critic’ Tracking | ECCV | code | 46 |
| Black-box Adversarial Attacks with Limited Queries and Information | ICML | code | 46 |
| Hyperbolic Entailment Cones for Learning Hierarchical Embeddings | ICML | code | 46 |
| Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation | CVPR | code | 46 |
| Differentiable Compositional Kernel Learning for Gaussian Processes | ICML | code | 45 |
| Visualizing and Understanding Atari Agents | ICML | code | 45 |
| Image Manipulation with Perceptual Discriminators | ECCV | code | 45 |
| Learning Intrinsic Image Decomposition From Watching the World | CVPR | code | 45 |
| Overcoming Catastrophic Forgetting with Hard Attention to the Task | ICML | code | 44 |
| Learning Pose Specific Representations by Predicting Different Views | CVPR | code | 44 |
| Zero-Shot Object Detection | ECCV | code | 43 |
| Mean Field Multi-Agent Reinforcement Learning | ICML | code | 43 |
| Partial Adversarial Domain Adaptation | ECCV | code | 43 |
| Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation | ECCV | code | 43 |
| Robust Classification With Convolutional Prototype Learning | CVPR | code | 43 |
| SimplE Embedding for Link Prediction in Knowledge Graphs | NIPS | code | 42 |
| PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning | ICML | code | 42 |
| Learning to Blend Photos | ECCV | code | 42 |
| Mask-Guided Contrastive Attention Model for Person Re-Identification | CVPR | code | 41 |
| Link Prediction Based on Graph Neural Networks | NIPS | code | 41 |
| Generalisation in humans and deep neural networks | NIPS | code | 41 |
| Towards Binary-Valued Gates for Robust LSTM Training | ICML | code | 41 |
| Multi-scale Residual Network for Image Super-Resolution | ECCV | code | 41 |
| Fully Motion-Aware Network for Video Object Detection | ECCV | code | 41 |
| Interpretable Convolutional Neural Networks | CVPR | code | 40 |
| Generative Adversarial Perturbations | CVPR | code | 40 |
| The Sound of Pixels | ECCV | code | 40 |
| Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization | CVPR | code | 40 |
| Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance | ECCV | code | 40 |
| Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation | NIPS | code | 40 |
| Learning Warped Guidance for Blind Face Restoration | ECCV | code | 39 |
| Adversarial Complementary Learning for Weakly Supervised Object Localization | CVPR | code | 39 |
| Learning Semantic Representations for Unsupervised Domain Adaptation | ICML | code | 39 |
| Neural Architecture Search with Bayesian Optimisation and Optimal Transport | NIPS | code | 39 |
| Mutual Information Neural Estimation | ICML | code | 39 |
| NetGAN: Generating Graphs via Random Walks | ICML | code | 39 |
| Learning to Evaluate Image Captioning | CVPR | code | 38 |
| Hyperbolic Neural Networks | NIPS | code | 37 |
| Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation | ECCV | code | 37 |
| Adversarially Learned One-Class Classifier for Novelty Detection | CVPR | code | 37 |
| Disentangling by Factorising | ICML | code | 37 |
| Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples | ICML | code | 37 |
| Tangent Convolutions for Dense Prediction in 3D | CVPR | code | 37 |
| Few-Shot Image Recognition by Predicting Parameters From Activations | CVPR | code | 37 |
| Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer | CVPR | code | 37 |
| Generalizing to Unseen Domains via Adversarial Data Augmentation | NIPS | code | 36 |
| SeGAN: Segmenting and Generating the Invisible | CVPR | code | 36 |
| Graphical Generative Adversarial Networks | NIPS | code | 36 |
| PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference | CVPR | code | 36 |
| Gated Fusion Network for Single Image Dehazing | CVPR | code | 35 |
| Neural Code Comprehension: A Learnable Representation of Code Semantics | NIPS | code | 35 |
| Eye In-Painting With Exemplar Generative Adversarial Networks | CVPR | code | 35 |
| Deep One-Class Classification | ICML | code | 34 |
| Deep Regression Tracking with Shrinkage Loss | ECCV | code | 34 |
| Deflecting Adversarial Attacks With Pixel Deflection | CVPR | code | 34 |
| Learning Visual Question Answering by Bootstrapping Hard Attention | ECCV | code | 33 |
| Human-Centric Indoor Scene Synthesis Using Stochastic Grammar | CVPR | code | 33 |
| Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering | CVPR | code | 33 |
| CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise | CVPR | code | 33 |
| Speaker-Follower Models for Vision-and-Language Navigation | NIPS | code | 33 |
| Improving Shape Deformation in Unsupervised Image-to-Image Translation | ECCV | code | 33 |
| Learning Single-View 3D Reconstruction with Limited Pose Supervision | ECCV | code | 33 |
| 3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data | NIPS | code | 33 |
| Adversarial Logit Pairing | NIPS | code | 32 |
| Attention in Convolutional LSTM for Gesture Recognition | NIPS | code | 32 |
| Graph-Cut RANSAC | CVPR | code | 32 |
| Neural Guided Constraint Logic Programming for Program Synthesis | NIPS | code | 32 |
| Learning Dynamic Memory Networks for Object Tracking | ECCV | code | 32 |
| GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints | ECCV | code | 32 |
| A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks | NIPS | code | 32 |
| Flow-Grounded Spatial-Temporal Video Prediction from Still Images | ECCV | code | 32 |
| Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection | ECCV | code | 32 |
| On the Robustness of Semantic Segmentation Models to Adversarial Attacks | CVPR | code | 31 |
| Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning | CVPR | code | 31 |
| SketchyScene: Richly-Annotated Scene Sketches | ECCV | code | 31 |
| Deep Randomized Ensembles for Metric Learning | ECCV | code | 30 |
| Deep High Dynamic Range Imaging with Large Foreground Motions | ECCV | code | 30 |
| Revisiting Video Saliency: A Large-Scale Benchmark and a New Model | CVPR | code | 30 |
| Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning | CVPR | code | 30 |
| Deep Model-Based 6D Pose Refinement in RGB | ECCV | code | 30 |
| TOM-Net: Learning Transparent Object Matting From a Single Image | CVPR | code | 30 |
| Quaternion Convolutional Neural Networks | ECCV | code | 30 |
| Densely Connected Attention Propagation for Reading Comprehension | NIPS | code | 30 |
| A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising | ECCV | code | 30 |
| Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings | ICML | code | 29 |
| Video Rain Streak Removal by Multiscale Convolutional Sparse Coding | CVPR | code | 29 |
| Recurrent Scene Parsing With Perspective Understanding in the Loop | CVPR | code | 29 |
| Single Shot Scene Text Retrieval | ECCV | code | 29 |
| Toward Characteristic-Preserving Image-based Virtual Try-On Network | ECCV | code | 29 |
| Explainable Neural Computation via Stack Neural Module Networks | ECCV | code | 29 |
| Exploring Disentangled Feature Representation Beyond Face Identification | CVPR | code | 29 |
| Controllable Video Generation With Sparse Trajectories | CVPR | code | 28 |
| Layer-structured 3D Scene Inference via View Synthesis | ECCV | code | 28 |
| Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation | ECCV | code | 28 |
| PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection | CVPR | code | 28 |
| Learning Rich Features for Image Manipulation Detection | CVPR | code | 27 |
| Fast Video Object Segmentation by Reference-Guided Mask Propagation | CVPR | code | 27 |
| 3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration | ECCV | code | 27 |
| Who Let the Dogs Out? Modeling Dog Behavior From Visual Data | CVPR | code | 27 |
| EC-Net: an Edge-aware Point set Consolidation Network | ECCV | code | 27 |
| Interpretable Intuitive Physics Model | ECCV | code | 27 |
| Learning a Discriminative Feature Network for Semantic Segmentation | CVPR | code | 26 |
| Partial Transfer Learning With Selective Adversarial Networks | CVPR | code | 26 |
| Cross-Modal Deep Variational Hand Pose Estimation | CVPR | code | 26 |
| Between-Class Learning for Image Classification | CVPR | code | 26 |
| AON: Towards Arbitrarily-Oriented Text Recognition | CVPR | code | 26 |
| Conditional Image-to-Image Translation | CVPR | code | 25 |
| Learning Convolutional Networks for Content-Weighted Image Compression | CVPR | code | 25 |
| Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification | CVPR | code | 25 |
| Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries | ECCV | code | 25 |
| CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation | CVPR | code | 25 |
| Deep Texture Manifold for Ground Terrain Recognition | CVPR | code | 25 |
| Audio-Visual Event Localization in Unconstrained Videos | ECCV | code | 25 |
| First Order Generative Adversarial Networks | ICML | code | 25 |
| Visual Coreference Resolution in Visual Dialog using Neural Module Networks | ECCV | code | 25 |
| SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks | CVPR | code | 24 |
| Deep Reinforcement Learning of Marked Temporal Point Processes | NIPS | code | 24 |
| Explicit Inductive Bias for Transfer Learning with Convolutional Networks | ICML | code | 24 |
| LEGO: Learning Edge With Geometry All at Once by Watching Videos | CVPR | code | 24 |
| Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes | ECCV | code | 24 |
| Multi-Agent Diverse Generative Adversarial Networks | CVPR | code | 23 |
| Face Aging With Identity-Preserved Conditional Generative Adversarial Networks | CVPR | code | 23 |
| Learning to Separate Object Sounds by Watching Unlabeled Video | ECCV | code | 23 |
| Exploiting the Potential of Standard Convolutional Autoencoders for Image Restoration by Evolutionary Search | ICML | code | 23 |
| To Trust Or Not To Trust A Classifier | NIPS | code | 23 |
| Im2Flow: Motion Hallucination From Static Images for Action Recognition | CVPR | code | 22 |
| ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing | CVPR | code | 22 |
| Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning | CVPR | code | 22 |
| Anonymous Walk Embeddings | ICML | code | 22 |
| Learning to Multitask | NIPS | code | 22 |
| CondenseNet: An Efficient DenseNet Using Learned Group Convolutions | CVPR | code | 22 |
| HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN | CVPR | code | 22 |
| Hierarchical Relational Networks for Group Activity Recognition and Retrieval | ECCV | code | 22 |
| Collaborative and Adversarial Network for Unsupervised Domain Adaptation | CVPR | code | 22 |
| Geometry-Aware Scene Text Detection With Instance Transformation Network | CVPR | code | 22 |
| Learning to Promote Saliency Detectors | CVPR | code | 21 |
| CSGNet: Neural Shape Parser for Constructive Solid Geometry | CVPR | code | 21 |
| Local Spectral Graph Convolution for Point Set Feature Learning | ECCV | code | 21 |
| HiDDeN: Hiding Data with Deep Networks | ECCV | code | 21 |
| GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning | CVPR | code | 20 |
| Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal | CVPR | code | 20 |
| Fully-Convolutional Point Networks for Large-Scale Point Clouds | ECCV | code | 20 |
| Learning Superpixels With Segmentation-Aware Affinity Loss | CVPR | code | 20 |
| Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks | CVPR | code | 20 |
| Crowd Counting With Deep Negative Correlation Learning | CVPR | code | 20 |
| Dimensionality-Driven Learning with Noisy Labels | ICML | code | 20 |
| Objects that Sound | ECCV | code | 20 |
| Deep Expander Networks: Efficient Deep Networks from Graph Theory | ECCV | code | 19 |
| Low-Shot Learning With Large-Scale Diffusion | CVPR | code | 19 |
| Low-Shot Learning With Imprinted Weights | CVPR | code | 19 |
| Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery | CVPR | code | 19 |
| Learning Descriptor Networks for 3D Shape Synthesis and Analysis | CVPR | code | 19 |
| Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders | ECCV | code | 19 |
| CTAP: Complementary Temporal Action Proposal Generation | ECCV | code | 18 |
| DVAE#: Discrete Variational Autoencoders with Relaxed Boltzmann Priors | NIPS | code | 18 |
| Conditional Image-Text Embedding Networks | ECCV | code | 18 |
| EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images | CVPR | code | 18 |
| Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points | CVPR | code | 18 |
| Bayesian Optimization of Combinatorial Structures | ICML | code | 18 |
| FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis | CVPR | code | 18 |
| Learning Type-Aware Embeddings for Fashion Compatibility | ECCV | code | 17 |
| Sliced Wasserstein Distance for Learning Gaussian Mixture Models | CVPR | code | 17 |
| Revisiting Deep Intrinsic Image Decompositions | CVPR | code | 17 |
| A Spectral Approach to Gradient Estimation for Implicit Distributions | ICML | code | 17 |
| Hierarchical Novelty Detection for Visual Object Recognition | CVPR | code | 17 |
| Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies | CVPR | code | 17 |
| Learning Generative ConvNets via Multi-Grid Modeling and Sampling | CVPR | code | 17 |
| Learning 3D Shape Completion From Laser Scan Data With Weak Supervision | CVPR | code | 17 |
| Triplet Loss in Siamese Network for Object Tracking | ECCV | code | 17 |
| Adversarial Attack on Graph Structured Data | ICML | code | 17 |
| Arbitrary Style Transfer With Deep Feature Reshuffle | CVPR | code | 17 |
| Visual Question Reasoning on General Dependency Tree | CVPR | code | 17 |
| Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition | ECCV | code | 16 |
| Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks | NIPS | code | 16 |
| Coded Sparse Matrix Multiplication | ICML | code | 16 |
| Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment | CVPR | code | 16 |
| Recovering 3D Planes from a Single Image via Convolutional Neural Networks | ECCV | code | 16 |
| SegStereo: Exploiting Semantic Information for Disparity Estimation | ECCV | code | 16 |
| Functional Gradient Boosting based on Residual Network Perception | ICML | code | 16 |
| NAG: Network for Adversary Generation | CVPR | code | 16 |
| Generative Probabilistic Novelty Detection with Adversarial Autoencoders | NIPS | code | 16 |
| Hashing as Tie-Aware Learning to Rank | CVPR | code | 15 |
| Pose Proposal Networks | ECCV | code | 15 |
| Convolutional Sequence to Sequence Model for Human Dynamics | CVPR | code | 15 |
| Joint Pose and Expression Modeling for Facial Expression Recognition | CVPR | code | 15 |
| Grounding Referring Expressions in Images by Variational Context | CVPR | code | 15 |
| Rethinking the Form of Latent States in Image Captioning | ECCV | code | 15 |
| Open Set Domain Adaptation by Backpropagation | ECCV | code | 15 |
| Neural Sign Language Translation | CVPR | code | 15 |
| SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters | ECCV | code | 15 |
| Efficient Neural Audio Synthesis | ICML | code | 15 |
| Deep Learning Under Privileged Information Using Heteroscedastic Dropout | CVPR | code | 14 |
| Image Transformer | ICML | code | 14 |
| Learning to Understand Image Blur | CVPR | code | 14 |
| Learning and Using the Arrow of Time | CVPR | code | 14 |
| Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints | CVPR | code | 14 |
| Learning to Forecast and Refine Residual Motion for Image-to-Video Generation | ECCV | code | 14 |
| Multi-Scale Weighted Nuclear Norm Image Restoration | CVPR | code | 14 |
| Synthesizing Robust Adversarial Examples | ICML | code | 13 |
| Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data | ECCV | code | 13 |
| Assessing Generative Models via Precision and Recall | NIPS | code | 13 |
| Deep Diffeomorphic Transformer Networks | CVPR | code | 13 |
| Learning by Asking Questions | CVPR | code | 13 |
| Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection | CVPR | code | 13 |
| Variational Autoencoders for Deforming 3D Mesh Models | CVPR | code | 13 |
| Min-Entropy Latent Model for Weakly Supervised Object Detection | CVPR | code | 13 |
| Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering | CVPR | code | 13 |
| Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace | ICML | code | 13 |
| Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition | CVPR | code | 13 |
| Finding Influential Training Samples for Gradient Boosted Decision Trees | ICML | code | 13 |
| Gesture Recognition: Focus on the Hands | CVPR | code | 12 |
| Cross-View Image Synthesis Using Conditional GANs | CVPR | code | 12 |
| Joint Optimization Framework for Learning With Noisy Labels | CVPR | code | 12 |
| Future Person Localization in First-Person Videos | CVPR | code | 12 |
| AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos | ECCV | code | 12 |
| Learning Transferable Architectures for Scalable Image Recognition | CVPR | code | 12 |
| Clipped Action Policy Gradient | ICML | code | 12 |
| Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation | CVPR | code | 12 |
| Decouple Learning for Parameterized Image Operators | ECCV | code | 12 |
| Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction | ICML | code | 12 |
| Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models | NIPS | code | 12 |
| AMNet: Memorability Estimation With Attention | CVPR | code | 12 |
| Adversarial Time-to-Event Modeling | ICML | code | 12 |
| Reversible Recurrent Neural Networks | NIPS | code | 12 |
| Human Pose Estimation With Parsing Induced Learner | CVPR | code | 11 |
| ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking | ECCV | code | 11 |
| A Joint Sequence Fusion Model for Video Question Answering and Retrieval | ECCV | code | 11 |
| Learning Face Age Progression: A Pyramid Architecture of GANs | CVPR | code | 11 |
| Robust Physical-World Attacks on Deep Learning Visual Classification | CVPR | code | 11 |
| High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach | ICML | code | 11 |
| Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory | ICML | code | 11 |
| Multimodal Explanations: Justifying Decisions and Pointing to the Evidence | CVPR | code | 11 |
| Accelerating Natural Gradient with Higher-Order Invariance | ICML | code | 11 |
| Hierarchical Multi-Label Classification Networks | ICML | code | 11 |
| Convolutional Image Captioning | CVPR | code | 11 |
| Boosting Domain Adaptation by Discovering Latent Domains | CVPR | code | 11 |
| Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks | CVPR | code | 10 |
| PacGAN: The power of two samples in generative adversarial networks | NIPS | code | 10 |
| Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification | CVPR | code | 10 |
| End-to-End Incremental Learning | ECCV | code | 10 |
| Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation | CVPR | code | 10 |
| On GANs and GMMs | NIPS | code | 10 |
| Salient Object Detection Driven by Fixation Prediction | CVPR | code | 9 |
| Semantic Video Segmentation by Gated Recurrent Flow Propagation | CVPR | code | 9 |
| Constraint-Aware Deep Neural Network Compression | ECCV | code | 9 |
| Statistically-motivated Second-order Pooling | ECCV | code | 9 |
| Excitation Backprop for RNNs | CVPR | code | 9 |
| Analyzing Uncertainty in Neural Machine Translation | ICML | code | 9 |
| Learning Dynamics of Linear Denoising Autoencoders | ICML | code | 9 |
| Saliency Detection in 360° Videos | ECCV | code | 9 |
| Density Adaptive Point Set Registration | CVPR | code | 9 |
| Decoupled Parallel Backpropagation with Convergence Guarantee | ICML | code | 9 |
| Classification from Pairwise Similarity and Unlabeled Data | ICML | code | 9 |
| oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis | ICML | code | 9 |
| Modeling Sparse Deviations for Compressed Sensing using Generative Models | ICML | code | 9 |
| Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction | CVPR | code | 9 |
| Towards Open-Set Identity Preserving Face Synthesis | CVPR | code | 9 |
| Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras | CVPR | code | 8 |
| BourGAN: Generative Networks with Metric Embeddings | NIPS | code | 8 |
| Fast Information-theoretic Bayesian Optimisation | ICML | code | 8 |
| Deep Variational Reinforcement Learning for POMDPs | ICML | code | 8 |
| Specular-to-Diffuse Translation for Multi-View Reconstruction | ECCV | code | 8 |
| Dynamic Conditional Networks for Few-Shot Learning | ECCV | code | 8 |
| Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering | CVPR | code | 8 |
| High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs | CVPR | code | 8 |
| Deep Defense: Training DNNs with Improved Adversarial Robustness | NIPS | code | 8 |
| Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations | ICML | code | 8 |
| Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based Modeling | ECCV | code | 7 |
| Non-metric Similarity Graphs for Maximum Inner Product Search | NIPS | code | 7 |
| Towards Realistic Predictors | ECCV | code | 7 |
| Deep Non-Blind Deconvolution via Generalized Low-Rank Approximation | NIPS | code | 7 |
| Don’t Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering | CVPR | code | 7 |
| Learning Dual Convolutional Neural Networks for Low-Level Vision | CVPR | code | 7 |
| The Mirage of Action-Dependent Baselines in Reinforcement Learning | ICML | code | 7 |
| DVQA: Understanding Data Visualizations via Question Answering | CVPR | code | 7 |
| A Two-Step Disentanglement Method | CVPR | code | 7 |
| Detecting and Correcting for Label Shift with Black Box Predictors | ICML | code | 7 |
| Conditional Prior Networks for Optical Flow | ECCV | code | 7 |
| Generative Adversarial Learning Towards Fast Weakly Supervised Detection | CVPR | code | 7 |
| Adversarial Learning with Local Coordinate Coding | ICML | code | 7 |
| Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks | CVPR | code | 7 |
| AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks | CVPR | code | 7 |
| Learning to Explain: An Information-Theoretic Perspective on Model Interpretation | ICML | code | 7 |
| Banach Wasserstein GAN | NIPS | code | 7 |
| Gradually Updated Neural Networks for Large-Scale Image Recognition | ICML | code | 7 |
| Learning Steady-States of Iterative Algorithms over Graphs | ICML | code | 7 |
| Progressive Attention Guided Recurrent Network for Salient Object Detection | CVPR | code | 7 |
| Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains | CVPR | code | 6 |
| Unsupervised holistic image generation from key local patches | ECCV | code | 6 |
| Inner Space Preserving Generative Pose Machine | ECCV | code | 6 |
| Bilevel Programming for Hyperparameter Optimization and Meta-Learning | ICML | code | 6 |
| Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition | CVPR | code | 6 |
| Breaking the Activation Function Bottleneck through Adaptive Parameterization | NIPS | code | 6 |
| Ultra Large-Scale Feature Selection using Count-Sketches | ICML | code | 6 |
| Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks | CVPR | code | 6 |
| Orthogonally Decoupled Variational Gaussian Processes | NIPS | code | 6 |
| Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design | ICML | code | 6 |
| A Modulation Module for Multi-task Learning with Applications in Image Retrieval | ECCV | code | 6 |
| A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos | CVPR | code | 6 |
| Towards Effective Low-Bitwidth Convolutional Neural Networks | CVPR | code | 5 |
| Disentangling Factors of Variation by Mixing Them | CVPR | code | 5 |
| Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior | ECCV | code | 5 |
| Learning Longer-term Dependencies in RNNs with Auxiliary Losses | ICML | code | 5 |
| Contour Knowledge Transfer for Salient Object Detection | ECCV | code | 5 |
| HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning | ECCV | code | 5 |
| Sidekick Policy Learning for Active Visual Exploration | ECCV | code | 5 |
| Learning to Localize Sound Source in Visual Scenes | CVPR | code | 5 |
| Neural Architecture Optimization | NIPS | code | 5 |
| COLA: Decentralized Linear Learning | NIPS | code | 5 |
| Diverse and Coherent Paragraph Generation from Images | ECCV | code | 5 |
| DRACO: Byzantine-resilient Distributed Training via Redundant Gradients | ICML | code | 5 |
| Inter and Intra Topic Structure Learning with Word Embeddings | ICML | code | 5 |
| Estimating the Success of Unsupervised Image to Image Translation | ECCV | code | 5 |
| Dynamic-Structured Semantic Propagation Network | CVPR | code | 5 |
| The Description Length of Deep Learning models | NIPS | code | 5 |
| Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving | ECCV | code | 5 |
| Blind Justice: Fairness with Encrypted Sensitive Attributes | ICML | code | 5 |
| Transfer Learning via Learning to Transfer | ICML | code | 5 |
| Deepcode: Feedback Codes via Deep Learning | NIPS | code | 4 |
| Configurable Markov Decision Processes | ICML | code | 4 |
| A Framework for Evaluating 6-DOF Object Trackers | ECCV | code | 4 |
| Differentially Private Database Release via Kernel Mean Embeddings | ICML | code | 4 |
| Recognizing Human Actions as the Evolution of Pose Estimation Maps | CVPR | code | 4 |
| Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images | CVPR | code | 4 |
| DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map | CVPR | code | 4 |
| Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification | ECCV | code | 4 |
| Tracking Emerges by Colorizing Videos | ECCV | code | 4 |
| Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes | ECCV | code | 4 |
| Inference Suboptimality in Variational Autoencoders | ICML | code | 4 |
| Black Box FDR | ICML | code | 4 |
| Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence | CVPR | code | 4 |
| Quadrature-based features for kernel approximation | NIPS | code | 4 |
| Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking | ECCV | code | 4 |
| Transferable Adversarial Perturbations | ECCV | code | 4 |
| Single Image Water Hazard Detection using FCN with Reflection Attention Units | ECCV | code | 4 |
| Multimodal Generative Models for Scalable Weakly-Supervised Learning | NIPS | code | 4 |
| Importance Weighted Transfer of Samples in Reinforcement Learning | ICML | code | 3 |
| Feature Generating Networks for Zero-Shot Learning | CVPR | code | 3 |
| DICOD: Distributed Convolutional Coordinate Descent for Convolutional Sparse Coding | ICML | code | 3 |
| CapProNet: Deep Feature Learning via Orthogonal Projections onto Capsule Subspaces | NIPS | code | 3 |
| Bidirectional Retrieval Made Simple | CVPR | code | 3 |
| Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages | NIPS | code | 3 |
| A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping | CVPR | code | 3 |
| Spatially-Adaptive Filter Units for Deep Neural Networks | CVPR | code | 3 |
| Learning to Branch | ICML | code | 3 |
| Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives | NIPS | code | 3 |
| Lifelong Learning via Progressive Distillation and Retrospection | ECCV | code | 3 |
| CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition | CVPR | code | 3 |
| Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care | ICML | code | 3 |
| Learning Answer Embeddings for Visual Question Answering | CVPR | code | 3 |
| Information Constraints on Auto-Encoding Variational Bayes | NIPS | code | 3 |
| Parallel Bayesian Network Structure Learning | ICML | code | 3 |
| Ring Loss: Convex Feature Normalization for Face Recognition | CVPR | code | 3 |
| Teaching Categories to Human Learners With Visual Explanations | CVPR | code | 3 |
| Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization | ICML | code | 3 |
| Deep Burst Denoising | ECCV | code | 3 |
| Convergent Tree Backup and Retrace with Function Approximation | ICML | code | 3 |
| Gaze Prediction in Dynamic 360° Immersive Videos | CVPR | code | 3 |
| Statistical Recurrent Models on Manifold valued Data | NIPS | code | 3 |
| End-to-End Flow Correlation Tracking With Spatial-Temporal Attention | CVPR | code | 3 |