| Bridging the Gap Between Value and Policy Based Reinforcement Learning | NIPS | code | 46593 |
| REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models | NIPS | code | 46593 |
| Focal Loss for Dense Object Detection | ICCV | code | 18356 |
| Mask R-CNN | ICCV | code | 9493 |
| Deep Photo Style Transfer | CVPR | code | 8655 |
| LightGBM: A Highly Efficient Gradient Boosting Decision Tree | NIPS | code | 7536 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | NIPS | code | 6449 |
| Attention is All you Need | NIPS | code | 6288 |
| Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression | ICCV | code | 3354 |
| Densely Connected Convolutional Networks | CVPR | code | 3130 |
| A Unified Approach to Interpreting Model Predictions | NIPS | code | 3122 |
| Deformable Convolutional Networks | ICCV | code | 2165 |
| ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games | NIPS | code | 1823 |
| PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation | CVPR | code | 1523 |
| Improved Training of Wasserstein GANs | NIPS | code | 1405 |
| Fully Convolutional Instance-Aware Semantic Segmentation | CVPR | code | 1395 |
| Aggregated Residual Transformations for Deep Neural Networks | CVPR | code | 1361 |
| Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | CVPR | code | 1301 |
| Unsupervised Image-to-Image Translation Networks | NIPS | code | 1205 |
| Photographic Image Synthesis With Cascaded Refinement Networks | ICCV | code | 1142 |
| High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis | CVPR | code | 1072 |
| SphereFace: Deep Hypersphere Embedding for Face Recognition | CVPR | code | 1048 |
| Deep Feature Flow for Video Recognition | CVPR | code | 966 |
| Bayesian GAN | NIPS | code | 942 |
| Pyramid Scene Parsing Network | CVPR | code | 934 |
| Efficient Modeling of Latent Information in Supervised Learning using Gaussian Processes | NIPS | code | 906 |
| Finding Tiny Faces | CVPR | code | 856 |
| Toward Multimodal Image-to-Image Translation | NIPS | code | 794 |
| Learning to Discover Cross-Domain Relations with Generative Adversarial Networks | ICML | code | 784 |
| YOLO9000: Better, Faster, Stronger | CVPR | code | 773 |
| PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space | NIPS | code | 772 |
| Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks | ICML | code | 729 |
| FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks | CVPR | code | 720 |
| Channel Pruning for Accelerating Very Deep Neural Networks | ICCV | code | 649 |
| Dilated Residual Networks | CVPR | code | 640 |
| Inferring and Executing Programs for Visual Reasoning | ICCV | code | 636 |
| DSOD: Learning Deeply Supervised Object Detectors From Scratch | ICCV | code | 582 |
| Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization | ICCV | code | 572 |
| Accelerating Eulerian Fluid Simulation With Convolutional Networks | ICML | code | 570 |
| Learning Disentangled Representations with Semi-Supervised Deep Generative Models | NIPS | code | 556 |
| Inductive Representation Learning on Large Graphs | NIPS | code | 552 |
| Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network | CVPR | code | 537 |
| How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks) | ICCV | code | 526 |
| SSH: Single Stage Headless Face Detector | ICCV | code | 515 |
| Learning From Simulated and Unsupervised Images Through Adversarial Training | CVPR | code | 492 |
| Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space | CVPR | code | 487 |
| Video Frame Interpolation via Adaptive Convolution | CVPR | code | 482 |
| Video Frame Interpolation via Adaptive Separable Convolution | ICCV | code | 482 |
| GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence | CVPR | code | 460 |
| Joint Detection and Identification Feature Learning for Person Search | CVPR | code | 459 |
| Dual Path Networks | NIPS | code | 451 |
| Flow-Guided Feature Aggregation for Video Object Detection | ICCV | code | 436 |
| Deep Image Matting | CVPR | code | 434 |
| Richer Convolutional Features for Edge Detection | CVPR | code | 399 |
| Annotating Object Instances With a Polygon-RNN | CVPR | code | 397 |
| Recurrent Highway Networks | ICML | code | 397 |
| Detect to Track and Track to Detect | ICCV | code | 387 |
| RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation | CVPR | code | 379 |
| Detecting Oriented Text in Natural Images by Linking Segments | CVPR | code | 364 |
| Deep Lattice Networks and Partial Monotonic Functions | NIPS | code | 349 |
| Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results | NIPS | code | 347 |
| RON: Reverse Connection With Objectness Prior Networks for Object Detection | CVPR | code | 345 |
| Universal Style Transfer via Feature Transforms | NIPS | code | 344 |
| Residual Attention Network for Image Classification | CVPR | code | 329 |
| One-Shot Video Object Segmentation | CVPR | code | 316 |
| Accurate Single Stage Detector Using Recurrent Rolling Convolution | CVPR | code | 314 |
| Feature Pyramid Networks for Object Detection | CVPR | code | 310 |
| Efficient softmax approximation for GPUs | ICML | code | 304 |
| OctNet: Learning Deep 3D Representations at High Resolutions | CVPR | code | 302 |
| Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution | CVPR | code | 301 |
| Pixel Recursive Super Resolution | ICCV | code | 301 |
| Self-Critical Sequence Training for Image Captioning | CVPR | code | 299 |
| Age Progression/Regression by Conditional Adversarial Autoencoder | CVPR | code | 297 |
| Style Transfer from Non-Parallel Text by Cross-Alignment | NIPS | code | 296 |
| Dilated Recurrent Neural Networks | NIPS | code | 285 |
| Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image | CVPR | code | 280 |
| DeepBach: a Steerable Model for Bach Chorales Generation | ICML | code | 276 |
| The Predictron: End-To-End Learning and Planning | ICML | code | 274 |
| Convolutional Sequence to Sequence Learning | ICML | code | 258 |
| OptNet: Differentiable Optimization as a Layer in Neural Networks | ICML | code | 245 |
| Prototypical Networks for Few-shot Learning | NIPS | code | 244 |
| Deep Voice: Real-time Neural Text-to-Speech | ICML | code | 242 |
| Reinforcement Learning with Deep Energy-Based Policies | ICML | code | 233 |
| Learning Deep CNN Denoiser Prior for Image Restoration | CVPR | code | 231 |
| GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium | NIPS | code | 229 |
| A Point Set Generation Network for 3D Object Reconstruction From a Single Image | CVPR | code | 228 |
| Deeply Supervised Salient Object Detection With Short Connections | CVPR | code | 228 |
| BlitzNet: A Real-Time Deep Network for Scene Understanding | ICCV | code | 227 |
| Language Modeling with Gated Convolutional Networks | ICML | code | 221 |
| Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro | ICCV | code | 215 |
| Stacked Generative Adversarial Networks | CVPR | code | 215 |
| RMPE: Regional Multi-Person Pose Estimation | ICCV | code | 215 |
| Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning | CVPR | code | 214 |
| Generative Face Completion | CVPR | code | 212 |
| VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition | ICCV | code | 210 |
| The Reversible Residual Network: Backpropagation Without Storing Activations | NIPS | code | 210 |
| Recurrent Scale Approximation for Object Detection in CNN | ICCV | code | 209 |
| Learning From Synthetic Humans | CVPR | code | 207 |
| Spatially Adaptive Computation Time for Residual Networks | CVPR | code | 203 |
| Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis | ICCV | code | 202 |
| 3D Bounding Box Estimation Using Deep Learning and Geometry | CVPR | code | 200 |
| Multi-View 3D Object Detection Network for Autonomous Driving | CVPR | code | 199 |
| Visual Dialog | CVPR | code | 199 |
| Interpretable Explanations of Black Boxes by Meaningful Perturbation | ICCV | code | 192 |
| Inverse Compositional Spatial Transformer Networks | CVPR | code | 189 |
| FastMask: Segment Multi-Scale Object Candidates in One Shot | CVPR | code | 189 |
| OnACID: Online Analysis of Calcium Imaging Data in Real Time | NIPS | code | 189 |
| Semantic Scene Completion From a Single Depth Image | CVPR | code | 188 |
| Learning Efficient Convolutional Networks Through Network Slimming | ICCV | code | 186 |
| Learning Feature Pyramids for Human Pose Estimation | ICCV | code | 185 |
| Be Your Own Prada: Fashion Synthesis With Structural Coherence | ICCV | code | 183 |
| Scene Graph Generation by Iterative Message Passing | CVPR | code | 182 |
| Fast Image Processing With Fully-Convolutional Networks | ICCV | code | 180 |
| Learning Multiple Tasks with Multilinear Relationship Networks | NIPS | code | 178 |
| Learning to Reason: End-To-End Module Networks for Visual Question Answering | ICCV | code | 178 |
| Single Shot Text Detector With Regional Attention | ICCV | code | 176 |
| Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources | ICCV | code | 175 |
| Deep Feature Interpolation for Image Content Changes | CVPR | code | 170 |
| On Human Motion Prediction Using Recurrent Neural Networks | CVPR | code | 167 |
| Image Super-Resolution via Deep Recursive Residual Network | CVPR | code | 163 |
| Learning Cross-Modal Embeddings for Cooking Recipes and Food Images | CVPR | code | 160 |
| Input Convex Neural Networks | ICML | code | 159 |
| Simple Does It: Weakly Supervised Instance and Semantic Segmentation | CVPR | code | 159 |
| Low-Shot Visual Recognition by Shrinking and Hallucinating Features | ICCV | code | 158 |
| Oriented Response Networks | CVPR | code | 157 |
| Soft Proposal Networks for Weakly Supervised Object Localization | ICCV | code | 154 |
| Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks | ICML | code | 147 |
| Axiomatic Attribution for Deep Networks | ICML | code | 146 |
| Gradient Episodic Memory for Continual Learning | NIPS | code | 146 |
| DSAC - Differentiable RANSAC for Camera Localization | CVPR | code | 144 |
| Attend to You: Personalized Image Captioning With Context Sequence Memory Networks | CVPR | code | 143 |
| Conditional Similarity Networks | CVPR | code | 142 |
| Language Modeling with Recurrent Highway Hypernetworks | NIPS | code | 141 |
| Triple Generative Adversarial Nets | NIPS | code | 138 |
| Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning | NIPS | code | 138 |
| One-Sided Unsupervised Domain Mapping | NIPS | code | 137 |
| Detecting Visual Relationships With Deep Relational Networks | CVPR | code | 137 |
| Attentive Recurrent Comparators | ICML | code | 136 |
| Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach | ICCV | code | 136 |
| Learning a Multi-View Stereo Machine | NIPS | code | 135 |
| Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model | NIPS | code | 134 |
| Multi-Context Attention for Human Pose Estimation | CVPR | code | 131 |
| Controlling Perceptual Factors in Neural Style Transfer | CVPR | code | 130 |
| Bayesian Compression for Deep Learning | NIPS | code | 130 |
| Adversarial Discriminative Domain Adaptation | CVPR | code | 129 |
| Working hard to know your neighbor's margins: Local descriptor learning loss | NIPS | code | 128 |
| Concrete Dropout | NIPS | code | 127 |
| SegFlow: Joint Learning for Video Object Segmentation and Optical Flow | ICCV | code | 127 |
| Segmentation-Aware Convolutional Networks Using Local Attention Masks | ICCV | code | 126 |
| Detail-Revealing Deep Video Super-Resolution | ICCV | code | 126 |
| CREST: Convolutional Residual Learning for Visual Tracking | ICCV | code | 126 |
| Discriminative Correlation Filter With Channel and Spatial Reliability | CVPR | code | 124 |
| SVDNet for Pedestrian Retrieval | ICCV | code | 121 |
| Semantic Image Synthesis via Adversarial Learning | ICCV | code | 121 |
| Spatiotemporal Multiplier Networks for Video Action Recognition | CVPR | code | 121 |
| PoseTrack: Joint Multi-Person Pose Estimation and Tracking | CVPR | code | 121 |
| Hierarchical Attentive Recurrent Tracking | NIPS | code | 121 |
| Good Semi-supervised Learning That Requires a Bad GAN | NIPS | code | 120 |
| Deep Watershed Transform for Instance Segmentation | CVPR | code | 120 |
| Associative Domain Adaptation | ICCV | code | 119 |
| Learning by Association -- A Versatile Semi-Supervised Training Method for Neural Networks | CVPR | code | 119 |
| Value Prediction Network | NIPS | code | 119 |
| Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation | ICCV | code | 119 |
| MemNet: A Persistent Memory Network for Image Restoration | ICCV | code | 119 |
| Bayesian Optimization with Gradients | NIPS | code | 117 |
| TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning | NIPS | code | 117 |
| Compressed Sensing using Generative Models | ICML | code | 116 |
| Switching Convolutional Neural Network for Crowd Counting | CVPR | code | 116 |
| WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation | CVPR | code | 116 |
| Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner | ICCV | code | 115 |
| Video Frame Synthesis Using Deep Voxel Flow | ICCV | code | 114 |
| Multiple Instance Detection Network With Online Instance Classifier Refinement | CVPR | code | 113 |
| Deep Pyramidal Residual Networks | CVPR | code | 112 |
| Train longer, generalize better: closing the generalization gap in large batch training of neural networks | NIPS | code | 112 |
| Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction | CVPR | code | 110 |
| Unite the People: Closing the Loop Between 3D and 2D Human Representations | CVPR | code | 110 |
| Learning Combinatorial Optimization Algorithms over Graphs | NIPS | code | 109 |
| FeUdal Networks for Hierarchical Reinforcement Learning | ICML | code | 107 |
| ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression | ICCV | code | 105 |
| Learning a Deep Embedding Model for Zero-Shot Learning | CVPR | code | 104 |
| ECO: Efficient Convolution Operators for Tracking | CVPR | code | 103 |
| SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning | CVPR | code | 102 |
| Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency | CVPR | code | 100 |
| Task-based End-to-end Model Learning in Stochastic Optimization | NIPS | code | 100 |
| Learning to Compose Domain-Specific Transformations for Data Augmentation | NIPS | code | 97 |
| Genetic CNN | ICCV | code | 97 |
| HashNet: Deep Learning to Hash by Continuation | ICCV | code | 97 |
| Interleaved Group Convolutions | ICCV | code | 95 |
| Deeply-Learned Part-Aligned Representations for Person Re-Identification | ICCV | code | 95 |
| Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model | NIPS | code | 94 |
| Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation | CVPR | code | 93 |
| Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs | ICCV | code | 92 |
| Semantic Autoencoder for Zero-Shot Learning | CVPR | code | 92 |
| Deep Hyperspherical Learning | NIPS | code | 92 |
| Decoupled Neural Interfaces using Synthetic Gradients | ICML | code | 90 |
| Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks | NIPS | code | 90 |
| Practical Bayesian Optimization for Model Fitting with Bayesian Adaptive Direct Search | NIPS | code | 90 |
| Optical Flow Estimation Using a Spatial Pyramid Network | CVPR | code | 90 |
| AMC: Attention guided Multi-modal Correlation Learning for Image Search | CVPR | code | 90 |
| Deep Video Deblurring for Hand-Held Cameras | CVPR | code | 89 |
| Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data | NIPS | code | 88 |
| Causal Effect Inference with Deep Latent-Variable Models | NIPS | code | 87 |
| GANs for Biological Image Synthesis | ICCV | code | 85 |
| MMD GAN: Towards Deeper Understanding of Moment Matching Network | NIPS | code | 84 |
| Representation Learning by Learning to Count | ICCV | code | 84 |
| Optical Flow in Mostly Rigid Scenes | CVPR | code | 83 |
| Fast-Slow Recurrent Neural Networks | NIPS | code | 82 |
| Unsupervised Video Summarization With Adversarial LSTM Networks | CVPR | code | 82 |
| Constrained Policy Optimization | ICML | code | 81 |
| A-NICE-MC: Adversarial Training for MCMC | NIPS | code | 80 |
| Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose | CVPR | code | 80 |
| End-To-End Instance Segmentation With Recurrent Attention | CVPR | code | 78 |
| DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data | CVPR | code | 78 |
| Learning Shape Abstractions by Assembling Volumetric Primitives | CVPR | code | 77 |
| Local Binary Convolutional Neural Networks | CVPR | code | 77 |
| Raster-To-Vector: Revisiting Floorplan Transformation | ICCV | code | 76 |
| Positive-Unlabeled Learning with Non-Negative Risk Estimator | NIPS | code | 76 |
| Hard-Aware Deeply Cascaded Embedding | ICCV | code | 75 |
| Deep Image Harmonization | CVPR | code | 73 |
| Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis | CVPR | code | 73 |
| Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade | CVPR | code | 73 |
| Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning | CVPR | code | 72 |
| Query-Guided Regression Network With Context Policy for Phrase Grounding | ICCV | code | 72 |
| Top-Down Visual Saliency Guided by Captions | CVPR | code | 72 |
| Feedback Networks | CVPR | code | 72 |
| What Actions Are Needed for Understanding Human Actions in Videos? | ICCV | code | 71 |
| Xception: Deep Learning With Depthwise Separable Convolutions | CVPR | code | 71 |
| Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning | CVPR | code | 71 |
| Video Propagation Networks | CVPR | code | 70 |
| Image-To-Image Translation With Conditional Adversarial Networks | CVPR | code | 70 |
| Quality Aware Network for Set to Set Recognition | CVPR | code | 69 |
| Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces | CVPR | code | 69 |
| Deep Subspace Clustering Networks | NIPS | code | 68 |
| Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models | ICCV | code | 68 |
| A Distributional Perspective on Reinforcement Learning | ICML | code | 68 |
| Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks | CVPR | code | 67 |
| Deep Transfer Learning with Joint Adaptation Networks | ICML | code | 67 |
| Training Deep Networks without Learning Rates Through Coin Betting | NIPS | code | 66 |
| Full Resolution Image Compression With Recurrent Neural Networks | CVPR | code | 66 |
| SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis | ICCV | code | 66 |
| Doubly Stochastic Variational Inference for Deep Gaussian Processes | NIPS | code | 66 |
| TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals | ICCV | code | 66 |
| Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification | ICCV | code | 65 |
| Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks | CVPR | code | 65 |
| Dance Dance Convolution | ICML | code | 65 |
| Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning | CVPR | code | 64 |
| Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes | ICCV | code | 64 |
| Toward Controlled Generation of Text | ICML | code | 63 |
| Person Re-Identification in the Wild | CVPR | code | 63 |
| ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching | NIPS | code | 63 |
| Differentiable Learning of Logical Rules for Knowledge Base Reasoning | NIPS | code | 62 |
| Person Search With Natural Language Description | CVPR | code | 61 |
| Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising | ICCV | code | 61 |
| Playing for Benchmarks | ICCV | code | 61 |
| Unsupervised Learning by Predicting Noise | ICML | code | 60 |
| Localizing Moments in Video With Natural Language | ICCV | code | 60 |
| End-To-End 3D Face Reconstruction With Deep Neural Networks | CVPR | code | 60 |
| CoupleNet: Coupling Global Structure With Local Parts for Object Detection | ICCV | code | 59 |
| AdaGAN: Boosting Generative Models | NIPS | code | 59 |
| Convolutional Gaussian Processes | NIPS | code | 57 |
| A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection | CVPR | code | 57 |
| Modeling Relationships in Referential Expressions With Compositional Modular Networks | CVPR | code | 57 |
| Curiosity-driven Exploration by Self-supervised Prediction | ICML | code | 56 |
| Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution | ICCV | code | 56 |
| The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process | NIPS | code | 56 |
| Online and Linear-Time Attention by Enforcing Monotonic Alignments | ICML | code | 56 |
| Neural Expectation Maximization | NIPS | code | 56 |
| Dense-Captioning Events in Videos | ICCV | code | 55 |
| Factorized Bilinear Models for Image Recognition | ICCV | code | 55 |
| Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee | NIPS | code | 54 |
| On-the-fly Operation Batching in Dynamic Computation Graphs | NIPS | code | 54 |
| Visual Translation Embedding Network for Visual Relation Detection | CVPR | code | 54 |
| Learning Blind Motion Deblurring | ICCV | code | 54 |
| A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning | NIPS | code | 53 |
| Towards Diverse and Natural Image Descriptions via a Conditional GAN | ICCV | code | 53 |
| CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos | CVPR | code | 53 |
| A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing | ICCV | code | 52 |
| Deep IV: A Flexible Approach for Counterfactual Prediction | ICML | code | 52 |
| Triangle Generative Adversarial Networks | NIPS | code | 51 |
| EAST: An Efficient and Accurate Scene Text Detector | CVPR | code | 51 |
| SST: Single-Stream Temporal Action Proposals | CVPR | code | 51 |
| Predicting Deeper Into the Future of Semantic Segmentation | ICCV | code | 51 |
| L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space | CVPR | code | 51 |
| TALL: Temporal Activity Localization via Language Query | ICCV | code | 50 |
| Hybrid Reward Architecture for Reinforcement Learning | NIPS | code | 50 |
| Fast Fourier Color Constancy | CVPR | code | 49 |
| Modulating early visual processing by language | NIPS | code | 49 |
| Adversarial Examples for Semantic Segmentation and Object Detection | ICCV | code | 49 |
| Learning Discrete Representations via Information Maximizing Self-Augmented Training | ICML | code | 49 |
| Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations | CVPR | code | 48 |
| Real Time Image Saliency for Black Box Classifiers | NIPS | code | 48 |
| FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling | CVPR | code | 47 |
| Multiple People Tracking by Lifted Multicut and Person Re-Identification | CVPR | code | 47 |
| Learned D-AMP: Principled Neural Network based Compressive Image Recovery | NIPS | code | 47 |
| GP CaKe: Effective brain connectivity with causal kernels | NIPS | code | 46 |
| Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network | NIPS | code | 46 |
| Semantic Video CNNs Through Representation Warping | ICCV | code | 46 |
| Grammar Variational Autoencoder | ICML | code | 46 |
| EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis | ICCV | code | 46 |
| Safe Model-based Reinforcement Learning with Stability Guarantees | NIPS | code | 45 |
| Deep Spectral Clustering Learning | ICML | code | 45 |
| Semantic Compositional Networks for Visual Captioning | CVPR | code | 45 |
| On-Demand Learning for Deep Image Restoration | ICCV | code | 45 |
| Video Pixel Networks | ICML | code | 45 |
| Stabilizing Training of Generative Adversarial Networks through Regularization | NIPS | code | 45 |
| Structured Bayesian Pruning via Log-Normal Multiplicative Noise | NIPS | code | 44 |
| Deriving Neural Architectures from Sequence and Graph Kernels | ICML | code | 44 |
| Masked Autoregressive Flow for Density Estimation | NIPS | code | 44 |
| Unsupervised Adaptation for Deep Stereo | ICCV | code | 44 |
| Learning Residual Images for Face Attribute Manipulation | CVPR | code | 43 |
| Learning to Generate Long-term Future via Hierarchical Prediction | ICML | code | 43 |
| Accurate Optical Flow via Direct Cost Volume Processing | CVPR | code | 42 |
| Generalized Orderless Pooling Performs Implicit Salient Matching | ICCV | code | 42 |
| Comparative Evaluation of Hand-Crafted and Learned Local Features | CVPR | code | 42 |
| SchNet: A continuous-filter convolutional neural network for modeling quantum interactions | NIPS | code | 41 |
| Temporal Generative Adversarial Nets With Singular Value Clipping | ICCV | code | 41 |
| Multiplicative Normalizing Flows for Variational Bayesian Neural Networks | ICML | code | 41 |
| Neural Scene De-Rendering | CVPR | code | 40 |
| Semantic Image Inpainting With Deep Generative Models | CVPR | code | 40 |
| A Linear-Time Kernel Goodness-of-Fit Test | NIPS | code | 40 |
| Least Squares Generative Adversarial Networks | ICCV | code | 39 |
| Diversified Texture Synthesis With Feed-Forward Networks | CVPR | code | 39 |
| No Fuss Distance Metric Learning Using Proxies | ICCV | code | 38 |
| Template Matching With Deformable Diversity Similarity | CVPR | code | 38 |
| What's in a Question: Using Visual Questions as a Form of Supervision | CVPR | code | 38 |
| Face Normals "In-The-Wild" Using Fully Convolutional Networks | CVPR | code | 38 |
| Conditional Image Synthesis with Auxiliary Classifier GANs | ICML | code | 37 |
| Neural Episodic Control | ICML | code | 37 |
| 3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks | ICCV | code | 37 |
| Structured Embedding Models for Grouped Data | NIPS | code | 36 |
| Learning Active Learning from Data | NIPS | code | 36 |
| Unified Deep Supervised Domain Adaptation and Generalization | ICCV | code | 35 |
| Transformation-Grounded Image Generation Network for Novel 3D View Synthesis | CVPR | code | 35 |
| Structured Attentions for Visual Question Answering | ICCV | code | 34 |
| Geometric Loss Functions for Camera Pose Regression With Deep Learning | CVPR | code | 34 |
| VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization | CVPR | code | 34 |
| QMDP-Net: Deep Learning for Planning under Partial Observability | NIPS | code | 34 |
| Using Ranking-CNN for Age Estimation | CVPR | code | 33 |
| Hierarchical Boundary-Aware Neural Encoder for Video Captioning | CVPR | code | 33 |
| Unsupervised Learning of Disentangled Representations from Video | NIPS | code | 32 |
| Deep Learning on Lie Groups for Skeleton-Based Action Recognition | CVPR | code | 32 |
| Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection | CVPR | code | 32 |
| 3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder | CVPR | code | 32 |
| StyleNet: Generating Attractive Visual Captions With Styles | CVPR | code | 32 |
| Dynamic Word Embeddings | ICML | code | 32 |
| Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon | NIPS | code | 31 |
| Continual Learning Through Synaptic Intelligence | ICML | code | 31 |
| Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes | CVPR | code | 31 |
| Learning Detection With Diverse Proposals | CVPR | code | 31 |
| LCNN: Lookup-Based Convolutional Neural Network | CVPR | code | 31 |
| Towards Accurate Multi-Person Pose Estimation in the Wild | CVPR | code | 30 |
| Real-Time Neural Style Transfer for Videos | CVPR | code | 30 |
| Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training | ICCV | code | 30 |
| Deep Co-Occurrence Feature Learning for Visual Object Recognition | CVPR | code | 29 |
| Joint distribution optimal transportation for domain adaptation | NIPS | code | 29 |
| Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields | CVPR | code | 29 |
| SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization | ICML | code | 29 |
| The Statistical Recurrent Unit | ICML | code | 29 |
| A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation | CVPR | code | 28 |
| Learning Spread-Out Local Feature Descriptors | ICCV | code | 28 |
| Event-Based Visual Inertial Odometry | CVPR | code | 27 |
| DropoutNet: Addressing Cold Start in Recommender Systems | NIPS | code | 27 |
| Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues | ICCV | code | 27 |
| Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations | CVPR | code | 27 |
| Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos | CVPR | code | 27 |
| Neural Message Passing for Quantum Chemistry | ICML | code | 27 |
| State-Frequency Memory Recurrent Neural Networks | ICML | code | 27 |
| DeepCD: Learning Deep Complementary Descriptors for Patch Representations | ICCV | code | 26 |
| Contrastive Learning for Image Captioning | NIPS | code | 26 |
| Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite Sum Structure | NIPS | code | 26 |
| Learning High Dynamic Range From Outdoor Panoramas | ICCV | code | 26 |
| Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors | CVPR | code | 26 |
| Learning to Detect Salient Objects With Image-Level Supervision | CVPR | code | 26 |
| Improved Variational Autoencoders for Text Modeling using Dilated Convolutions | ICML | code | 26 |
| Interspecies Knowledge Transfer for Facial Keypoint Detection | CVPR | code | 25 |
| YASS: Yet Another Spike Sorter | NIPS | code | 25 |
| Open Set Domain Adaptation | ICCV | code | 25 |
| Domain-Adaptive Deep Network Compression | ICCV | code | 24 |
| Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization | ICCV | code | 24 |
| Temporal Context Network for Activity Localization in Videos | ICCV | code | 24 |
| Incremental Learning of Object Detectors Without Catastrophic Forgetting | ICCV | code | 24 |
| Dense Captioning With Joint Inference and Visual Context | CVPR | code | 24 |
| Universal Adversarial Perturbations | CVPR | code | 24 |
| Asymmetric Tri-training for Unsupervised Domain Adaptation | ICML | code | 24 |
| Reducing Reparameterization Gradient Variance | NIPS | code | 24 |
| Exploiting Saliency for Object Segmentation From Image Level Labels | CVPR | code | 24 |
| A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering | NIPS | code | 24 |
| Shading Annotations in the Wild | CVPR | code | 24 |
| Straight to Shapes: Real-Time Detection of Encoded Shapes | CVPR | code | 23 |
| Dual Discriminator Generative Adversarial Nets | NIPS | code | 23 |
| Zero-Order Reverse Filtering | ICCV | code | 23 |
| Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net | NIPS | code | 23 |
| Learning Spherical Convolution for Fast Features from 360° Imagery | NIPS | code | 22 |
| Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier | ICML | code | 22 |
| Deep Cross-Modal Hashing | CVPR | code | 22 |
| When Unsupervised Domain Adaptation Meets Tensor Representations | ICCV | code | 22 |
| Image Super-Resolution Using Dense Skip Connections | ICCV | code | 22 |
| Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer | CVPR | code | 22 |
| STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling | CVPR | code | 22 |
| Learning Continuous Semantic Representations of Symbolic Expressions | ICML | code | 22 |
| Deep Growing Learning | ICCV | code | 21 |
| Combined Group and Exclusive Sparsity for Deep Neural Networks | ICML | code | 21 |
| Hash Embeddings for Efficient Word Representations | NIPS | code | 21 |
| Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM | NIPS | code | 21 |
| Disentangled Representation Learning GAN for Pose-Invariant Face Recognition | CVPR | code | 21 |
| Learning to Pivot with Adversarial Networks | NIPS | code | 21 |
| Learning Dynamic Siamese Network for Visual Object Tracking | ICCV | code | 21 |
| POSEidon: Face-From-Depth for Driver Pose Estimation | CVPR | code | 20 |
| Deep Metric Learning via Facility Location | CVPR | code | 20 |
| Automatic Spatially-Aware Fashion Concept Discovery | ICCV | code | 20 |
| The Numerics of GANs | NIPS | code | 20 |
| From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur | CVPR | code | 20 |
| Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks | ICCV | code | 20 |
| Zero-Inflated Exponential Family Embeddings | ICML | code | 20 |
| InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations | NIPS | code | 20 |
| Weakly-Supervised Learning of Visual Relations | ICCV | code | 20 |
| Multi-Label Image Recognition by Recurrently Discovering Attentional Regions | ICCV | code | 20 |
| Scene Parsing With Global Context Embedding | ICCV | code | 20 |
| Context Selection for Embedding Models | NIPS | code | 20 |
| Deep Mean-Shift Priors for Image Restoration | NIPS | code | 20 |
| Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition | CVPR | code | 20 |
| Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification | CVPR | code | 19 |
| Learning Compact Geometric Features | ICCV | code | 19 |
| Structured Generative Adversarial Networks | NIPS | code | 19 |
| Joint Gap Detection and Inpainting of Line Drawings | CVPR | code | 19 |
| Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection | ICCV | code | 19 |
| Adversarial Feature Matching for Text Generation | ICML | code | 18 |
| BIER - Boosting Independent Embeddings Robustly | ICCV | code | 18 |
| Predictive-Corrective Networks for Action Detection | CVPR | code | 18 |
| Stochastic Generative Hashing | ICML | code | 18 |
| A Bayesian Data Augmentation Approach for Learning Deep Models | NIPS | code | 18 |
| Attentive Semantic Video Generation Using Captions | ICCV | code | 18 |
| MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network | CVPR | code | 18 |
| Deep Unsupervised Similarity Learning Using Partially Ordered Sets | CVPR | code | 17 |
| DualNet: Learn Complementary Features for Image Recognition | ICCV | code | 17 |
| Neural system identification for large populations separating “what” and “where” | NIPS | code | 17 |
| FALKON: An Optimal Large Scale Kernel Method | NIPS | code | 17 |
| Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks | CVPR | code | 17 |
| Deep Learning with Topological Signatures | NIPS | code | 17 |
| Streaming Sparse Gaussian Process Approximations | NIPS | code | 17 |
| RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos | ICCV | code | 17 |
| Awesome Typography: Statistics-Based Text Effects Transfer | CVPR | code | 17 |
| RoomNet: End-To-End Room Layout Estimation | ICCV | code | 17 |
| Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval | ICCV | code | 16 |
| Deep Supervised Discrete Hashing | NIPS | code | 16 |
| Few-Shot Learning Through an Information Retrieval Lens | NIPS | code | 16 |
| Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach | NIPS | code | 16 |
| Learning to Push the Limits of Efficient FFT-Based Image Deconvolution | ICCV | code | 16 |
| Federated Multi-Task Learning | NIPS | code | 16 |
| Label Distribution Learning Forests | NIPS | code | 16 |
| Deep Multitask Architecture for Integrated 2D and 3D Human Sensing | CVPR | code | 16 |
| Estimating Mutual Information for Discrete-Continuous Mixtures | NIPS | code | 16 |
| Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes | CVPR | code | 16 |
| StyleBank: An Explicit Representation for Neural Image Style Transfer | CVPR | code | 16 |
| Surface Normals in the Wild | ICCV | code | 15 |
| Automatic Discovery of the Statistical Types of Variables in a Dataset | ICML | code | 15 |
| Learning Diverse Image Colorization | CVPR | code | 15 |
| Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems | ICCV | code | 15 |
| Non-Local Deep Features for Salient Object Detection | CVPR | code | 15 |
| Structure-Measure: A New Way to Evaluate Foreground Maps | ICCV | code | 15 |
| Shallow Updates for Deep Reinforcement Learning | NIPS | code | 15 |
| Wasserstein Generative Adversarial Networks | ICML | code | 15 |
| Recurrent 3D Pose Sequence Machines | CVPR | code | 15 |
| Variational Dropout Sparsifies Deep Neural Networks | ICML | code | 15 |
| Captioning Images With Diverse Objects | CVPR | code | 15 |
| Off-policy evaluation for slate recommendation | NIPS | code | 15 |
| Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning | ICCV | code | 14 |
| Benchmarking Denoising Algorithms With Real Photographs | CVPR | code | 14 |
| Neural Aggregation Network for Video Face Recognition | CVPR | code | 14 |
| Learned Contextual Feature Reweighting for Image Geo-Localization | CVPR | code | 14 |
| Streaming Weak Submodularity: Interpreting Neural Networks on the Fly | NIPS | code | 14 |
| CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training | ICCV | code | 14 |
| VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation | ICCV | code | 14 |
| Spherical convolutions and their application in molecular modelling | NIPS | code | 14 |
| Multi-Information Source Optimization | NIPS | code | 14 |
| Convolutional Neural Network Architecture for Geometric Matching | CVPR | code | 14 |
| Neural Face Editing With Intrinsic Image Disentangling | CVPR | code | 14 |
| Realistic Dynamic Facial Textures From a Single Image Using GANs | ICCV | code | 14 |
| Predictive State Recurrent Neural Networks | NIPS | code | 13 |
| Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework | ICCV | code | 13 |
| ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events | NIPS | code | 13 |
| Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs | NIPS | code | 13 |
| Consensus Convolutional Sparse Coding | ICCV | code | 13 |
| Weakly Supervised Affordance Detection | CVPR | code | 13 |
| Joint Learning of Object and Action Detectors | ICCV | code | 13 |
| Light Field Blind Motion Deblurring | CVPR | code | 13 |
| Asynchronous Stochastic Gradient Descent with Delay Compensation | ICML | code | 13 |
| Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations | ICCV | code | 12 |
| Maximizing Subset Accuracy with Recurrent Neural Networks in Multi-label Classification | NIPS | code | 12 |
| Self-Organized Text Detection With Minimal Post-Processing via Border Learning | ICCV | code | 12 |
| Coordinated Multi-Agent Imitation Learning | ICML | code | 12 |
| Gradient descent GAN optimization is locally stable | NIPS | code | 12 |
| Removing Rain From Single Images via a Deep Detail Network | CVPR | code | 12 |
| Convexified Convolutional Neural Networks | ICML | code | 12 |
| Multigrid Neural Architectures | CVPR | code | 12 |
| VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization | ICCV | code | 12 |
| Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin | NIPS | code | 12 |
| Differential Angular Imaging for Material Recognition | CVPR | code | 12 |
| A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras | ICCV | code | 11 |
| Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation | NIPS | code | 11 |
| Max-value Entropy Search for Efficient Bayesian Optimization | ICML | code | 11 |
| Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization | ICCV | code | 11 |
| Generalized Deep Image to Image Regression | CVPR | code | 11 |
| Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective | ICCV | code | 11 |
| Predicting Human Activities Using Stochastic Grammar | ICCV | code | 11 |
| DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents | CVPR | code | 11 |
| Fisher GAN | NIPS | code | 11 |
| High-Order Attention Models for Visual Question Answering | NIPS | code | 11 |
| IM2CAD | CVPR | code | 11 |
| On Fairness and Calibration | NIPS | code | 11 |
| DeepPermNet: Visual Permutation Learning | CVPR | code | 10 |
| f-GANs in an Information Geometric Nutshell | NIPS | code | 10 |
| Revisiting IM2GPS in the Deep Learning Era | ICCV | code | 10 |
| Attentional Correlation Filter Network for Adaptive Visual Tracking | CVPR | code | 10 |
| Learning Cross-Modal Deep Representations for Robust Pedestrian Detection | CVPR | code | 10 |
| Confident Multiple Choice Learning | ICML | code | 10 |
| Curriculum Dropout | ICCV | code | 9 |
| Cognitive Mapping and Planning for Visual Navigation | CVPR | code | 9 |
| Optimized Pre-Processing for Discrimination Prevention | NIPS | code | 9 |
| Learning Motion Patterns in Videos | CVPR | code | 9 |
| Scalable Log Determinants for Gaussian Process Kernel Learning | NIPS | code | 9 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs | CVPR | code | 9 |
| Deep Crisp Boundaries | CVPR | code | 9 |
| Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization | NIPS | code | 9 |
| Practical Data-Dependent Metric Compression with Provable Guarantees | NIPS | code | 9 |
| Do Deep Neural Networks Suffer from Crowding? | NIPS | code | 9 |
| A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting | CVPR | code | 9 |
| End-To-End Learning of Geometry and Context for Deep Stereo Regression | ICCV | code | 9 |
| From Bayesian Sparsity to Gated Recurrent Nets | NIPS | code | 8 |
| Regret Minimization in MDPs with Options without Prior Knowledge | NIPS | code | 8 |
| Following Gaze in Video | ICCV | code | 8 |
| Model-Powered Conditional Independence Test | NIPS | code | 8 |
| Cost efficient gradient boosting | NIPS | code | 8 |
| Reflectance Adaptive Filtering Improves Intrinsic Image Estimation | CVPR | code | 8 |
| DeepNav: Learning to Navigate Large Cities | CVPR | code | 8 |
| Look, Listen and Learn | ICCV | code | 8 |
| Attention-Aware Face Hallucination via Deep Reinforcement Learning | CVPR | code | 8 |
| Plan, Attend, Generate: Planning for Sequence-to-Sequence Models | NIPS | code | 8 |
| Introspective Neural Networks for Generative Modeling | ICCV | code | 8 |
| Affinity Clustering: Hierarchical Clustering at Scale | NIPS | code | 8 |
| Gaze Embeddings for Zero-Shot Image Classification | CVPR | code | 8 |
| Input Switched Affine Networks: An RNN Architecture Designed for Interpretability | ICML | code | 8 |
| Online multiclass boosting | NIPS | code | 8 |
| Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images | ICCV | code | 8 |
| SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition | ICCV | code | 7 |
| Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition | NIPS | code | 7 |
| Unsupervised Monocular Depth Estimation With Left-Right Consistency | CVPR | code | 7 |
| Personalized Image Aesthetics | ICCV | code | 7 |
| Reasoning About Fine-Grained Attribute Phrases Using Reference Games | ICCV | code | 7 |
| Lost Relatives of the Gumbel Trick | ICML | code | 7 |
| Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction | ICCV | code | 7 |
| Centered Weight Normalization in Accelerating Training of Deep Neural Networks | ICCV | code | 6 |
| Scalable Planning with Tensorflow for Hybrid Nonlinear Domains | NIPS | code | 6 |
| Convex Global 3D Registration With Lagrangian Duality | CVPR | code | 6 |
| Building a Regular Decision Boundary With Deep Networks | CVPR | code | 6 |
| Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification | CVPR | code | 6 |
| Forecasting Human Dynamics From Static Images | CVPR | code | 6 |
| AOD-Net: All-In-One Dehazing Network | ICCV | code | 6 |
| K-Medoids For K-Means Seeding | NIPS | code | 6 |
| Diverse Image Annotation | CVPR | code | 6 |
| Practical Hash Functions for Similarity Estimation and Dimensionality Reduction | NIPS | code | 6 |
| Deep Adaptive Image Clustering | ICCV | code | 6 |
| Robust Adversarial Reinforcement Learning | ICML | code | 6 |
| Improving Training of Deep Neural Networks via Singular Value Bounding | CVPR | code | 6 |
| Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems | NIPS | code | 6 |
| Tensor Belief Propagation | ICML | code | 6 |
| Sparse convolutional coding for neuronal assembly detection | NIPS | code | 6 |
| Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks | CVPR | code | 6 |
| Bayesian inference on random simple graphs with power law degree distributions | ICML | code | 6 |
| Tensor Biclustering | NIPS | code | 6 |
| Riemannian approach to batch normalization | NIPS | code | 6 |
| Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings | ICCV | code | 6 |
| Rolling-Shutter-Aware Differential SfM and Image Rectification | ICCV | code | 5 |
| Active Decision Boundary Annotation With Deep Generative Models | ICCV | code | 5 |
| Object Co-Skeletonization With Co-Segmentation | CVPR | code | 5 |
| Discover and Learn New Objects From Documentaries | CVPR | code | 5 |
| Understanding Black-box Predictions via Influence Functions | ICML | code | 5 |
| Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach | CVPR | code | 5 |
| Decoupling "when to update" from "how to update" | NIPS | code | 5 |
| MarioQA: Answering Questions by Watching Gameplay Videos | ICCV | code | 5 |
| Differentially private Bayesian learning on distributed data | NIPS | code | 5 |
| Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization | ICCV | code | 5 |
| Question Asking as Program Generation | NIPS | code | 5 |
| Conic Scan-and-Cover algorithms for nonparametric topic modeling | NIPS | code | 5 |
| Lip Reading Sentences in the Wild | CVPR | code | 5 |
| ROAM: A Rich Object Appearance Model With Application to Rotoscoping | CVPR | code | 5 |
| NeuralFDR: Learning Discovery Thresholds from Hypothesis Features | NIPS | code | 5 |
| Viraliency: Pooling Local Virality | CVPR | code | 5 |
| Learning Algorithms for Active Learning | ICML | code | 5 |
| Point to Set Similarity Based Deep Feature Learning for Person Re-Identification | CVPR | code | 5 |
| Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation | ICCV | code | 5 |
| The World of Fast Moving Objects | CVPR | code | 5 |
| Cross-Modality Binary Code Learning via Fusion Similarity Hashing | CVPR | code | 5 |
| Testing and Learning on Distributions with Symmetric Noise Invariance | NIPS | code | 5 |
| Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference | NIPS | code | 5 |
| Diving into the shallows: a computational perspective on large-scale shallow learning | NIPS | code | 5 |
| Rotation Equivariant Vector Field Networks | ICCV | code | 5 |
| Recursive Sampling for the Nystrom Method | NIPS | code | 5 |
| Learning From Video and Text via Large-Scale Discriminative Clustering | ICCV | code | 5 |
| Global optimization of Lipschitz functions | ICML | code | 5 |
| Device Placement Optimization with Reinforcement Learning | ICML | code | 4 |
| Alternating Direction Graph Matching | CVPR | code | 4 |
| MEC: Memory-efficient Convolution for Deep Neural Network | ICML | code | 4 |
| Expert Gate: Lifelong Learning With a Network of Experts | CVPR | code | 4 |
| A Simple yet Effective Baseline for 3D Human Pose Estimation | ICCV | code | 4 |
| On Structured Prediction Theory with Calibrated Convex Surrogate Losses | NIPS | code | 4 |
| Sub-sampled Cubic Regularization for Non-convex Optimization | ICML | code | 4 |
| Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval | CVPR | code | 4 |
| Bottleneck Conditional Density Estimation | ICML | code | 4 |
| Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning | ICCV | code | 4 |
| Multi-way Interacting Regression via Factorization Machines | NIPS | code | 4 |
| Joint Discovery of Object States and Manipulation Actions | ICCV | code | 4 |
| Predicting Salient Face in Multiple-Face Videos | CVPR | code | 4 |
| From Red Wine to Red Tomato: Composition With Context | CVPR | code | 4 |
| Encoder Based Lifelong Learning | ICCV | code | 4 |
| Deep Recurrent Neural Network-Based Identification of Precursor microRNAs | NIPS | code | 4 |
| Guarantees for Greedy Maximization of Non-submodular Functions with Applications | ICML | code | 4 |
| Pose-Aware Person Recognition | CVPR | code | 4 |
| Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths | CVPR | code | 4 |
| Asynchronous Distributed Variational Gaussian Processes for Regression | ICML | code | 3 |
| Saliency Pattern Detection by Ranking Structured Trees | ICCV | code | 3 |
| Toward Goal-Driven Neural Network Models for the Rodent Whisker-Trigeminal System | NIPS | code | 3 |
| Learning Non-Maximum Suppression | CVPR | code | 3 |
| Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC | ICML | code | 3 |
| Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries | CVPR | code | 3 |
| AdaNet: Adaptive Structural Learning of Artificial Neural Networks | ICML | code | 3 |
| Large Margin Object Tracking With Circulant Feature Maps | CVPR | code | 3 |
| Compatible Reward Inverse Reinforcement Learning | NIPS | code | 3 |
| Adversarial Surrogate Losses for Ordinal Regression | NIPS | code | 3 |
| Non-monotone Continuous DR-submodular Maximization: Structure and Algorithms | NIPS | code | 3 |
| Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning | NIPS | code | 3 |
| A framework for Multi-A(rmed)/B(andit) Testing with Online FDR Control | NIPS | code | 3 |
| Counting Everyday Objects in Everyday Scenes | CVPR | code | 3 |
| Loss Max-Pooling for Semantic Image Segmentation | CVPR | code | 3 |
| Aesthetic Critiques Generation for Photos | ICCV | code | 3 |
| Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems | NIPS | code | 3 |
| Near-Optimal Edge Evaluation in Explicit Generalized Binomial Graphs | NIPS | code | 3 |