Publications on Google Scholar
Publications on Semantic Scholar
2023
Inproceedings

Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems Inproceedings
In: USENIX Security Symposium (USENIX Security)}, 2023.

UnGANable: Defending Against GAN-based Face Manipulation Inproceedings
In: USENIX Security Symposium (USENIX Security), 2023.
2022
Journal Articles

Understanding Utility and Privacy of Demographic Data in Education Technology by Causal Analysis and Adversarial-Censoring Journal Article
In: Proceedings on Privacy Enhancing Technologies, vol. 2022, no. 2, pp. 245–262, 2022.
Inproceedings

Private Set Generation with Discriminative Information Inproceedings
In: Neural Information Processing Systems (NeurIPS), 2022.
ML-Doctor: Holistic Risk Assessment of Inference Attacks Against Machine Learning Models Inproceedings
In: USENIX Security Symposium (USENIX Security), 2022.

ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training Inproceedings
In: International Conference on Machine Learning (ICML), 2022.

B-cos Networks: Alignment is All We Need for Interpretability Inproceedings
In: Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources Inproceedings
In: Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

Responsible Disclosure of Generative Models Using Scalable Fingerprinting Inproceedings
In: International Conference on Representation Learning (ICLR), 2022.

RelaxLoss: Defending Membership Inference Attacks without Losing Utility Inproceedings
In: International Conference on Representation Learning (ICLR), 2022.

Practical Challenges in Differentially-Private Federated Survival Analysis of Medical Data Inproceedings
In: Conference on Health, Inference, and Learning (CHIL), 2022.
2021
Journal Articles

Semantic Bottlenecks: Quantifying and Improving Inspectability of Deep Representations Journal Article
In: International Journal of Computer Vision (IJCV), 2021.

Privacy considerations for sharing genomics data Journal Article
In: EXCLI Journal, 2021.
Inproceedings

Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data Inproceedings
In: International Conference on Computer Vision (ICCV), 2021.

Dual Contrastive Loss and Attention for GANs Inproceedings
In: International Conference on Computer Vision (ICCV), 2021.

Beyond the Spectrum: Detecting Deepfakes via Re-Synthesis Inproceedings
In: 30th International Joint Conference on Artificial Intelligence (IJCAI), 2021.

Convolutional Dynamic Alignment Networks for Interpretable Classifications Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding Inproceedings
In: IEEE Symposium on Security and Privacy (S&P), 2021.

Future Moment Assessment for Action Query Inproceedings
In: IEEE Winter Conference on Applications of Computer Vision (WACV ’20), 2021.
Technical Reports

Backdoor Attacks on Network Certification via Data Poisoning Technical Report
arXiv:2108.11299, 2021.

ML-Doctor: Holistic Risk Assessment of Inference Attacks Against Machine Learning Models Technical Report
arXiv:2102.02551, 2021.
Workshops

Moving Target Defense Workshop in conjuncture with CCS, 2021.

SampleFix: Learning to Generate Functionally Diverse Fixes Workshop
1st International Workshop on Machine Learning in Software Engineering in conjuncture with ECML PKDD, Springer, 2021.

IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis Workshop
1st International Workshop on Machine Learning in Software Engineering in conjuncture with ECML PKDD, Springer, 2021.

InfoScrub: Towards Attribute Privacy by Targeted Obfuscation Workshop
CVPR Workshop on Fair, Data-Efficient, and Trusted Computer Vision (TCV), 2021.

MLCapsule: Guarded Offline Deployment of Machine Learning as a Service Workshop
CVPR Workshop on Fair, Data-Efficient, and Trusted Computer Vision (TCV), 2021.
2020
Journal Articles

Person Recognition in Personal Photo Collections Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.

Deep Gaze Pooling: Inferring and Visually Decoding Search Intents From Human Gaze Fixations Journal Article
In: Neurocomputing, vol. 387, pp. 369–382, 2020.
Inproceedings

GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators Inproceedings
In: Neural Information Processing Systems (NeurIPS), 2020.

GAN-Leaks: A Taxonomy of Membership Inference Attacks against GANs Inproceedings
In: ACM Conference on Computer and Communications Security (CCS) , 2020.

VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity Inproceedings
In: ACM Conference on Computer and Communications Security (CCS) , 2020.

Haar Wavelet based Block Autoregressive Flows for Trajectories Inproceedings
In: German Conference on Pattern Recognition (GCPR), 2020.

Long-Tailed Recognition Using Class-Balanced Experts Inproceedings
In: German Conference on Pattern Recognition (GCPR), 2020.

Semantic Bottlenecks: Quantifying & Improving Inspectability of Deep Representations Inproceedings
In: German Conference on Patter Recognition (GCPR), 2020.

Towards Automated Testing and Robustification by Semantic Adversarial Data Generation Inproceedings
In: European Conference on Computer Vision (ECCV), 2020.

Inclusive GAN: Improving Data and Minority Coverage in Generative Models Inproceedings
In: European Conference on Computer Vision (ECCV), 2020.

Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation Inproceedings
In: European Conference on Computer Vision (ECCV), 2020.

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning Inproceedings
In: USENIX Security Symposium (USENIX Security 20), 2020.

Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition, 2020.

Normalizing Flows with Multi-scale Autoregressive Priors Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition, 2020.

Automatically Detecting Bystanders in Photos to Reduce Privacy Risks Inproceedings
In: IEEE Symposium on Security and Privacy (S&P), 2020.

Prediction Poisoning: Utility-Constrained Defenses Against Model Stealing Attacks Inproceedings
In: International Conference on Representation Learning (ICLR), 2020.
Technical Reports

CosSGD: Nonlinear Quantization for Communication-efficient Federated Learning Technical Report
arXiv:2012.08241 , 2020.

Responsible Disclosure of Generative Models Using Scalable Fingerprinting Technical Report
arXiv:2012.08726 , 2020.

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs Technical Report
arXiv:2011.14107 , 2020.

Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding Technical Report
arXiv:2009.03015 [cs.CR], 2020.

Black-Box Watermarking for Generative Adversarial Networks Technical Report
arXiv:2007.08457, 2020.

IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis Technical Report
arXiv:2006.10720, 2020.

GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators Technical Report
arXiv:2006.08265 , 2020.

InfoScrub: Towards Attribute Privacy by Targeted Obfuscation Technical Report
arXiv:2005.10329 , 2020.

Inclusive GAN: Improving Data and Minority Coverage in Generative Models Technical Report
2020.

Long-Tailed Recognition Using Class-Balanced Experts Technical Report
arXiv:2004.03706 , 2020.
Workshops

SampleFix: Learning to Correct Programs by Sampling Diverse Fixes Workshop
NeurIPS Workshop on Computer-Assisted Programming, 2020.

IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis Workshop
NeurIPS Workshop on Computer-Assisted Programming, 2020.

Haar Wavelet based Block Autoregressive Flows for Trajectories Workshop
NeurIPS Workshop on Machine Learning for Autonomous Driving, 2020.

Body Shape Privacy in Images: Understanding Privacy and Preventing Automatic Shape Extraction Workshop
Workshop on The Bright and Dark Sides of Computer Vision: Challenges and Opportunities for Privacy and Security CVCOPS (ECCV-W), 2020.

Synthetic Convolutional Features for Improved Semantic Segmentation Workshop
Workshop on Assistive Computer Vision and Robotics at European Conference on Computer Vision (ECCV-W), 2020.
2019
Journal Articles

MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation Journal Article
In: Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019.
Incollections

Towards reverse-engineering black-box neural networks Incollection
In: Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 2019.
Inproceedings

Attributing Fake Images to GANs: Learning and Analyzing GAN Fingerprints Inproceedings
In: International Conference on Computer Vision (ICCV), 2019.

Deep Appearance Maps Inproceedings
In: International Conference on Computer Vision (ICCV), 2019.

Knockoff Nets: Stealing Functionality of Black-Box Models Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

Time-Conditioned Action Anticipation in One Shot Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

Bayesian Prediction of Future Street Scenes using Synthetic Likelihoods Inproceedings
In: International Conference on Representation Learning (ICLR), 2019.

ML-Leaks: Model and Data Independent Membership Inference Attacks and Defenses on Machine Learning Models Inproceedings
In: Annual Network and Distributed System Security Symposium (NDSS), 2019.

Fashion is Taking Shape: Understanding Clothing Preference Based on Body Shape From Online Sources Inproceedings
In: IEEE Winter Conference on Applications of Computer Vision (WACV), 2019.
Technical Reports

Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation Technical Report
arXiv:1912.09685, 2019.


"Best-of-Many-Samples" Distribution Matching Technical Report
arXiv:1909.12598, 2019.

GAN-Leaks: A Taxonomy of Membership Inference Attacks against GANs Technical Report
arXiv:1909.03935, 2019.

WhiteNet: Phishing Website Detection by Visual Whitelists Technical Report
arXiv:1909.00300, 2019.

Conditional Flow Variational Autoencoders for Structured Sequence Prediction Technical Report
arXiv:1908.09008, 2019.

Interpretability Beyond Classification Output: Semantic Bottleneck Networks Technical Report
arXiv:1907.10882 , 2019.

Prediction Poisoning: Utility-Constrained Defenses Against Model Stealing Attacks Technical Report
2019.

SampleFix: Learning to Correct Programs by Sampling Diverse Fixes Technical Report
arXiv:1906.10502, 2019.

Shape Evasion: Preventing Body Shape Inference of Multi-Stage Approaches Technical Report
arXiv:1905.11503 , 2019.

Learning Manipulation under Physics Constraints with Visual Perception Technical Report
2019.

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning Technical Report
2019.
Workshops

Differential Privacy Defenses and Sampling Attacks for Membership Inference Workshop
NeurIPS Workshop on Privacy in Machine Learning (PRIML), 2019.

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning Workshop
Hot Topics in Privacy Enhancing Technologies (HotPETs), 2019.

Understanding and Recognizing Bystanders in Images for Privacy Protection Workshop
Privacy, Usability, and Transparency (PUT) @ PETs, 2019.
2018
Journal Articles

Advanced Steel Microstructural Classification by Deep Learning Methods Journal Article
In: Scientific Reports, 2018.

Reflectance and Natural Illumination from Single-Material Specular Objects Using Deep Learning Journal Article
In: Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018.
Inproceedings

Adversarial Scene Editing: Automatic Object Removal from Weak Supervision Inproceedings
In: Neural Information Processing Systems (NIPS), 2018.

Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes Inproceedings
In: European Conference on Computer Vision (ECCV), 2018.

A Hybrid Model for Identity Obfuscation by Face Replacement Inproceedings
In: European Conference on Computer Vision, 2018.

Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions Inproceedings
In: Visual Learning and Embodied Agents in Simulation Environments Workshop at European Conference on Computer Vision, 2018.

Sequential Attacks on Agents for Long-Term Adversarial Goals Inproceedings
In: 2. ACM Computer Science in Cars Symposium -- Future Challenges in Artificial Intelligence & Security for Autonomous Vehicles, 2018.

A4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation Inproceedings
In: 27th USENIX Security Symposium (USENIX Security 18), 2018.

Accurate and Diverse Sampling of Sequences based on a “Best of Many” Sample Objective Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Natural and Effective Obfuscation by Head Inpainting Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Disentangled Person Image Generation Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Long-Term On-Board Prediction of People in Traffic Scenes under Uncertainty Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Towards Reverse-Engineering Black-Box Neural Networks Inproceedings
In: Internation Conference on Representation Learning (ICLR), 2018.

Long-Term Image Boundary Prediction Inproceedings
In: Association for the Advancement of Artificial Intelligence (AAAI), 2018.
Technical Reports

Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation Technical Report
arXiv:1812.06707, 2018.

Knockoff Nets: Stealing Functionality of Black-Box Models Technical Report
arXiv:1812.02766, 2018.

Attributing Fake Images to GANs: Analyzing Fingerprints in Generated Images Technical Report
2018.

MLCapsule: Guarded Offline Deployment of Machine Learning as a Service Technical Report
arXiv:1808.00590 [cs.CR], 2018.

Fashion is Taking Shape: Understanding Clothing Preference Based on Body Shape From Online Sources Technical Report
arXiv:1807.03235, 2018.

ML-Leaks: Model and Data Independent Membership Inference Attacks and Defenses on Machine Learning Models Technical Report
arXiv:1806.01246 [cs.CR], 2018.

Adversarial Scene Editing: Automatic Object Removal from Weak Supervision Technical Report
2018.

Sequential Attacks on Agents for Long-Term Adversarial Goals Technical Report
arXiv:1805.12487, 2018.

Understanding and Controlling User Linkability in Decentralized Learning Technical Report
arXiv:1805.05838 [cs.CR], 2018.

A Hybrid Model for Identity Obfuscation by Face Replacement Technical Report
arXiv:1804.04779 [cs.CV], 2018.

Deep Appearance Maps Technical Report
arXiv:1804.00863 [cs.CV], 2018.
2017
Journal Articles

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Journal Article
In: International Journal of Computer Vision (IJCV), 2017, (to appear).

Novel Views of Objects from a Single Image Journal Article
In: Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017.
Inproceedings

Predicting the Category and Attributes of Visual Search Targets Using Deep Gaze Pooling Inproceedings
In: Mutual Benefits of Cognitive and Computer Vision Workshop at International Conference on Computer Vision (ICCV-W), 2017.

Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2017.

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2017.

Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2017, 2017.

What Is Around The Camera? Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2017.

Learning Dilation Factors for Semantic Segmentation of Street Scenes Inproceedings
In: German Conference on Pattern Recognition (GCPR), 2017.

STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

A Domain Based Approach to Social Relation Recognition Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

Exploiting saliency for object segmentation from image level labels Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

It's Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation Inproceedings
In: 1st International Workshop on Deep Affective Learning and Context Modeling at Computer Vision and Pattern Recognition Conference (CVPR-W), 2017.

Visual Stability Prediction for Robotic Manipulation Inproceedings
In: IEEE International Conference on Robotics and Automation (ICRA), 2017, (to appear).
Miscellaneous

Long-Term On-Board Prediction of Pedestrians in Traffic Scenes Miscellaneous
Conference on Robot Learning (CoRL), 2017.

From Understanding to Controlling Privacy against Automatic Person Identification in Social Media Miscellaneous
The Bright and Dark Sides of Computer Vision: Challenges and Opportunities for Privacy and Security (CV-COPS 2017), 2017.

Visual Stability Prediction and Its Application to Manipulation Miscellaneous
AAAI Spring Symposium Series: Interactive Multi-Sensory Object Perception for Embodied Agents, 2017.
Technical Reports

Disentangled Person Image Generation Technical Report
arXiv:1712.02621 [cs.CV], 2017.

Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images Technical Report
arXiv:1712.01066 [cs.CV], 2017.

Natural and Effective Obfuscation by Head Inpainting Technical Report
arXiv:1711.09001 [cs.CV], 2017.

Long-Term On-Board Prediction of People in Traffic Scenes under Uncertainty Technical Report
arXiv:1711.09026 [cs.CV], 2017.

A4NT : Author Attribute Anonymity by Adversarial Training of Neural Machine Translation Technical Report
arXiv:1711.01921, 2017.

Whitening Black-Box Neural Networks Technical Report
arXiv:1711.01768, 2017.

Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning Technical Report
arXiv:1711.00267, 2017.

Person Recognition in Social Media Photos Technical Report
no. arXiv:1710.03224 [cs.CV], 2017.

Advanced Steel Microstructure Classification by Deep Learning Methods Technical Report
arXiv:1706.06480 [cs.CV], 2017.

Visual Decoding of Targets During Visual Search From Human Eye Fixations Technical Report
arXiv:1706.05993 [cs.CV], 2017.

Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images Technical Report
arXiv:1703.10660 [cs.CV], 2017.

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training Technical Report
arXiv:1703.10476 [cs.CV], 2017.

Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective Technical Report
arXiv:1703.09471 [cs.CV], 2017.

Exploiting saliency for object segmentation from image level labels Technical Report
arXiv:1701.08261 [cs.CV], 2017.
2016
Inproceedings

Faceless Person Recognition; Privacy Implications in Social Media Inproceedings
In: European Conference on Computer Vision (ECCV), 2016, (to appear).

VConv-DAE: Deep Volumetric Shape Learning Without Object Labels Inproceedings
In: Geometry Meets Deep Learning Workshop at European Conference on Computer Vision (ECCV-W), 2016.

Towards Segmenting Consumer Stereo Videos: Benchmark, Baselines and Ensembles Inproceedings
In: Asian Conference on Computer Vision (ACCV), 2016, (to appear).

Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task Inproceedings
In: British Machine Vision Conference (BMVC), 2016, (to appear).

Deep Reflectance Maps Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

Multi-Cue Zero-Shot Learning with Strong Supervision Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

I-Pic: A Platform for Privacy-Compliant Image Capture Inproceedings
In: The 14th International Conference on Mobile Systems, Applications, and Services (MobiSys'16), Singapore, 2016.

Learning to Select Long Track Features for Structure-From-Motion & Visual SLAM Inproceedings
In: German Conference on Pattern Recognition (GCPR), 2016.

Contextual Media Retrieval Using Natural Language Queries Inproceedings
In: ACM International Conference on Multimedia Retrieval (ICMR), 2016, (to appear).

Recognition of Ongoing Complex Activities by Sequence Prediction over a Hierarchical Label Space Inproceedings
In: IEEE Winter Conference on Applications of Computer Vision (WACV), 2016.
Miscellaneous

Visual Stability Prediction and Its Application to Manipulation Miscellaneous
NIPS Workshop on Intuitive Physics, 2016.

Long Term Boundary Extrapolation for Deterministic Motion Miscellaneous
NIPS Workshop on Intuitive Physics, 2016.

Faceless Person Recognition; Privacy Implications in Social Media Miscellaneous
4th Workshop on Web-scale Vision and Social Media (VSM), ECCV'16, 2016.

Ask Your Neurons Again: Analysis of Deep Methods with Global Image Representation Miscellaneous
VQA Challenge Workshop at CVPR, 2016.
PhD Theses

Bayesian Non-Parametrics for Multi-Modal Segmentation PhD Thesis
2016.
Technical Reports

Predicting the Category and Attributes of Visual Search Targets Using Deep Gaze Pooling Technical Report
arXiv:1611.10162 [cs.CV], 2016.

Long-Term Image Boundary Extrapolation Technical Report
arXiv:1611.08841 [cs.CV], 2016.

Natural Illumination from Multiple Materials Using Deep Learning Technical Report
arXiv:1611.09325 [cs.CV], 2016.

It's Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation Technical Report
arXiv:1611.08860 [cs.CV], 2016.

Tutorial on Answering Questions about Images with Deep Learning Technical Report
arXiv:1610.01076 [cs.CV], 2016, (Tutorial given at 2nd Summer School on Integrating Vision and Language: Deep Learning).

Visual Stability Prediction and Its Application to Manipulation Technical Report
arXiv:1609.04861 [cs.CV], 2016.

Spatio-Temporal Image Boundary Extrapolation Technical Report
arXiv:1605.07363 [cs.CV], 2016.

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Technical Report
arXiv:1605.02697 [cs.CV], 2016.

VConv-DAE: Deep Volumetric Shape Learning Without Object Labels Technical Report
arXiv:1604.03755 [cs.CV], 2016.

RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling Technical Report
arXiv:1604.02388 [cs.CV], 2016.

To Fall Or Not To Fall: A Visual Approach to Physical Stability Prediction Technical Report
arXiv:1604.00066 [cs.CV], 2016.

DeLight-Net: Decomposing Reflectance Maps into Specular Materials and Natural Illumination Technical Report
arXiv:1602.00328 [cs.CV], 2016.

Multi-Cue Zero-Shot Learning with Strong Supervision Technical Report
arXiv:1603.08754 [cs.CV], 2016.

Novel Views of Objects from a Single Image Technical Report
arXiv:1602.00328 [cs.CV], 2016.

Contextual Media Retrieval Using Natural Language Queries Technical Report
arXiv:1602.04983 [cs.IR], 2016.
2015
Journal Articles

Learning to detect visual grasp affordance Journal Article
In: IEEE Transactions on Automation Science and Engineering (TASE), 2015.
Inproceedings

Ask Your Neurons: A Neural-based Approach to Answering Questions about Images Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2015, (oral).

See the Difference: Direct Pre-Image Reconstruction and Pose Estimation by Differentiating HOG Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2015.

Person Recognition in Personal Photo Collections Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2015.

Teaching Robots the Use of Human Tools from Demonstration with Non-Dexterous End-Effectors Inproceedings
In: IEEE RAS International Conference on Humanoid Robots (HUMANOIDS), 2015, (to appear).

Appearance-based gaze estimation in the wild Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.

Prediction of search targets from fixations in open-world settings Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.

Joint Segmentation and Activity Discovery using Semantic and Temporal Priors Inproceedings
In: IEEE Internation Conference on Pervasive Computing and Communication (PERCOM), 2015.

Hard to Cheat: A Turing Test based on Answering Questions about Images Inproceedings
In: AAAI Workshop Beyond The Turing Test, 2015.
Masters Theses

Contextual Media Retrieval Using Natural Language Queries Masters Thesis
Saarland University, 2015.
Miscellaneous

Bridging the Gap Between Synthetic and Real Data Miscellaneous
Machine Learning with Interdependent and Non-identically Distributed Data (Dagstuhl Seminar 15152), 2015, (to appear).
Technical Reports

Deep Reflectance Maps Technical Report
arXiv:1511.04384 [cs.CV], 2015.

Person Recognition in Personal Photo Collections Technical Report
arXiv:1509.03502 [cs.CV], 2015.

Appearance-based gaze estimation in the wild Technical Report
arXiv:1504.02863, 2015.

Prediction of search targets from fixations in open-world settings Technical Report
arXiv:1502.05137 [cs.CV], 2015.

Ask Your Neurons: A Neural-based Approach to Answering Questions about Images Technical Report
arXiv:1505.01121, 2015.

See the Difference: Direct Pre-Image Reconstruction and Pose Estimation by Differentiating HOG Technical Report
arXiv:1505.00663 [cs.CV], 2015.

GazeDPM: Early Integration of Gaze Information in Deformable Part Models Technical Report
arXiv:1505.05753 [cs.CV], 2015.
2014
Inproceedings

A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input Inproceedings
In: Neural Information Processing Systems (NIPS), 2014.

Anytime Recognition of Objects and Scenes Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, (oral).

Image-based Synthesis and Re-Synthesis of Viewpoints Guided by 3D Models Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, (oral).

Towards a Visual Turing Challenge Inproceedings
In: NIPS Workshop on Learning Semantics, 2014.

Object Disambiguation for Augmented Reality Applications Inproceedings
In: British Machine Vision Conference (BMVC), 2014.

Ubic: Bridging the Gap Between Digital Cryptography and the Physical World Inproceedings
In: European Symposium on Research in Computer Security (ESORICS), 2014.

Scene Segmentation in Adverse Vision Conditions Inproceedings
In: Young Researcher Forum at GCPR based on master thesis supervised by Mario Fritz, 2014.

Learning Multi-Scale Representations for Material Classification Inproceedings
In: Young Researcher Forum at GCPR based on master thesis supervised by Mario Fritz, 2014.
Technical Reports

A Pooling Approach to Modelling Spatial Relations for Image Retrieval and Annotation Technical Report
arXiv:1411.5190 [cs.CV], 2014.

Learning Multi-Scale Representations for Material Classification Technical Report
arXiv:1408.2938 [cs.CV], 2014.

Ubic: Bridging the gap between digital cryptography and the physical world Technical Report
arXiv:1403.1343 [cs.CR], 2014.
2013
Incollections

A Category-Level 3D Object Dataset: Putting the Kinect to Work Incollection
In: Fossati, Andrea; Gall, Juergen; Grabner, Helmut; Ren, Xiaofeng; Konolige, Kurt (Ed.): Consumer Depth Cameras for Computer Vision, Springer London, 2013.
Inproceedings

Sequential Bayesian Model Update under Structured Scene Prior for Semantic Road Scenes Labeling Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2013.

Learning Smooth Pooling Regions for Visual Recognition Inproceedings
In: British Machine Vision Conference (BMVC), 2013.

Dynamic Feature Selection for Classification on a Budget Inproceedings
In: ICML Workshop on Prediction with Sequential Models, 2013.

Multi-Class Video Co-Segmentation with a Generative Multi-Video Model Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.
Masters Theses

Scene Segmentation in Adverse Vision Conditions Masters Thesis
Saarland University, 2013.

Multi-Scale Feature Learning for Material Recognition Masters Thesis
Saarland University, 2013.
Technical Reports

Learnable Pooling Regions for Image Classification Technical Report
arXiv:1301.3516 [cs.CV], 2013, (Workshop at International Conference on Learning Representations).
2012
Journal Articles

A Geometric Approach To Robotic Laundry Folding Journal Article
In: International Journal of Robotics Research (IJRR), 2012.
Inproceedings

Kernel Density Topic Models: Visual Topics Without Visual Words Inproceedings
In: NIPS Workshop Modern Non-Parametric Methods in Machine Learning, 2012.

Active Metric Learning for Object Recognition Inproceedings
In: DAGM/OAGM Symposium, 2012.

Semi-Supervised Learning on a Budget: Scaling up to Large Datasets Inproceedings
In: Asian Conference on Computer Vision (ACCV), 2012.

The Pooled NBNN Kernel: Beyond Image-to-Class and Image-to-Image Inproceedings
In: Asian Conference on Computer Vision (ACCV), 2012.

Timely Object Recognition Inproceedings
In: Advances in Neural Information Processing Systems (NIPS), 2012.

Sparselet Models for Efficient Multiclass Object Detection Inproceedings
In: European Conference on Computer Vision (ECCV), 2012.

Recognizing Materials from Virtual Examples Inproceedings
In: European Conference on Computer Vision (ECCV), 2012.

RALF: A Reinforced Active Learning Formulation for Object Class Recognition Inproceedings
In: IEEE Computer Vision and Pattern Recognition (CVPR), 2012.
2011
Inproceedings

Parameterized Shape Models for Clothing Inproceedings
In: IEEE International Conference on Robotics and Automation (ICRA), 2011.

A Probabilistic Model for Recursive Factorized Image Features Inproceedings
In: IEEE Computer Vision and Pattern Recognition (CVPR), 2011.

Pick your Neighborhood -- Improving Labels and Neighborhood Structure for Label Propagation Inproceedings
In: Pattern Recognition, DAGM Symposium, 2011.

Improving the Kinect by Cross-Modal Stereo Inproceedings
In: British Machine Vision Conference (BMVC), 2011.

Practical 3-D Object Detection Using Category and Instance-level Appearance Models Inproceedings
In: IEEE International Conference on Intelligent Robots and Systems (IROS), 2011.

Perception for the Manipulation of Socks Inproceedings
In: IEEE International Conference on Intelligent Robots and Systems (IROS), 2011.

The NBNN kernel Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2011.

I spy with my little eye: Learning Optimal Filters for Cross-Modal Stereo under Projected Patterns Inproceedings
In: IEEE Workshop on Consumer Depth Cameras for Computer Vision, 2011.

Visual Grasp Affordances From Appearance-Based Cues Inproceedings
In: IEEE Workshop on Challenges and Opportunities in Robot Perception, 2011.

A Category-Level 3-D Object Dataset: Putting the Kinect to Work Inproceedings
In: IEEE Workshop on Consumer Depth Cameras for Computer Vision, 2011.
Masters Theses
Optimization Algorithms in the Reconstruction of MR Images: A Comparative Study Masters Thesis
Saarland University, 2011.
2010
Journal Articles

Tutor-based learning of visual categories using different levels of supervision Journal Article
In: Computer Vision and Image Understanding, vol. 114, pp. 564–573, 2010, ISSN: 1077-3142.

Classifying materials in the real world Journal Article
In: Image and Vision Computing, vol. 28, pp. 150–163, 2010, ISSN: 0262-8856.
Incollections

Categorical Perception Incollection
In: Cognitive Systems, Springer, 2010.

Multi-Modal Learning Incollection
In: Cognitive Systems, Springer, 2010.

Size Matters: Metric Visual Search Constraints from Monocular Metadata Incollection
In: Advances in Neural Information Processing Systems (NIPS), 2010.
Inproceedings

Adapting visual category models to new domains Inproceedings
In: European Conference on Computer Vision (ECCV), 2010.
2009
Incollections

Towards Integration of Different Paradigms in Modeling, Representation and Learning of Visual Categories Incollection
In: Object Categorization: Computer and Human Vision Perspectives, Cambridge University Press, 2009.
Inproceedings

Discriminative Structure Learning of Hierarchical Representations for Object Detection Inproceedings
In: IEEE Computer Vision and Pattern Recognition (CVPR), 2009.

An Additive Latent Feature Model for Transparent Object Recognition Inproceedings
In: Advances in Neural Information Processing Systems (NIPS), 2009, ((oral)).
PhD Theses

Modeling, Representing and Learning of Visual Categories PhD Thesis
TU Darmstadt, 2009.
Proceedings

Springer, vol. 5815, 2009, ISBN: 978-3-642-04666-7.
2008
Inproceedings

Discovery of activity patterns using topic models Inproceedings
In: International Conference on Ubiquitous computing (UbiComp), 2008.

Hierarchical Support Vector Random Fields: Joint Training to Combine Local and Global Features Inproceedings
In: European Conference on Computer Vision (ECCV), 2008.

Decomposition, Discovery and Detection of Visual Categories Using Topic Models Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
2007
Inproceedings

Towards Robust Pedestrian Detection in Crowded Image Sequences Inproceedings
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2007.

Cross-Modal Learning of Visual Categories using Different Levels of Supervision Inproceedings
In: International Conference on Vision Systems (ICVS), 2007.
2006
Inproceedings

The 2005 PASCAL Visual Object Classes Challenge Inproceedings
In: Selected Proceedings of the first PASCAL Challenges Workshop, 2006.

Towards Unsupervised Discovery of Visual Categories Inproceedings
In: Pattern Recognition, DAGM Symposium, 2006.
2005
Inproceedings

Integrating Representative and Discriminant Models for Object Category Detection Inproceedings
In: IEEE International Conference on Computer Vision (ICCV), 2005.
2004
Inproceedings

On the Significance of Real-World Conditions for Material Classification Inproceedings
In: European Conference on Computer Vision (ECCV), 2004.
Miscellaneous

Categorization by Local Information Using Support Vector Machines Miscellaneous
Master Thesis, 2004.
2002
Inproceedings

Object Tracking and Pose Estimation Using Light-Field Object Models Inproceedings
In: Vision, Modeling, and Visualization Conference (VMV), 2002.
Miscellaneous

3D Objektverfolgung mit Lichtfeldern (3D object tracking using light-fields) Miscellaneous
2002, (student thesis).