• 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities,
Roman Bachmann*, Oğuzhan Fatih Kar*, David Mizrahi*, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
In NeurIPS, 2024
[Website | Demo | Code]
• How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Morphology,
Andrei Atanov*, Jiawei Fu*, Rishubh Singh*, Isabella Yu, Andrew Spielberg, Amir Zamir,
In ECCV, 2024
[Website]
• ViPer: Visual Personalization of Generative Models via Individual Preference Learning,
Sogand Salehi, Mahdi Shafiei, Roman Bachmann, Teresa Yeo, Amir Zamir,
In ECCV, 2024
[Website | Demo]
• BRAVE: Broadening the visual encoding of vision-language models,
Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari,
In ECCV, 2024
[Website]
• Unraveling the Key Components of OOD Generalization via Diversification,
Harold Benoit*, Liangze Jiang*, Andrei Atanov*, Oğuzhan Fatih Kar, Mattia Rigotti, Amir Zamir
In ICLR, 2024
[Paper]
• 4M: Massively Multimodal Masked Modeling,
David Mizrahi*, Roman Bachmann*, Oğuzhan Fatih Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir
In NeurIPS, 2023 - [Spotlight]
[Website | Demo | Code]
• Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback,
Teresa Yeo, Oğuzhan Fatih Kar, Zahra Sodagar, Amir Zamir
In ICCV, 2023
[Paper | Website]
• Modality-invariant Visual Odometry for Embodied Navigation,
Marius Memmel, Roman Bachmann, Amir Zamir
In CVPR, 2023
[Paper | Code | Website]
• MultiMAE: Multi-modal Multi-task Masked Autoencoders,
Roman Bachmann*, David Mizrahi*, Andrei Atanov, Amir Zamir
In ECCV, 2022
[Interactive Visualizations | Live Demo | Paper | Code | Website]
• Task Discovery: Finding the Tasks that Neural Networks Generalize on,
Andrei Atanov, Andrey Filatov, Teresa Yeo, Ajay Sohmshetty, Amir Zamir
In NeurIPS, 2022
[Interactive Visualizations | Paper | Code | Website]
• PALMER: Perception-Action Loop with Memory Reorganization for Planning,
Onur Beker, Mohammad Mohammadi, Amir Zamir
In NeurIPS, 2022
[Website | Paper]
• CLIPasso: Semantically-Aware Object Sketching,
Yael Vinker, Ehsan Pajouheshgar, Jessica Y. Bo, Roman Bachmann, Amit Haim Bermano, Daniel Cohen-Or, Amir Zamir, Ariel Shamir
In Transactions on Graphics (Proceedings of SIGGRAPH), 2022
[Best Paper Award]
[Website | Colab | Code]
• 3D Common Corruptions and Data Augmentation,
Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir
In CVPR, 2022 - [Oral]
[Website | Live Demo | Paper | Code]
• Robustness via Cross-Domain Ensembles,
Teresa Yeo*, Oğuzhan Fatih Kar*, Alexander Sax, Amir Zamir
In ICCV, 2021 - [Oral]
In arXiv, 2021
[Website | Paper | Code]
• Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans,
Ainaz Eftekhar*, Alexander Sax*, Roman Bachmann, Jitendra Malik, Amir Zamir
In ICCV, 2021
In arXiv, 2021
[Live Demo | Dataset | Code | Website | Paper]
• Robust Learning Through Cross-Task Consistency,
Amir Zamir*, Alexander Sax*, Teresa Yeo, Oğuzhan Kar, Nikhil Cheerla, Rohan Suri, Zhangjie Cao, Jitendra Malik, Leonidas Guibas
In CVPR, 2020 - [Best Paper Award Nominee], [Oral]
In arXiv, 2020
[Live Demo | Visualizations | Website | Paper]
• Which Tasks Should Be Learned Together in Multi-task Learning?,
Trevor Standley, Amir Zamir, Dawn Chen, Leonidas Guibas, Jitendra Malik, Silvio Savarese
In ICML, 2020
[Website | Paper]
• Side-Tuning: Network Adaptation via Additive Side Networks,
Jeffrey Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik
In ECCV, 2020 - [Spotlight]
[Website | Paper]
• Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation,
Bryan Chen, Sasha Sax, Lerrel Pinto, Francis Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik
In Conference on Robot Learning (CoRL), 2020
[Website | Paper]
• Learning to Navigate Using Mid-level Visual Priors,
Alexander Sax, Jeffrey Zhang, Bradley Emi, Amir Zamir, Leonidas Guibas, Silvio Savarese, Jitendra Malik
In Conference on Robot Learning (CoRL), 2019
[Policy Visualizations | Website | Paper]
• 3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera,
Iro Armeni, Zhiyang He, JunYoung Gwak, Amir Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese,
In ICCV, 2019
[Interactive Database Visualization | Website | Paper]
• Taskonomy: Disentangling Task Transfer Learning,
Amir Zamir, Alexander Sax*, William Shen*, Leonidas Guibas, Jitendra Malik, Silvio Savarese,
In CVPR, 2018 [Best Paper Award]
In IJCAI, 2019 [Invited Paper, Sister Conference Best Papers Track]
[Transfer Learning API | Live Demo | Website | Paper]
• Gibson Env: Real-World Perception for Embodied Agents,
Amir Zamir*, Fei Xia*, Jerry He*, Alexander Sax, Jitendra Malik, Silvio Savarese,
In CVPR, 2018 - [Spotlight Oral], [NVIDIA Pioneering Research Award]
[Gibson Environments | Github | Website | Paper]
• Patent: Systems and Methods for Performing Three-Dimensional Semantic Parsing of Indoor Spaces,
Iro Armeni, Ozan Sener, Amir Zamir, Martin Fischer, Silvio Savarese,
US Patent App. 5/619,422, 2017.
[Link]
• Feedback Networks,
Amir Zamir*, Te-Lin Wu*, Lin Sun, William B. Shen, Bertram Shi, Jitendra Malik, Silvio Savarese,
In CVPR, 2017.
[PDF | Project Page]
• Generic 3D Representation via Pose Estimation and Matching,
Amir Zamir, Pulkit Agrawal, Tilman Wekel, Jitendra Malik, Silvio Savarese,
In ECCV, 2016.
[PDF | 3DRepresentation website | Dataset]
• Structural-RNN: Deep Learning on Spatio-Temporal Graphs,
Ashesh Jain, Amir Zamir, Silvio Savarese, Ashutosh Saxena,
In CVPR, 2016 [Best Student Paper Award]
[PDF | Project Page]
• 3D Semantic Parsing of Large-Scale Indoor Spaces,
Iro Armeni, Ozan Sener, Amir Zamir, Martin Fischer, Silvio Savarese,
In CVPR, 2016 - [Oral] (acceptance rate ~3%)
[PDF | 3D PC Parser website (Demo, Code, Data)]
• Book: Large-Scale Visual Geo-Localization,
Amir Zamir, Asaad Hakeem, Luc Van Gool, Mubarak Shah, Richard Szeliski,
Springer, 2016
[Front Matter | Cover | Springer Page]
• The THUMOS Challenge on Action Recognition for Videos "in the Wild",
Haroon Idrees, Amir Zamir, Yu-Gang Jiang, Alex Gorban, Ivan Laptev, Rahul Sukthankar, Mubarak Shah,
In Computer Vision and Image Understanding (CVIU), 2016
[PDF | Project Page]
• Unsupervised Semantic Parsing of Video Collections,
Ozan Sener, Amir Zamir, Silvio Savarese, Ashutosh Saxena,
In Proceedings of International Conference on Computer Vision (ICCV), 2015
[PDF | Project Page]
• Action Recognition by Hierarchical Mid-level Action Elements,
Tian Lan, Yuke Zhu, Amir Zamir, Silvio Savarese,
In Proceedings of International Conference on Computer Vision (ICCV), 2015
[PDF | Project Page | 1 min Summary]
• DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition,
Rui Hou, Amir Zamir, Rahul Sukthankar, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2014
[PDF | BibTeX | Project Page | 1 min Summary]
• GIS-Assisted Object Detection and Geospatial Localization,
Shervin Ardeshir, Amir Zamir, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2014
[PDF | BibTeX | Project Page | 1 min Summary]
• GPS-Tag Refinement using Random Walks with an Adaptive Damping Factor,
Amir Zamir, Shervin Ardeshir, Mubarak Shah,
In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.
[PDF | 1 min Summary | 20 min Presentation | BibTeX | Project Page]
• Video Classification using Semantic Concept Co-occurrences,
Shayan Modiri, Amir Zamir, Mubarak Shah,
In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.
[PDF | 1 min Summary | BibTeX | Project Page]
• Invited Book Chapter: "Action Recognition in Realistic Sports Videos",
Khurram Soomro, Amir Zamir,
In Computer Vision in Sports, Springer, 2014.
[PDF | BibTeX]
• Image Geo-localization Based on Multiple Nearest Neighbor Feature Matching using Generalized Graphs,
Amir Zamir, Mubarak Shah,
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2014
[Preprint PDF | BibTeX | Web Page]
• Visual Business Recognition - A Multimodal Approach,
Amir Zamir, Afshin Dehghan, Mubarak Shah,
In Proceedings of ACM International Conference on Multimedia (ACM MM), 2013
[PDF | Video | BibTeX | Project Page]
• GMCP-Tracker: Global Multi-object Tracking using Generalized Minimum Clique Graphs,
Amir Zamir, Afshin Dehghan, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2012
[PDF | Project Page | 20 min Presentation | BibTeX]
• City Scale Geo-spatial Trajectory Estimation of a Moving Camera,
Gonzalo Vaca, Amir Zamir, Mubarak Shah,
In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2012
[PDF | BibTeX | Project Page]
• Accurate Image Localization Based on Google Maps Street View,
Amir Zamir, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2010
[PDF | BibTeX | Project Page]
• Recognition of 101 human actions from videos in the wild,
Khurram Soomro, Amir Zamir, Mubarak Shah,
In arXiv preprint arXiv:1212.0402, November, 2012.
[PDF | BibTeX | Project Page | PDF2]
• Automatic Detection and Tracking of Pedestrians in Videos with Various Crowd Densities,
Afshin Dehghan, Haroon Idrees, Amir Zamir, Mubarak Shah,
In Proceedings of PED, June 2012
[PDF | BibTeX | Project Page]
• Street View Challenge: Identification of Commercial Entities in Street View Imagery,
Amir Zamir, Alexander Darino, Ryan Patrick, Mubarak Shah,
In Proceedings of ICMLA, 2011