Amir Zamir @ Swiss Federal Institute of Technology EPFL (2025)

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities,
Roman Bachmann*, Oğuzhan Fatih Kar*, David Mizrahi*, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
In NeurIPS, 2024
[Website | Demo | Code]

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Morphology,
Andrei Atanov*, Jiawei Fu*, Rishubh Singh*, Isabella Yu, Andrew Spielberg, Amir Zamir,
In ECCV, 2024
[Website]

ViPer: Visual Personalization of Generative Models via Individual Preference Learning,
Sogand Salehi, Mahdi Shafiei, Roman Bachmann, Teresa Yeo, Amir Zamir,
In ECCV, 2024
[Website | Demo]

BRAVE: Broadening the visual encoding of vision-language models,
Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari,
In ECCV, 2024
[Website]

Unraveling the Key Components of OOD Generalization via Diversification,
Harold Benoit*, Liangze Jiang*, Andrei Atanov*, Oğuzhan Fatih Kar, Mattia Rigotti, Amir Zamir
In ICLR, 2024
[Paper]

4M: Massively Multimodal Masked Modeling,
David Mizrahi*, Roman Bachmann*, Oguzhan Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir
In NeurIPS, 2023 - [Spotlight]
[Website | Demo | Code]

Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback,
Teresa Yeo, Oğuzhan Fatih Kar, Zahra Sodagar, Amir Zamir
In ICCV, 2023
[ Paper | Website]

Modality-invariant Visual Odometry for Embodied Navigation,
Marius Memmel, Roman Bachmann, Amir Zamir
In CVPR, 2023
[ Paper | Code | Website]

MultiMAE: Multi-modal Multi-task Masked Autoencoders,
Roman Bachmann*, David Mizrahi*, Andrei Atanov, Amir Zamir
In ECCV, 2022
[Interactive Visualizations | Live Demo | Paper | Code | Website]

Task Discovery: Finding the Tasks that Neural Networks Generalize on,
Andrei Atanov, Andrey Filatov, Teresa Yeo, Ajay Sohmshetty, Amir Zamir
In NeurIPS, 2022
[Interactive Visualizations | Paper | Code | Website]

PALMER: Perception-Action Loop with Memory Reorganization for Planning,
Onur Beker, Mohammad Mohammadi, Amir Zamir
In NeurIPS, 2022
[Website | Paper]

CLIPasso: Semantically-Aware Object Sketching,
Yael Vinker, Ehsan Pajouheshgar, Jessica Y. Bo, Roman Bachmann, Amit Haim Bermano, Daniel Cohen-Or, Amir Zamir, Ariel Shamir
In Transactions on Graphics (Proceedings of SIGGRAPH), 2022
[Best Paper Award]
[Website | Collab | Code]

3D Common Corruptions and Data Augmentation,
Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir
In CVPR, 2022 - [Oral]
[Website | Live Demo | Paper | Code]

Robustness via Cross-Domain Ensembles,
Teresa Yeo*, Oğuzhan Fatih Kar*, Alexander Sax, Amir Zamir
In ICCV, 2021 - [Oral]
In Arxiv, 2021
[Website | Paper | Code]

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans,
Ainaz Eftekhar*, Alexander Sax*, Roman Bachmann, Jitendra Malik, Amir Zamir
In Arxiv 2021, ICCV 2021
[Live Demo | Dataset | Code | Website | Paper]

Robust Learning Through Cross-Task Consistency,
Amir Zamir*, Alexander Sax*, Teresa Yeo, Oğuzhan Kar, Nikhil Cheerla, Rohan Suri, Zhangjie Cao, Jitendra Malik, Leonidas Guibas
In CVPR, 2020 - [Best Paper Award Nominee],[Oral]
In Arxiv, 2020
[Live Demo | Visulizations | Website | Paper]

Which Tasks Should Be Learned Together in Multi-task Learning?,
Trevor Standley, Amir Zamir, Dawn Chen, Leonidas Guibas, Jitendra Malik, Silvio Savarese
In ICML, 2020
[Website | Paper]

Side-Tuning: Network Adaptation via Additive Side Networks,
Jeffrey Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik
In ECCV, 2020 - [Spotlight]
[Website | Paper]

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation,
Bryan Chen, Sasha Sax, Lerrel Pinto, Francis Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik
In Conference on Robot Learning (CoRL), 2020
[Website | Paper]

Learning to Navigate Using Mid-level Visual Priors,
Alexander Sax, Jeffery Zhang, Bradley Emi, Amir Zamir, Leonidas Guibas, Silvio Savarese, Jitendra Malik
In Conference on Robot Learning (CoRL), 2019
[Policy Visulizations | Website | Paper]

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera,
Iro Armeni, Zhiyang He, JunYoung Gwak, Amir Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese,
In ICCV, 2019
[Interactive Database Visualization | Website | Paper]

Taskonomy: Disentangling Task Transfer Learning,
Amir Zamir, Alexander Sax*, William Shen*, Leonidas Guibas, Jitendra Malik, Silvio Savarese,
In CVPR, 2018 [Best Paper Award]
In IJCAI, 2019 [Invited Paper, Sister Conference Best Papers Track]
[Transfer Learning API | Live Demo | Website | Paper]

Gibson Env: Real-World Perception for Embodied Agents,
Amir Zamir*, Fei Xia*, Jerry He*, Alexander Sax, Jitendra Malik, Silvio Savarese,
In CVPR, 2018 - [Spotlight Oral],[NVIDIA Pioneering Research Award]
[Gibson Environments | Github | Website | Paper]

Patent: Systems and Methods for Performing Three-Dimensional Semantic Parsing of Indoor Spaces,
Iro Armeni, Ozan Sener, Amir Zamir, Martin Fischer, Silvio Savarese,
US Patent App. 5/619,422, 2017.
[Link]

Feedback Networks,
Amir Zamir*, Te-Lin Wu*, Lin Sun, William B. Shen, Bertram Shi, Jitendra Malik, Silvio Savarese,
In CVPR, 2017.
[PDF | Project Page]

Generic 3D Representation via Pose Estimation and Matching,
Amir Zamir, Pulkit Agrawal, Tilman Wekel, Jitendra Malik, Silvio Savarese,
In ECCV, 2016.
[PDF | 3DRepresentation website | Dataset]

Structural-RNN: Deep Leaning on Spatio-Temporal Graphs,Ashesh Jain, Amir Zamir, Silvio Savarese, Ashutosh Saxena,
In CVPR, 2016 [Best Student Paper Award]
[PDF | Project Page ]

3D Semantic Parsing of Large-Scale Indoor Spaces ,Iro Armeni, Ozan Sener, Amir Zamir, Martin Fischer, Silvio Savarese,
In CVPR, 2016 - [Oral] (acceptance rate ~3%)
[PDF | 3D PC Parser website (Demo, Code, Data)]

• Book: Large-Scale Visual Geo-Localization,
Amir Zamir, Asaad Hakeem, Luc Van Gool, Mubarak Shah, Richard Szeliski,
Springer, 2016[Front Matter | Cover | Springer Page]

The THUMOS Challenge on Action Recognition for Videos "in the Wild",Haroon Idrees, Amir Zamir, Yu-Gang Jiang, Alex Gorban, Ivan Laptev, Rahul Sukthankar, Mubarak Shah,
In Computer Vision and Image Understanding (CVIU), 2016[PDF | Project Page ]

Unsupervised Semantic Parsing of Video Collections,Ozan Sener, Amir Zamir, Silvio Savarese, Ashutosh Saxena,
In Proceedings of International Conference on Computer Vision (ICCV), 2015[PDF | Project Page ]

Action Recognition by Hierarchical Mid-level Action Elements,Tian Lan, Yuke Zhu, Amir Zamir, Silvio Savarese,
In Proceedings of International Conference on Computer Vision (ICCV), 2015[PDF | Project Page | 1 min Summary]

DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition,Rui Hou, Amir Zamir, Rahul Sukthankar, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2014 [PDF | BibTeX | Project Page | 1 min Summary]

GIS-Assisted Object Detection and Geospatial Localization,Shervin Ardeshir, Amir Zamir, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2014 [PDF | BibTeX | Project Page | 1 min Summary]

GPS-Tag Refinement using Random Walks with an Adaptive Damping Factor,Amir Zamir, Shervin Ardeshir, Mubarak Shah,
in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.[PDF | 1 min Summary | 20 min Presentation | BibTeX | Project Page]

Video Classification using Semantic Concept Co-occurrences,Shayan Modiri, Amir Zamir, Mubarak Shah,
in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2014. [PDF | 1 min Summary | BibTeX | Project Page]

Invited Book Chapter: "Action Recognition in Realistic Sports Videos",Khurram Soomro, Amir Zamir,
in Computer Vision in Sports, Springer, 2014. [PDF | BibTeX ]

Image Geo-localization Based on Multiple Nearest Neighbor Feature Matching using Generalized Graphs,Amir Zamir, Mubarak Shah,
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2014 [Preprint PDF | BibTeX | Web Page]

Visual Business Recognition - A Multimodal Approach,Amir Zamir, Afshin Dehghan, Mubarak Shah,
In Proceeding of ACM International Conference on Multimedia (ACM MM), 2013 [PDF | Video | BibTeX | Project Page]

GMCP-Tracker: Global Multi-object Tracking using Generalized Minimum Clique Graphs,Amir Zamir, Afshin Dehghan, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2012 [PDF | Project Page | 20 min Presentation | BibTeX ]

City Scale Geo-spatial Trajectory Estimation of a Moving Camera,Gonzalo Vaca, Amir Zamir, Mubarak Shah,
in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2012 [PDF | BibTeX | Project Page]

Accurate Image Localization Based on Google Maps Street View,Amir Zamir, Mubarak Shah,
In Proceedings of European Conference on Computer Vision (ECCV), 2010 [PDF | BibTeX | Project Page]

Recognition of 101 human actions from videos in the wild,Khurram Soomro, Amir Zamir, Mubarak Shah,
In arXiv preprint arXiv:1212.0402, November, 2012. [PDF | BibTeX | Project Page | PDF2]

Automatic Detection and Tracking of Pedestrians in Videos with Various Crowd Densities,Afshin Dehghan, Haroon Idrees, Amir Zamir, Mubarak Shah,
In Proceedings of PED, June 2012 [PDF | BibTeX | Project Page]

Street View Challenge: Identification of Commercial Entities in Street View Imagery,Amir Zamir, Alexander Darino, Ryan Patrick, Mubarak Shah,
In Proceedings of ICMLA, 2011

Amir Zamir @ Swiss Federal Institute of Technology EPFL (2025)
Top Articles
Latest Posts
Recommended Articles
Article information

Author: Van Hayes

Last Updated:

Views: 6777

Rating: 4.6 / 5 (46 voted)

Reviews: 85% of readers found this page helpful

Author information

Name: Van Hayes

Birthday: 1994-06-07

Address: 2004 Kling Rapid, New Destiny, MT 64658-2367

Phone: +512425013758

Job: National Farming Director

Hobby: Reading, Polo, Genealogy, amateur radio, Scouting, Stand-up comedy, Cryptography

Introduction: My name is Van Hayes, I am a thankful, friendly, smiling, calm, powerful, fine, enthusiastic person who loves writing and wants to share my knowledge and understanding with you.