Munawar Hayat

Principal AI Researcher, Qualcomm. Prev ARC DECRA Fellow

Monash University

Biography

I lead the Multimodal AI team at Qualcomm AI Research. Previously, I was a Professor and ARC DECRA Fellow at Monash University. My research spans computer vision, generative AI, and multimodal learning, bridging fundamental research and real-world applications.

Interests

Computer Vision
Generative AI
Machine Learning

Education

PhD in Computer Science, 2015
The University of Western Austraia
Masters in Space Science, 2011
Luleå Tekniska Universitet
BSc in Engineering, 2009
National University of Sciences & Technology

News

I received Dean’s Award for Excellence in Research by an Early Career Academic at FIT, Monash University.
Checkout Restormer: Efficient Transformer for High-Resolution Image Restoration in CVPR 2022.
See our paper Semantic-Aware Domain Generalized Segmentation in CVPR 2022.
Checkout Learning Enriched Features for Fast Image Restoration and Enhancement in IEEE TPAMI.
Checkout Towards Robust and Reproducible Active Learning Using Neural Networks in CVPR 2022.
Awarded ARC DECRA Fellowship 2021-2023 $425,613.
Received funding from Australian Reserch Council on a Discovery Project 2019-2021 $380,000.
See our recent work Deeply Supervised Discriminative Learning for Adversarial Defense in IEEE TPAMI.
See our paper Learning Enriched Features for Real Image Restoration and Enhancement in ECCV 2020.
See our paper titled A Self-supervised Approach for Adversarial Robustness in CVPR 2020.
See our paper titled CycleISP: Real Image Restoration via Improved Data Synthesis in CVPR 2020.
See our paper titled iTAML: An Incremental Task-Agnostic Meta-learning Approach in CVPR 2020.
Checkout Random path selection for continual learning in NeurIPS 2019.
2 Papers accepted in ICCV 2019.
2 Papers accepted in CVPR 2019.

Featured Publications

S. Borse, S. Choi, S. Park, J. Kim, S. Kadambi, R. Garrepalli, S. Yun, M. Hayat, F. Porikli

December 2025 Advances in Neural Information Processing Systems (NeurIPS 2025)

MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans

Generation of images containing multiple humans, performing complex actions, while preserving their facial identities, is a significant challenge. MultiHuman-Testbench is a novel benchmark comprising 1,800 samples with carefully curated text prompts matched with 5,550 unique human face images, sampled uniformly to ensure diversity across age, ethnic background, and gender. This benchmark enables comprehensive evaluation of multi-human image generation models.

PDF Code

J. Lee, D. Das, M. Hayat, S. Choi, K. Hwang, F. Porikli

June 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)

CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation

CustomKD proposes a novel knowledge distillation approach that effectively leverages large vision foundation models (LVFMs) to enhance the performance of edge models (e.g., MobileNetV3). CustomKD customizes the well-generalized features inherent in LVFMs to a given student model to reduce model discrepancies, achieving state-of-the-art performances in unsupervised domain adaptation and semi-supervised learning scenarios.