Publications

ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers

Mohsen Ghafoorian, Amir Habibian

arXiv preprint, 2026

Multi-Scale Speculative Decoding for Image Generation

Elia Peruzzo, Guillaume Sautiere, Amir Habibian

arXiv preprint, 2026

PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference

Denis Korzhenkov, Adil Karjauv, Animesh Karnewar, Mohsen Ghafoorian, Amir Habibian

arXiv preprint, 2026

ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving

Farhad G. Zanjani, Hong Cai, Amir Habibian

arXiv preprint, 2026

HLA: Hadamard Linear Attention

Hanno Ackermann, Hong Cai, Mohsen Ghafoorian, Amir Habibian

arXiv preprint, 2026

Neodragon: Mobile Video Generation using Diffusion Transformer

Animesh Karnewar, Denis Korzhenkov, Ioannis Lelekas, Noor Fathima, Adil Karjauv, Mohsen Ghafoorian, Amir Habibian

arXiv preprint, 2025

Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer

Mohsen Ghafoorian, Denis Korzhenkov, Amir Habibian

arXiv preprint, 2025

MoAlign: Motion-Centric Representation Alignment for Video Diffusion Models

Aritra Bhowmik, Denis Korzhenkov, Cees G. M. Snoek, Amir Habibian, Mohsen Ghafoorian

arXiv preprint, 2025

Hybrid Gaussian Splatting for Novel Urban View Synthesis

Mohamed Omran, Farhad Zanjani, Davide Abati, Jens Petersen, Amir Habibian

ICCV Workshop, 2025 (Challenge Winner)

Mobile Video Diffusion

Haitam Ben Yahia, Denis Korzhenkov, Ioannis Lelekas, Amir Ghodrati, Amir Habibian

ICCV, 2025

ADAPTOR: Adaptive Token Reduction for Video Diffusion Transformers

Elia Peruzzo, Adil Karjauv, Nicu Sebe, Amir Ghodrati, Amir Habibian

CVPR Workshop, 2025
PDF

Generative Location Modeling for Spatially Aware Object Insertion

Jooyeol Yun, Davide Abati, Mohamed Omran, Jaegul Choo, Amir Habibian, Auke Wiggers

CVPR Workshop, 2025

Gaussian Splatting is an Effective Data Generator for 3D Object Detection

Farhad G. Zanjani, Davide Abati, Auke Wiggers, Dimitris Kalatzis, Jens Petersen, Hong Cai, Amir Habibian

arXiv preprint, 2025
PDF

Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection

Jens Petersen, Davide Abati, Amir Habibian, Auke Wiggers

arXiv preprint, 2025
PDF

Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Mohamed Omran, Dimitris Kalatzis, Jens Petersen, Amir Habibian, Auke Wiggers

arXiv preprint, 2025
PDF

MoViE: Mobile Diffusion for Video Editing

Adil Karjauv, Noor Fathima, Ioannis Lelekas, Fatih Porikli, Amir Ghodrati, Amir Habibian

arXiv preprint, 2024

Object-Centric Diffusion for Efficient Video Editing

Kumara Kahatapitiya, Adil Karjauv, Davide Abati, Fatih Porikli, Yuki M. Asano, Amir Habibian

ECCV, 2024

Clockwork Diffusion: Efficient Generation With Model-Step Distillation

Amir Habibian, Amir Ghodrati, Noor Fathima, Guillaume Sautiere, Risheek Garrepalli, Fatih Porikli, Jens Petersen

CVPR, 2024 (Highlight)

VaLID: Variable-Length Input Diffusion for Novel View Synthesis

Shijie Li, Farhad G. Zanjani, Haitam Ben Yahia, Yuki M. Asano, Juergen Gall, Amir Habibian

WACV, 2023
PDF

ResQ: Residual Quantization for Video Perception

Davide Abati, Haitam Ben Yahia, Markus Nagel, Amir Habibian

ICCV, 2023
PDF

Skip-Attention: Improving Vision Transformers by Paying Less Attention

Shashanka Venkataramanan, Amir Ghodrati, Yuki M. Asano, Fatih Porikli, Amir Habibian

ICLR, 2023
PDF

SALISA: Saliency-Based Input Sampling for Efficient Video Object Detection

Babak Ehteshami Bejnordi, Amir Habibian, Fatih Porikli, Amir Ghodrati

ECCV, 2022
PDF

Delta Distillation for Efficient Video Processing

Amir Habibian, Haitam Ben Yahia, Davide Abati, Efstratios Gavves, Fatih Porikli

ECCV, 2022

Simple and Efficient Architectures for Semantic Segmentation

Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amir Habibian, Tijmen Blankevoort

CVPR workshop, 2022

Region-of-Interest based Neural Video Compression

Yura Perugachi-Diaz, Guillaume Sautiere, Davide Abati, Yang Yang, Amir Habibian, Taco S. Cohen

BMVC, 2022
PDF

Frame-Exit: Conditional Early Exiting for Efficient Video Recognition

Amir Ghodrati, Babak Ehteshami Bejnordi, Amir Habibian

CVPR, 2021 (Oral)

Skip-Convolutions for Efficient Video Processing

Amir Habibian, Davide Abati, Taco S. Cohen, Babak Ehteshami Bejnordi

CVPR, 2021

Conditional Model Selection for Efficient Video Understanding

Mihir Jain, Haitam Ben Yahia, Amir Ghodrati, Fatih Porikli, Amir Habibian

BMVC, 2021
PDF

Efficient Video Super Resolution by Gated Local Self Attention

Davide Abati, Amir Ghodrati, Amir Habibian

BMVC, 2021
PDF

Spatio-Temporal Gated Transformers for Efficient Video Processing

Yawei Li, Babak Ehteshami Bejnordi, Bert Moons, Tijmen Blankevoort, Amir Habibian, Radu Timofte, Luc V Gool

NeurIPS workshop, 2021

Learning Variations in Human Motion via Mix-and-Match Perturbation

Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Petersson, Stephen Gould, Amir Habibian

CVPR, 2020
PDF

Adversarial Distortion for Learned Video Compression

Vijay Veerabadran, Reza Pourreza, Amir Habibian, Taco S. Cohen

CVPR Workshop, 2020
PDF

Video Compression with Rate-Distortion Autoencoders

Amir Habibian, Ties van Rozendaal, Jakub M. Tomczak, Taco S. Cohen

ICCV, 2019
PDF

Recognizing Compressed Videos: Challenges and Promises

Reza Pourreza, Amir Ghodrati, Amir Habibian

ICCV Workshop, 2019
PDF

Video2vec Embeddings Recognize Events when Examples are Scarce

Amir Habibian, Thomas Mensink, Cees G. M. Snoek

T-PAMI, 2016
PDF

Discovering Semantic Vocabularies for Cross-Media Retrieval

Amir Habibian, Thomas Mensink, Cees G. M. Snoek

ICMR, 2015
PDF

Videostory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events

Amir Habibian, Thomas Mensink, Cees G. M. Snoek

ACM MM, 2014 (Best Paper Award)
PDF

Recommendations for recognizing video events by concept vocabularies

Amir Habibian, Cees G. M. Snoek

CVIU, 2014
PDF

Composite Concept Discovery for Zero-shot Video Event Detection

Amir Habibian, Thomas Mensink, Cees G. M. Snoek

ICMR, 2014
PDF

Recommendations for Video Event Recognition using Concept Vocabularies

Amir Habibian, Koen E. A. van de Sande, Cees G. M. Snoek

ICMR, 2013
PDF