Recent News
[02/2023]: 1 paper at CVPR 2023
[07/2022]: 1 paper at ECCV 2022
[07/2021]: 1 paper at ICCV 2021
[10/2019]: 1 paper at Information Fusion
[07/2019]: 1 paper at Pattern Recognition
[03/2019]: 1 paper at CVPR 2019
[05/2019]: 1 paper at ICIP 2019
[05/2019]: 1 paper at Neural Computing and Application
[08/2018]: 1 paper at Pattern Recognition
[04/2018]: 3 papers at ICPR 2018
Education
| | | | | | University of Edinburgh, UK
PhD at School of Informatics
May 2023 - Present
|
| | | | | | Jadavpur University, India
Bachelor's in Electrical Engineering
July 2016 - May 2020
|
Research Experiences
| | | | | | MBZUAI, Abu Dhabi
Research Assistant.
Nov 2020 - Apr 2023
|
| | | | | | University of Manitoba, Winnipeg, Canada
MITACS Globallink Internship, 2019.
May 2019 - August 2019
|
| | | | | | Robert Bosch, Bangalore, India
Computer Vision Lab, RTC Department
June 2018 - July 2018 |
| | | | | | Indian Institute of Technology Roorkee, India
Under Prof. Partha Pratim Roy
June 2017 - May 2020
|
I'm a PhD student in the School of Informatics at the University of Edinburgh. I work with Dr. Hakan Bilen in the Visual Computing Group.
Previously, I was a full-time research assistant at MBZUAI, Abu Dhabi, where I worked on computer vision and machine learning under Dr Fahad Shahbaz Khan and Dr Salman Khan. I mostly work on generative models and visual perception models from a small amount of data. Previously, I have worked with Dr Yang Wang at the University of Manitoba, Canada and Dr Partha Roy at IIT Roorkee, India. I have done my bachalors from Jadavpur University, Kolkata in Electrical Engineering.
- Publications
- Research
- Contact
-
|
Person Image Synthesis via Denoising Diffusion Model
Ankan Kumar Bhunia,
Salman Khan,
Hisham Cholakkal,
Rao Anwer,
Jorma Laaksonen,
Mubarak Shah,
Fahad Khan,
CVPR, 2023
|
|
DoodleFormer: Creative Sketch Drawing with Transformers
Ankan Kumar Bhunia,
Salman Khan,
Hisham Cholakkal,
Rao Anwer,
Fahad Khan,
Jorma Laaksonen,
Michael Felsberg,
ECCV, 2022
|
|
Handwriting Transformers
Ankan Kumar Bhunia,
Salman Khan,
Hisham Cholakkal,
Rao Anwer,
Fahad Khan,
Mubarak Shah,
ICCV, 2021
|
|
Handwriting Recognition in Low-resource Scripts using Adversarial Learning
Ayan Kumar Bhunia,
Abhirup Das,
Ankan Kumar Bhunia,
Sairaj Kishore,
Partha Pratim Roy,
CVPR, 2019
|
|
Improving Document Binarization via Adversarial Noise-Texture Augmentation
Ankan Kumar Bhunia,
Ayan Kumar Bhunia,
Aneeshan Sain,
Partha Pratim Roy,
ICIP, 2019
|
|
A Deep One-Shot Network for Query-based Logo Retrieval
Ayan Kumar Bhunia,
Ankan Kumar Bhunia,
Shuvozit Ghose,
Partha Pratim Roy,
Umapada Pal,
Pattern Recognition, 2019
|
|
Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network
Ankan Kumar Bhunia,
Aishik Konwer,
Abir Bhowmik,
Ayan Kumar Bhunia,
Partha Pratim Roy,
Pattern Recognition, 2019
|
|
Signature Verification Approach using Fusion of Hybrid Texture Features
Ankan Kumar Bhunia,
Alireza Alaei,
Partha Pratim Roy,
Neural computing and Applications, 2019
|
|
Word Level Font-to-Font Image Translation using Convolutional Recurrent Generative Adversarial Networks
Ankan Kumar Bhunia,
Ayan Kumar Bhunia,
Prithaj Banerjee,
Aishik Konwer,
Abir Bhowmik,
Partha Pratim Roy,
Umapada Pal,
ICPR, 2018
|
|
Staff line Removal using Generative Adversarial Networks
Aishik Konwer,
Ayan Kumar Bhunia,
Abir Bhowmik,
Ankan Kumar Bhunia,
Prithaj Banerjee,
Partha Pratim Roy,
Umapada Pal,
ICPR, 2018
|
|
Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network
Ayan Kumar Bhunia,
Abir Bhowmik,
Ankan Kumar Bhunia,
Aishik Konwer,
Prithaj Banerjee,
Partha Pratim Roy,
Umapada Pal,
ICPR, 2018
|
Title: Person Image Synthesis via Denoising Diffusion Model (CVPR'23) [ Paper / GitHub / Project / Demo]
The Pose-guided person image synthesis task aims to render a person’s image with a desired pose and appearance. Specifically, the appearance is defined by a given source image and the pose by a set of keypoints. Having control over the synthesized person images in terms of pose and style is an important requisite for applications such as e-commerce, virtual reality, metaverse and content generation for the entertainment industry. Read our paper here.
Title: Generative Multiplane Neural Radiance for 3D-Aware Image Generation [ Paper / GitHub]
We introduced an approach, named GMNR, that focuses at efficiently generating 3D-aware high-resolution images that are view-consistent across multiple camera pose. Qualitative and quantitative experiments on three datasets
demonstrate the merits of our contributions, leading to favorable performance in terms of image generation quality
and computational efficiency, compared to existing works.
Title: Handwriting Transformers (ICCV'21) [ Paper / GitHub / Demo]
Automatic handwritten text generation can be beneficial for people having disabilities or injuries that prevent them from writing, adapting an author's writing style or gathering additional data for training deep learning-based handwritten text recognition models. Here, we investigate the problem of realistic handwritten text generation of text sequences with arbitrary length and diverse calligraphic attributes representing writing styles of a writer. For more visit our project page here.
Title: Doodleformer: Creative sketch drawing with transformers (ECCV'22) [ Paper / GitHub]
Humans have an outstanding ability to easily communicate and express abstract ideas
and emotions through sketch drawings. Creative sketching or doodling is an expressive activity, where imaginative and previously unseen depictions of everyday visual objects are drawn. Creative sketch image generation is a challenging vision problem, where the task is to generate diverse, yet realistic creative sketches possessing the unseen composition of the visual-world objects. Please visit our project page here.
❮
❯
-
The Bayes Centre, Edinburgh EH8 9BT
Email: ankankumarbhunia@gmail.com
Phone: +44-7770723790
Whatsapp: +91-9038858890
|
|