Computer Vision Group

Objective

The Computer Vision Group is focused on the research and application of computer vision and image/video processing. It aims to deploy cutting-edge methodologies in various areas of computer vision by conducting research in both the theoretical and application aspects of computer vision. The research focuses on following areas:

  • Computer Vision and Deep Learning
  • Biometric Recognition and Medical Image Processing
  • Content based Image Retrieval
  • Image to Image Transformation
  • Image/Video/View Inpainting
  • Depth Estimation from Multiple Views
  • Virtual/Augmented Reality

The computer vision group at IIIT Sri City actively looks for the sponsored projects funded by the Govt. Bodies and Industries. Some typical problems being solved are content based image retrieval, image-to-image transformation including night to day conversion, biometric recognition, exploring the better CNN architectures for different problems, image inpainting, and deep learning applications in autonomous navigation, biomedical image analysis, etc. The group is also dedicated towards producing the highly trained students in the broad areas of computer vision. The research produced by the Computer Vision Group will have a significant impact on companies working on Computer Vision, Image/Video Processing, Artificial Intelligence, Robotics and Machine Learning..

Faculties
No. Name of Faculty Area of Specialization
1. Dr. Himangshu Sarma
himangshu.sarma@iiits.in
Virtual reality, Augmented reality, Human Computer Interaction, Natural Language Processing
2. Dr. Mrinmoy Ghorai
mrinmoy.ghorai@iiits.in
Machine Learning, Image/Video/View Inpainting, Image Classification
3. Dr. Rakesh Kumar Sanodiya
rakesh.s@iiits.in
Computer Vision, Pattern Recognition, Machine Learning, Internet of Things

 

Sponsored Research Projects
Title PI and Co-PIs Duration Amount Funding Agency
Computer Vision Algorithm for Transformation of Night Time Images to Corresponding Day Time Images PI: Dr. Shiv Ram Dubey
Co-PI: Dr. Himangshu Sarma
2020 - 2021 Rs. 9,94,943/- DRDO Young Scientist Laboratory – Artificial Intelligence (DYSL-AI), DRDO
Development of Deep Learning based Hashing Techniques for Image Retrieval PI: Dr. Shiv Ram Dubey 2020 - 2023 Rs. 28,28,580/- DST/GITA (India) and MOST (Taiwan)
under Indo-Taiwan Joint Research Call
EMGNet: Development of Deep Learning Based Model for Hand Movements Classification using Surface EMG Signals PI: Dr. Anish Turlapaty
CoPI: Dr. Shiv Ram Dubey
2020 - 2022 Rs. 34,06,558/- SERB-DST, India under Core Research Grant
Biomedical Image Analysis and Design of an Information Interchange Framework Host PI: Dr. Shiv Ram Dubey
Project Fellow: Ms. Mavis Gezimati, HIT Zimbabwe
2020 - 2020 Rs. 4,00,000/- DST, India under Research and Training Fellowship
for Developing Country Scientists (RTF-DCS)
Development of Image Retrieval Methods for Biomedical and Health Informatics PI: Dr. Shiv Ram Dubey 2017 - 2020 Rs. 18,52,330/- SERB-DST, India under Early Career Research Award
Publications
2020
  • S.K. Roy, S.R. Dubey, S. Chatterjee and B.B. Chaudhuri, “FuSENet: Fused Squeeze-and-Excitation Network for Spectral-Spatial Hyperspectral Image Classification”, IET Image Processing, March 2020. DOI: 10.1049/iet-ipr.2019.1462
  • Shiv Ram Dubey and Snehasis Mukherjee. LDOP: Local Directional Order Pattern for Robust Face Retrieval. Multimedia Tools and Applications (MTAP), 79:6363–6382, March 2020. (Springer)
  • S.K. Roy, G. Krishna, S.R. Dubey and B.B. Chaudhuri, “HybridSN: Exploring 3D-2D CNN Feature Hierarchy for Hyperspectral Image Classification”, IEEE Geoscience and Remote Sensing Letters, 17(2):277-281, Feb 2020.
  • S.H. Shabbeer Basha, S.R. Dubey, P. Viswanath and S. Mukherjee, “Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification”, Neurocomputing, 378:112-119, Feb 2020. (Elsevier)
  • S.K. Roy, B. Chanda, B.B. Chaudhuri, D.K. Ghosh and S.R. Dubey, “Local Jet Pattern: A Robust Descriptor for Texture Classification”, Multimedia Tools and Applications (MTAP), 79:4783–4809, Feb 2020. (Springer)
  • S.K. Roy, D.K. Ghosh, S.R. Dubey, S. Bhattacharyya and B.B. Chaudhuri, “Unconstrained Texture Classification Using Efficient Jet Texton Learning”, Applied Soft Computing, 86:105910, Jan 2020. (Elsevier)
2019
  • S.R. Dubey, S.K. Roy, S. Chakraborty, S. Mukherjee and B.B. Chaudhuri, “Local Bit-plane Decoded Convolutional Neural Network Features for Biomedical Image Retrieval”, Neural Computing and Applications, 2019. (Springer)
  • S.R. Dubey, S. Chakraborty, S.K. Roy, S. Mukherjee, S.K. Singh and B.B. Chaudhuri. “diffGrad: An Optimization Method for Convolutional Neural Networks”, IEEE Transactions on Neural Networks and Learning Systems, Dec 2019.
  • S.R. Dubey, “Local Directional Relation Pattern for Unconstrained and Robust Face Retrieval”, Multimedia Tools and Applications (MTAP), 78(19):28063-28088, Oct 2019. (Springer)
  • S.R. Dubey, “Face Retrieval using Frequency Decoded Local Descriptor”, Multimedia Tools and Applications (MTAP), 78(12):16411-16431, June 2019. (Springer)
  • S.K. Roy, S.R. Dubey and B.B. Chaudhuri, “Local ZigZag Max Histograms of Pooling Pattern for Texture Classification”, IET Electronics Letters, 55(7): 382-384, 2019.
  • R.K. Thakur and S. Mukherjee, “Conditional Adversarial Network for Scene Flow Estimation”, Proc. of IEEE Ro-MAN, New Delhi, India, 2019.
  • A. Singh, A. Garg, J. Zhou, S.R. Dubey and D. Dutta, “NASIB: Neural Architecture Search withIn Budget”, NeurIPS Workshop on Meta-Learning (MetaLearn), Vancouver, Canada, 2019.
  • S.P.T. Reddy, S.T. Karri, S.R. Dubey and S. Mukherjee, “Spontaneous Facial Micro-Expression Recognition using 3D Spatiotemporal Convolutional Neural Networks”, IEEE International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019.
  • C. Nagpal and S.R. Dubey, “A Performance Evaluation of Convolutional Neural Networks for Face Anti Spoofing”, IEEE International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019.
  • Y. Srivastava, V. Murali and S.R. Dubey, “A Performance Evaluation of Loss Functions in Convolutional Neural Networks for Face Recognition”, Seventh National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG),
    India, 2019.
  • V.K. Repala and S.R. Dubey, “Dual CNN Models for Unsupervised Monocular Depth Estimation”, 8th International Conference on Pattern Recognition and Machine Intelligence (PReMI), India, 2019.
  • Y. Srivastava, V. Murali and S.R. Dubey, “PSNet: Parametric Sigmoid Norm Based CNN for Face Recognition”, Third IEEE Conference on Information and Communication Technology (CICT), IIIT Allahabad, India, 2019.
  • V.C. Sekhar, P. Mukherjee, D.S. Guru and P. Viswanath, “Online Signature Verification Based on Writer Specific Feature Selection and Fuzzy Similarity Measure in Applications of Computer Vision and Pattern Recognition to Media Forensics”, CVPRW, Long Beach California, USA, 2019.
  • V.C. Sekhar, P. Mukherjee, D.S. Guru and P. Viswanath, “OSVNet: Convolutional Siamese Network for Writer Independent Online Signature Verification”, in International Conference on Document Analysis and Recognition (ICDAR), University of Technology Sydney (UTS), Australia, 2019.
2018
  • S.K. Roy, B. Chanda, B.B. Chaudhuri, S. Banerjee, D.K. Ghosh, and S.R. Dubey, “Local Directional ZigZag Pattern: A Rotation Invariant Descriptor for Texture Classification”, Pattern Recognition Letters, 108:23-30, 2018. (Elsevier)
  • S.K. Roy, B. Chanda, B.B. Chaudhuri, D.K. Ghosh, and S.R. Dubey, “Local Morphological Pattern: A Scale Space Shape Descriptor for Texture Classification”, Digital Signal Processing, 82:152-165, 2018. (Elsevier)
  • P. Makula, A. Kumar and S. Mukherjee, “Measuring Level of Cuteness of Baby Images: A Supervised Learning Scheme”, Multimedia Tools and Applications, Springer, Vol. 77, No. 13, pp.- 16867-16885 (2018).
  • S. Mukherjee and K.K. Singh, “Human Action and Event Recognition Using A Novel Descriptor Based on Improved Dense Trajectories”, Multimedia Tools and Applications, Springer, Vol. 77, No. 11, pp.- 13661-13678 (2018).
  • S.H. Shabbeer Basha, S. Ghosh, K.K. Babu, S.R. Dubey, P. Viswanath and S. Mukherjee, “RCCNet: An Efficient Convolutional Neural Network for Histological Routine Colon Cancer Nuclei Classification”, 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), Singapore, 2018.
  • S.R. Dubey and S. Mukherjee, “A Multi-Face Challenging Dataset for Robust Face Recognition”, 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), Singapore, 2018.
  • R.K. Thakur and S. Mukherjee, “SceneEDNet: A Deep Learning Approach for Scene Flow Estimation”, Proc. of ICARCV 2018, Singapore, IEEE Computer Society, pp.- 394-399
  • Ch. Nagalakshmi and S. Mukherjee, “Classification of Yoga Asana from Single Image by Learning 3D View of Human Pose”, Proc. of WDH@ICVGIP 2018, IIIT Hyderabad, Springer LNCS (Accepted).
  • K. Borkar and S. Mukherjee, “Video Dehazing Using LMNN with Respect to Augmented MRF”, Proc. of ICVGIP 2018, IIIT Hyderabad, ACM, pp.- 42:1-42:9.
  •  K.S. Suma, G. Aditya and S. Mukherjee, “Activity Recognition in Egocentric Videos Using Bag of Key Action Units”, Proc. of ICVGIP 2018, IIIT Hyderabad, ACM, pp.- 9:1-9:9.
  • S.K. Roy, S.R. Dubey, B. Chanda and B.B. Chaudhuri, “TexFusionNet: An Ensemble of Deep CNN Feature for Texture Classification”, 3rd International Conference on Computer Vision and Image Processing (CVIP), India, 2018.2017
  • S.R. Dubey, S.K. Singh and R.K. Singh, “Local SVD based NIR Face Retrieval” Journal of Visual Communication and Image Representation, 49(C): 141-152, 2017. (Elsevier)
  • K.K. Singh and S. Mukherjee, “Recognizing Human Activities in Videos Using Improved Dense Trajectories Over LSTM”, Sixth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), IIT Mandi, India, 2017.
  • S.K. Roy, B. Chanda, B.B. Chaudhuri and S.R. Dubey, “A Complete Dual-Cross Pattern for Unconstrained Texture Classification”, Fourth Asian Conference on Pattern Recognition (ACPR 2017), Nanjing, China, 2017.