Dr. Kalluri Shareef Babu

Dr. Kalluri Shareef Babu

Assistant Professor - Senior Scale

Profile Summary

Dr. Kalluri Shareef Babu is a dedicated researcher and academician with expertise in Speech Signal Processing and Machine Learning. His research interests lie in the areas of Speaker Profiling, Speaker and Language Diarization, Automatic Speech Recognition (ASR), and Human Behaviour Analysis using both text and speech modalities. He earned his Ph.D. in Electronics and Communication Engineering from the National Institute of Technology Karnataka (NITK), Surathkal, in 2021. His doctoral work focused on developing speaker profiling techniques to estimate physical attributes such as height, age, and weight from speech signals.

Dr. Shareef Babu has held prestigious postdoctoral positions at Seikei University, Japan, where he worked on speech-based behavioural and psychological assessments through Motivational Interviewing, and at the Indian Institute of Science (IISc), Bangalore, where his research centred on speech technologies in multilingual and multi-speaker conversational environments.

He is currently serving as the lead organiser of the Diarization of Speaker and Language in Conversational Environments (DISPLACE) 2024 Challenge— part of a special session at Interspeech 2024, to be held in Kos Island, Greece.

Work Experience

  • Assistant Professor, School of Computer Science, UPES, Dehradun
    (December 2024 – Present)
  • Postdoctoral Researcher, LEAP Lab, Department of Electrical Engineering, Indian Institute of Science (IISc), Bangalore
    (2023 – 2024)
  • Postdoctoral Researcher, Department of Computer and Information Science, Seikei University, Tokyo, Japan
    (May 2021 – March 2023)

Research Interests

Dr. Shareef Babu’s research focuses on the application of deep learning techniques in Speech Technologies, with an emphasis on addressing challenges in real-world conversational environments. His key areas of interest include:

  • Speaker diarization in multi-speaker conversations
  • Speaker profiling
  • Language diarization in code-mixed and code-switching scenarios
  • Automatic Speech Recognition (ASR)
  • Speech pathology

His work aims to bridge the gap between advanced speech technologies and their practical deployment, adopting an interdisciplinary and AI-driven approach to enhance human-computer interaction.

Teaching Philosophy

Dr. Shareef Babu advocates for a hands-on, research-driven approach to teaching that fosters curiosity, critical thinking, and innovation. His teaching philosophy is rooted in the following principles:

  • Promoting application-based learning through real-world datasets and problem-solving.
  • Creating an inclusive and adaptive learning environment that respects diverse learning styles.
  • Encouraging students to engage with cutting-edge research and participate in multidisciplinary projects.
  • Integrating open-source tools such as Python, TensorFlow, and PyTorch for practical implementation and experimentation.
  • Bridging the gap between theoretical concepts and industry-relevant skills to prepare students for real-world challenges.

He aims to empower students to become confident, competent, and ethically responsible professionals in the evolving field of computer science.

Awards and Grants

  • MHRD Fellowship for Ph.D., Government of India
    (December 2013 – December 2018)
  • Awarded USD 280 - Student Volunteer Coordination Grant, ICASSP 2019, Brighton, UK
    (May 2019)
  • IEEE Signal Processing Society – Student Travel Grant for IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) 2016, Salerno, Italy
  • TEQIP-II International Travel Grant, MLSP 2016, Salerno, Italy
  • First Prize, National Level Student Technical Symposium (INTERACT2K10), Anil Neerukonda Institute of Technology, Visakhapatnam. (2010)

Professional Services

  • Reviewer, Speech Communication Journal
  • Reviewer, Interspeech (2021 – 2024)
  • Reviewer, SPECOM 2023
  • Reviewer, IEEE CONNECT 2020
  • Program Committee Member, Annual Conference on Intelligent User Interfaces (IUI) (2022 – 2024)

Scholarly Activities

  • Delivered a talk on Diarization of Speaker and Language in Conversational Environments (DISPLACE) at the EECS Symposium, IISc Bangalore, April 2024
  • Presented a poster at Interspeech 2024, Kos Island, Greece, September 2024
  • Presented a paper at ICASSP 2021, Toronto, Canada (Virtual), June 2021
  • Presented a poster at ICASSP 2019, Brighton, UK, May 2019
  • Presented a poster at IEEE TENCON 2017, Penang, Malaysia, November 2017
  • Presented a poster at IEEE MLSP 2016, Salerno, Italy, September 2016
  • Presented a paper at IEEE ADCOM 2016, Bangalore, India, September 2016
  • Attended Interspeech 2018, Hyderabad, India, September 2018

Research Mentorship and Grants

  • Automatic Multilingual Speaker Profiling and Forensics
    Role: Coordination of grant release and technical report writing
    Funding Agency: SERB, Government of India
  • Speech Analytics and Conversational AI
    Role: Proposal development, technical report writing, and action plan for 2024–2025
    Funding Agency: British Telecom
  • National Language Translation Mission (NLTM) – BHASHINI
    Role: Technical report writing and coordination of grant fund release (as part of IISc-NITK consortium)
    Funding Agency: MeitY, Government of Indiaedge graph - Network Modelling Analysis in Health Informatics and Bioinformatics, 2023. 

Publications

  1. Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy. “Automatic Speaker Profiling from Short Duration Speech Data”. Journal of Speech Communications, vol.121, pg.16-28, May 2020.
  2. Shareef Babu Kalluri, Prachi Singh, Pratik Roy Chowdhuri, Apoorva Kulkarni, Shikha Baghel, Pradyoth Hegde, Swapnil Sontakke, Deepak K T, Mahadeva Prasanna S R, Deepu Vijayasenan, Sriram Ganapathy, “The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments,” in the proceedings of the Interspeech 2024.
  3. Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy, Ragesh Rajan M, Prashant Krishnan, “NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling,” in the proceedings of the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario, Canada, IEEE, 2021.
  4. Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy, “A Deep Neural Network based End to End Model for Joint Height and Age Estimation from Short Duration Speech,” in the proceedings of 44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019.
  5. Shareef Babu Kalluri and Deepu Vijayasenan, “Robust Features for Automatic Estimation of Physical Parameters from Speech”, in the proceedings of TENCON 2017-2017 IEEE Region 10 Conference, IEEE, 2017.
  6. Shareef Babu Kalluri, Ashwin Vijayakumar, Deepu Vijayasenan, and Rita Singh, “Estimating multiple physical parameters from speech data,” in the proceedings of IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, 2016.
  7. Kotra Venkata Sai Ritwik, Shareef Babu Kalluri, Deepu Vijayasenan, “COVID-19 Detection from Spectral features on the DiCOVA Dataset”, in the proceedings of Interspeech 2021, Brno, Czech Republic.
  8. Deepu Vijayasenan, Shareef Babu Kalluri, K. Sreekanth, and Ansal Issac. “Study of Wireless Channel Effects on Audio Forensics” in the proceedings of 22nd Annual International Conference on Advanced Computing and Communication (ADCOM), pp.
    33-37.IEEE,2016.
  9. Kotra Venkata Sai Ritwik, Shareef Babu Kalluri, Deepu Vijayasenan,“COVID-19 Patient Detection from Telephone Quality Speech Data”, arXiv preprint arXiv:2011.04299 (2020).