International Journal Of Engineering And Management Research
  • Year: 2025
  • Volume: 15
  • Issue: 6

GenAI Based YouTube Video Summarizer

  • Author:
  • DK Jadhav1,*, SR Devardekar2, AB Talandage3, OT Bhokare4, AA Kudale5, VS Patil6
  • Total Page Count: 7
  • Page Number: 17 to 23

1Deepali Kishor Jadhav, Assistant Professor, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

2Samiksha Raghunath Devardekar, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

3Amruta Bharat Talandage, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

4Onkar Tanajirao Bhokare, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

5Akshata Ashok Kudale, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

6Vaishnavi Shivaji Patil, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India

*Corresponding Author Deepali Kishor Jadhav, Assistant Professor, Department of Computer Science and Engineering, KITCOEK, Kolhapur, Maharashtra, India, Email: jadhav.deepali@kitcoek.in

Online published on 12 March, 2026.

Abstract

This paper proposes an intelligent, web-based application—AI video summarizer—that efficiently extracts, Tran- scribes, and summarizes YouTube video content using advanced AI models such as Google Gemini. By simply entering a video link, users can obtain multilingual transcripts (in English, Hindi, and Marathi), concise summaries, and time stamped highlights of key moments. Furthermore, the application converts the generated summaries into audio using GTTS and offers options to download or copy full transcripts. Built with Streamlit, it provides an interactive and user-friendly interface. This solution addresses the growing challenge of overwhelming digital video content, offering a more accessible, time-saving, and language- inclusive way to understand and utilize video information across various fields.

Keywords

Video Summarization, Artificial Intelligence, Text-To-Speech, Multilingual Transcription, Streamlit Interfac