Skip to content

Scrapping and summaryzing text, pdf and text document files, web contents, LinkedIn posts, pdf, and YouTube videos content.

Notifications You must be signed in to change notification settings

BeboFekry/Octobot-Smart-Summarizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

94 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Octobot - Smart Summarizer

Scrapping and summaryzing text, pdf and text document files, web contents, LinkedIn posts, pdf, and YouTube videos content.

Try The Project: Link

Key Points

  • Scrapping web pages to get web content.
  • Scrapping YouTube videos links to get text subtitles in Arabic or English languages.
  • Scrapping text files (PDF & Text) to get text content.
  • Summarizing the text contents using smart chatbot with message history based on LLM model (Google Gemini) using Langchain, focus on the important notes, and adding Q/A.
  • Chatbot can talking to the users, summarizing text messages, answering questions on the summarized contents, and can help users to use the web page explaining step by step.
  • Designed a user friendly graphical interface using Streamlit.

๐Ÿง  Tech Stack

  • Graphical Interface: Streamlit
  • Backend: Python
  • LLM: Google Gemini (gemini-2.5-flash)
  • Other Libraries: langchain, sentence_transformers, beautiful_soup, youtube_transcript_api, validators.

๐Ÿš€ Getting Started

๐Ÿ”ง Prerequisites

Install required packages: pip install -r requirements.txt

Run the app: streamlit run app.py


Screenshots

๐ŸŽฌ Watch the Demo Video


Contact

Developed by Abdallah Fekry

๐Ÿ“ง [email protected]

๐ŸŒ LinkedIn | GitHub

About

Scrapping and summaryzing text, pdf and text document files, web contents, LinkedIn posts, pdf, and YouTube videos content.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages