Scrapping and summaryzing text, pdf and text document files, web contents, LinkedIn posts, pdf, and YouTube videos content.
Try The Project: Link
- Scrapping web pages to get web content.
- Scrapping YouTube videos links to get text subtitles in Arabic or English languages.
- Scrapping text files (PDF & Text) to get text content.
- Summarizing the text contents using smart chatbot with message history based on LLM model (Google Gemini) using Langchain, focus on the important notes, and adding Q/A.
- Chatbot can talking to the users, summarizing text messages, answering questions on the summarized contents, and can help users to use the web page explaining step by step.
- Designed a user friendly graphical interface using Streamlit.
- Graphical Interface: Streamlit
- Backend: Python
- LLM: Google Gemini (gemini-2.5-flash)
- Other Libraries: langchain, sentence_transformers, beautiful_soup, youtube_transcript_api, validators.
Install required packages:
pip install -r requirements.txt
Run the app:
streamlit run app.py
๐ฌ Watch the Demo Video
Developed by Abdallah Fekry
๐ง [email protected]