The Advanced Web Scraping Tool (AWT) is a Python-based utility for scraping dynamic web content and performing reconnaissance on websites. It uses Selenium for browser automation and external tools such as Nikto, Amass, and WhatWeb to gather reconnaissance data. Whether you're extracting product details, performing security audits, or conducting research, AWT provides a streamlined workflow.

DISCLAIMER: Use for educational purposes only, and at your own risk.
- Dynamic Content Handling: Automatically scrolls through pages to load all dynamic content.
- Customizable Selectors: Supports both XPath and CSS selectors for flexible data extraction.
- Headless Mode: Option to run the browser in headless mode for faster execution.
- Comprehensive Reconnaissance:
  - WHOIS Lookup
  - NSLookup
  - DIG Queries (A, AAAA, MX, NS, TXT records)
  - WhatWeb Scan
  - Curl Headers Collection
  - Nikto Vulnerability Scan
  - Amass Subdomain Enumeration
- Log Management: Saves all scraped data and reconnaissance results into a customizable log file.
- User-Friendly Interface: Interactive prompts guide users through the setup and execution process.
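The dynamic content handling described above is commonly implemented by scrolling until the page height stops growing. The following is a minimal sketch of that idea, not the code in scrape.py: the function name `scroll_to_bottom` and its `pause` and `max_rounds` parameters are assumptions made for illustration. It relies only on Selenium's `execute_script` method, so any WebDriver instance can be passed in.

```python
import time


def scroll_to_bottom(driver, pause=1.0, max_rounds=30):
    """Scroll until the page height stops growing.

    `driver` is any object with Selenium's `execute_script` method.
    Returns the number of scroll steps performed.
    """
    last_height = driver.execute_script("return document.body.scrollHeight")
    rounds = 0
    while rounds < max_rounds:
        # Jump to the current bottom of the page to trigger lazy loading.
        driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        time.sleep(pause)  # give newly requested content time to render
        rounds += 1
        new_height = driver.execute_script("return document.body.scrollHeight")
        if new_height == last_height:
            break  # nothing new was loaded, so the bottom has been reached
        last_height = new_height
    return rounds
```

With a real Selenium session, a call to `scroll_to_bottom(driver)` would typically go after `driver.get(url)` and before locating elements with XPath or CSS selectors. The `max_rounds` cap guards against pages that keep loading content indefinitely.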
Before running the script, ensure you have the following installed:
- Python 3.x: Download Python
- Firefox Browser: Download Firefox
- Geckodriver: Download Geckodriver
  - Add geckodriver to your system's PATH, or use ChromeDriver instead and change the driver path in the script accordingly.
- Required Tools:
  - nikto: Install via package manager (e.g., sudo apt install nikto on Ubuntu).
  - amass: Install Amass
  - whatweb: Install via package manager (e.g., sudo apt install whatweb on Ubuntu).
- Python Dependencies:
pip install selenium
git clone https://github.com/ANONYMOUSx46/Advanced-Web-Scraping-Too.git
cd Advanced-Web-Scraping-Tool-
Ensure all external tools (nikto, amass, whatweb, etc.) are installed and added to your system's PATH. You can verify this by running each tool's name (e.g., nikto, amass) in a terminal or Command Prompt.
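The PATH check can also be done from Python before starting a scan. This is a small sketch using the standard library's `shutil.which`. The `REQUIRED_TOOLS` list is an assumption derived from the feature list, and `find_missing_tools` is a hypothetical helper, not part of the script.

```python
import shutil

# Assumed tool names, taken from the feature list; adjust to what the script actually calls.
REQUIRED_TOOLS = ["whois", "nslookup", "dig", "whatweb", "curl", "nikto", "amass"]


def find_missing_tools(tools=REQUIRED_TOOLS):
    """Return the tool names that cannot be found on the current PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```

`shutil.which` works on both Windows and Linux, so the same check applies to either platform.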
Run the script:
python scrape.py
The script has been tested on both Windows and Kali Linux. On Windows, some tools such as dig and curl may need to be installed separately; Cygwin or WSL (Windows Subsystem for Linux) can provide them. Ensure all required tools are accessible from your terminal or command prompt before running the script.
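Because dig may be absent on Windows, a reconnaissance step can skip the queries gracefully instead of crashing. The sketch below shows one way to do that for the record types listed in the features. The helper names `build_dig_command` and `run_dig_queries`, the `+short` option, and the timeout value are illustrative assumptions, and the script's actual implementation may differ.

```python
import shutil
import subprocess

RECORD_TYPES = ["A", "AAAA", "MX", "NS", "TXT"]  # record types from the feature list


def build_dig_command(domain, record_type):
    """Return the argument list for one dig query, e.g. dig example.com MX +short."""
    # Passing a list (not a shell string) avoids shell injection via the domain.
    return ["dig", domain, record_type, "+short"]


def run_dig_queries(domain, record_types=RECORD_TYPES, timeout=15):
    """Run one dig query per record type and return {record_type: output or note}."""
    if shutil.which("dig") is None:
        # Typical on a plain Windows install without Cygwin or WSL.
        return {rt: "skipped: dig not found on PATH" for rt in record_types}
    results = {}
    for rt in record_types:
        try:
            proc = subprocess.run(
                build_dig_command(domain, rt),
                capture_output=True,
                text=True,
                timeout=timeout,
            )
            results[rt] = proc.stdout.strip()
        except subprocess.TimeoutExpired:
            results[rt] = "skipped: query timed out"
    return results
```

Calling `run_dig_queries("example.com")` returns a dictionary keyed by record type, which can then be written to the log file alongside the other results.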