Tweet Scraper X.com (v2.3.3)

[English Version] | Versi Bahasa Indonesia

🇬🇧 English Version

Overview

A powerful and user-friendly desktop application for scraping tweets from X.com (formerly Twitter). Built with Python and PyQt5, it offers advanced features like parallel scraping, sentiment analysis, trend detection, and a comprehensive analytics dashboard.

Key Features

🚀 Multi-Threaded Scraping: Significantly faster data collection using parallel threads.
📊 Analytics Dashboard: Built-in visualization for sentiment distribution (Pie Chart), top hashtags, and keywords (Bar Charts).
🧠 Custom Sentiment Analysis: Uses a specialized Indonesian lexicon (modified-lexicon_v2.txt) for accurate sentiment scoring (Positive, Negative, Neutral).
📈 Trend Detection: Automatically identifies trending topics and keywords from scraped data.
💾 Flexible Export: Save data in CSV, JSON, or Excel formats.
🔔 Smart Notifications: Desktop notifications and sound alerts ("Ting!") when scraping finishes or errors occur.
🎨 Modern UI: Responsive GUI with Dark/Light mode support.
🔐 Authentication: Supports cookie-based authentication to scrape deeper and faster.

Prerequisites

Python 3.8 or higher
Google Chrome Browser
Internet Connection

Installation

Clone the repository

git clone https://github.com/YourUsername/TweetScrapper.git
cd TweetScrapper

Install dependencies
```
pip install -r requirements.txt
```
Note: Ensure selenium, pandas, PyQt5, matplotlib, openpyxl, and winsound (windows default) are installed.
Prepare Lexicon Ensure modified-lexicon_v2.txt is present in the project root for sentiment analysis to work correctly.

Usage

Run the Application
```
python main.py
```
Login / Auth Token
- To ensure uninterrupted scraping, paste your X.com auth_token cookie into the "Autentikasi" field in the GUI.
Start Scraping
- Enter a Keyword (e.g., "Python", "#AI").
- Select Search Type (Latest/Top).
- Set Date Range and Limit.
- (Optional) Enable Multi-Threading for speed.
- Click ▶️ Mulai Scraping.
View Analytics
- Once finished, switch to the Analytics Dashboard tab to view charts and statistics.

Project Structure

TweetScrapper/
├── src/
│   ├── analysis/       # Sentiment & Trend logic
│   ├── config/         # Constants
│   ├── core/           # Deduplicator, Theme Manager
│   ├── gui/            # PyQt5 Windows & Widgets
│   └── scraper/        # Selenium Scraper Logic
├── modified-lexicon_v2.txt  # Custom Sentiment Lexicon
├── main.py             # Entry point
└── README.md           # Documentation

Disclaimer

This tool is for educational and research purposes only. Please respect X.com's Terms of Service and robots.txt policies. Users are responsible for how they use the scraped data.

🇮🇩 Tweet Scraper X.com - Indonesian Version

Ringkasan

Aplikasi desktop yang canggih dan mudah digunakan untuk mengambil data tweet dari X.com (Twitter). Dibangun dengan Python dan PyQt5, aplikasi ini menawarkan fitur-fitur seperti scraping paralel, analisis sentimen, deteksi tren, dan dashboard analitik yang lengkap.

Fitur Utama

🚀 Scraping Multi-Thread: Pengumpulan data jauh lebih cepat menggunakan parallel threads (banyak proses sekaligus).
📊 Dashboard Analitik: Visualisasi bawaan untuk distribusi sentimen (Digram Lingkaran), top hashtag, dan kata kunci (Diagram Batang).
🧠 Analisis Sentimen Khusus: Menggunakan kamus bahasa Indonesia khusus (modified-lexicon_v2.txt) untuk penilaian sentimen yang akurat (Positif, Negatif, Netral).
📈 Deteksi Tren: Mengidentifikasi topik dan kata kunci yang sedang tren secara otomatis dari data yang diambil.
💾 Ekspor Fleksibel: Simpan data dalam format CSV, JSON, atau Excel.
🔔 Notifikasi Pintar: Notifikasi desktop dan suara ("Ting!") saat proses selesai atau terjadi error.
🎨 UI Modern: Tampilan antarmuka responsif dengan dukungan Mode Gelap/Terang.
🔐 Autentikasi: Mendukung penggunaan cookie autentikasi untuk akses scraping yang lebih dalam dan stabil.

Prasyarat

Python 3.8 atau lebih baru
Browser Google Chrome
Koneksi Internet

Instalasi

Clone repositori

git clone https://github.com/UsernameAnda/TweetScrapper.git
cd TweetScrapper

Install dependensi
```
pip install -r requirements.txt
```
Catatan: Pastikan selenium, pandas, PyQt5, matplotlib, openpyxl, dan library pendukung lainnya terinstall.
Siapkan Lexicon Pastikan file modified-lexicon_v2.txt ada di folder utama proyek agar analisis sentimen bahasa Indonesia berfungsi.

Cara Penggunaan

Jalankan Aplikasi
```
python main.py
```
Login / Token Auth
- Masukkan cookie auth_token akun X.com Anda ke kolom "Autentikasi" di aplikasi untuk hasil terbaik.
Mulai Scraping
- Masukkan Kata Kunci (misal: "Pemilu", "#Teknologi").
- Pilih Tipe Pencarian (Latest/Top).
- Atur Rentang Tanggal dan Jumlah Tweet.
- (Opsional) Aktifkan Multi-Threading di menu opsi untuk kecepatan ekstra.
- Klik ▶️ Mulai Scraping.
Lihat Analitik
- Setelah selesai, pindah ke tab Analytics Dashboard untuk melihat grafik sentimen dan tren data yang baru saja diambil.

Struktur Proyek

TweetScrapper/
├── src/
│   ├── analysis/       # Logika Sentimen & Tren
│   ├── config/         # Konstanta
│   ├── core/           # Deduplikator, Manajer Tema
│   ├── gui/            # Jendela & Widget PyQt5
│   └── scraper/        # Logika Selenium Scraper
├── modified-lexicon_v2.txt  # Kamus Sentimen Custom
├── main.py             # File Utama
└── README.md           # Dokumentasi

Penafian (Disclaimer)

Alat ini dibuat hanya untuk tujuan pendidikan dan penelitian. Harap hormati Ketentuan Layanan (Terms of Service) X.com. Pengguna bertanggung jawab penuh atas penggunaan data yang diambil.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
output		output
src		src
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
TweetScraper.spec		TweetScraper.spec
auto-py-to-exe-config.json		auto-py-to-exe-config.json
build_gui.py		build_gui.py
main.py		main.py
modified-lexicon_v2.txt		modified-lexicon_v2.txt
requirements.txt		requirements.txt
simple_build.bat		simple_build.bat
tweet_dedup.db		tweet_dedup.db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tweet Scraper X.com (v2.3.3)

🇬🇧 English Version

Overview

Key Features

Prerequisites

Installation

Usage

Project Structure

Disclaimer

🇮🇩 Tweet Scraper X.com - Indonesian Version

Ringkasan

Fitur Utama

Prasyarat

Instalasi

Cara Penggunaan

Struktur Proyek

Penafian (Disclaimer)

About

Uh oh!

Releases 2

Packages

Languages

License

DhaniAAA/Scrapping-Qt5-Tweet

Folders and files

Latest commit

History

Repository files navigation

Tweet Scraper X.com (v2.3.3)

🇬🇧 English Version

Overview

Key Features

Prerequisites

Installation

Usage

Project Structure

Disclaimer

🇮🇩 Tweet Scraper X.com - Indonesian Version

Ringkasan

Fitur Utama

Prasyarat

Instalasi

Cara Penggunaan

Struktur Proyek

Penafian (Disclaimer)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages