Skip to content

Effortlessly extract comprehensive book details from your favorite online bookstore with my web scraper. Perfect for building your own book database or conducting research.

Notifications You must be signed in to change notification settings

michaelvbend/book-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraper for Book Data

This project is a web scraper designed to extract book data from a specified website. The scraper uses Selenium to navigate the website, locate book details, and save the extracted data into a JSON file.

Features

  • Extracts book details such as title, thumbnail, author, published date, description, page count, genre, and language.
  • Handles cookies and navigation through multiple pages.
  • Saves the extracted data incrementally to avoid data loss in case of interruptions.

Requirements

  • Python 3.x
  • Selenium
  • Chrome WebDriver

Installation

  1. Clone the repository:
    git clone https://github.com/michaelvbend/book-scraper.git
    cd webscraper
  2. Install Python packages:
    pip install -r requirements.txt

Usage

  1. Run the scraper
    python main.py
  2. Output: The scraped data will be saved in book_data.json

About

Effortlessly extract comprehensive book details from your favorite online bookstore with my web scraper. Perfect for building your own book database or conducting research.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages