This project is a demo web scraping API built using Playwright and Express.js. It scrapes data from five publicly available websites that are safe and legal for practice.
- β
Quotes scraper from
quotes.toscrape.com - β
Book title & price scraper from
books.toscrape.com - β
HTML test scraping from
httpbin.org - β
Product scraper from
scrapeme.live/shop/
This project also demonstrates how to organize and expose RESTful API routes in a backend using Express.js. It follows a basic REST pattern:
GETmethod for fetching scraped data- Separation of concerns using routes and controllers
- Clean modular design suitable for scaling into larger applications
| Route | Description |
|---|---|
/quote |
Fetch quotes and authors |
/books |
Fetch list of books with price |
/httpbin |
Fetch title from basic static HTML |
/product |
Fetch product list from fake shop |
Each route is handled by a separate controller in controller/ folder.
git clone https://github.com/your-username/playwright-scraping-api.git
cd playwright-scraping-api