faizanr27/Web-scrapper

Web Scraper using Puppeteer

Overview

This is a web scraper built with Puppeteer that extracts data from YouTube videos, websites, and tweets. It drives a headless Chrome browser to load each page and read the relevant details.

Features

YouTube Scraping

Provide a YouTube video link, and the scraper extracts:

  • Video Title
  • Description
  • Auto-generated Transcript (if available)
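As a rough illustration of the title and description step, here is a minimal sketch. It is not the repository's code: `getYouTubeVideoId` and `scrapeYouTubeVideo` are hypothetical names, and it assumes the watch page exposes `meta[name="title"]` and `meta[name="description"]` tags, which may change and may carry a shortened description. Transcript extraction is left out because it depends on YouTube UI details this README does not describe. The `page` argument is expected to be a Puppeteer `Page`.

```javascript
// Hypothetical helper: pull the video ID out of a watch or youtu.be link.
function getYouTubeVideoId(videoUrl) {
  const parsed = new URL(videoUrl);
  const host = parsed.hostname.replace(/^(www\.|m\.)/, '');
  if (host === 'youtu.be') return parsed.pathname.split('/')[1] || null;
  if (host === 'youtube.com') return parsed.searchParams.get('v');
  return null;
}

// Hypothetical scraper: `page` is an already opened Puppeteer Page.
async function scrapeYouTubeVideo(page, videoUrl) {
  const videoId = getYouTubeVideoId(videoUrl);
  if (!videoId) throw new Error(`Not a YouTube video URL: ${videoUrl}`);

  await page.goto(`https://www.youtube.com/watch?v=${videoId}`, {
    waitUntil: 'domcontentloaded',
  });

  // Assumption: the page carries <meta name="title"> and <meta name="description">.
  // page.$eval rejects when nothing matches, so fall back to null.
  const readMeta = (name) =>
    page
      .$eval(`meta[name="${name}"]`, (el) => el.getAttribute('content'))
      .catch(() => null);

  const [title, description] = await Promise.all([
    readMeta('title'),
    readMeta('description'),
  ]);
  return { videoId, title, description };
}
```

To try it, create the page with `const browser = await puppeteer.launch()` and `const page = await browser.newPage()`, pass that page in, and call `browser.close()` when finished.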

Website Scraping

Extracts text-based content from any given website URL.
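One plausible way to do this with Puppeteer is to load the page and read the rendered text of `document.body`. The sketch below assumes that approach; `scrapeWebsiteText` and `normalizeText` are hypothetical names rather than functions from this repository, and `page` is expected to be a Puppeteer `Page` created elsewhere.

```javascript
// Hypothetical helper: collapse runs of spaces and blank lines in scraped text.
function normalizeText(raw) {
  return raw
    .replace(/[ \t]+/g, ' ')
    .replace(/\s*\n\s*/g, '\n')
    .trim();
}

// Hypothetical scraper: `page` is an already opened Puppeteer Page.
async function scrapeWebsiteText(page, url) {
  // 'domcontentloaded' is enough for static pages; sites that render
  // content with JavaScript may need 'networkidle2' instead.
  await page.goto(url, { waitUntil: 'domcontentloaded' });

  const title = await page.title();
  // innerText reflects what is rendered, so script and style contents are skipped.
  const text = await page.evaluate(() => document.body.innerText);

  return { url, title, text: normalizeText(text) };
}
```

Taking the page as a parameter keeps browser start-up and shutdown in one place, so the same browser instance can be reused across several URLs.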

Tweet Scraping

Fetches the content of a given tweet, including text and metadata.
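The sketch below shows one way the tweet step could look. It rests on assumptions that are not confirmed by this README: that the tweet body is rendered inside an element matching `article [data-testid="tweetText"]`, that the timestamp sits in an `article time` element with a `datetime` attribute, and that the tweet is viewable without logging in. `parseTweetUrl` and `scrapeTweet` are hypothetical names, and `page` is expected to be a Puppeteer `Page`.

```javascript
// Assumed selector for the tweet body; the site's markup can change at any time.
const TWEET_TEXT_SELECTOR = 'article [data-testid="tweetText"]';

// Hypothetical helper: read the username and tweet ID from a status link.
function parseTweetUrl(tweetUrl) {
  const { hostname, pathname } = new URL(tweetUrl);
  const host = hostname.replace(/^(www\.|mobile\.)/, '');
  if (host !== 'twitter.com' && host !== 'x.com') return null;
  const match = pathname.match(/^\/([^/]+)\/status\/(\d+)/);
  return match ? { username: match[1], tweetId: match[2] } : null;
}

// Hypothetical scraper: `page` is an already opened Puppeteer Page.
async function scrapeTweet(page, tweetUrl) {
  const ids = parseTweetUrl(tweetUrl);
  if (!ids) throw new Error(`Not a tweet URL: ${tweetUrl}`);

  await page.goto(tweetUrl, { waitUntil: 'domcontentloaded' });
  // Tweets are rendered client-side, so wait for the text node to appear.
  await page.waitForSelector(TWEET_TEXT_SELECTOR, { timeout: 15000 });

  const text = await page.$eval(TWEET_TEXT_SELECTOR, (el) => el.innerText);
  // The timestamp is optional metadata; fall back to null if it is missing.
  const postedAt = await page
    .$eval('article time', (el) => el.getAttribute('datetime'))
    .catch(() => null);

  return { ...ids, text: text.trim(), postedAt };
}
```

A tweet with no text (for example, an image-only post) would make `waitForSelector` time out under these assumptions, so a caller may want to catch that error.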

Installation

Make sure you have Node.js installed.

# Clone the repository
git clone https://github.com/faizanr27/Web-scrapper.git
cd Web-scrapper

# Install dependencies
npm install

Usage

Run the scraper using:
node service.js

Modify the script to pass different URLs depending on the data you want to scrape.
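If editing the script for every URL becomes tedious, one option is to classify the URL and dispatch to the matching scraper. The sketch below is only an illustration of that idea: `detectSource` is a hypothetical helper, not something `service.js` is known to contain, and the host names it checks are assumptions about which links count as YouTube videos or tweets.

```javascript
// Hypothetical helper: decide which scraper a URL should be routed to.
function detectSource(rawUrl) {
  const { hostname, pathname } = new URL(rawUrl);
  const host = hostname.replace(/^(www\.|m\.|mobile\.)/, '');

  if (host === 'youtube.com' || host === 'youtu.be') return 'youtube';

  const isTweetHost = host === 'twitter.com' || host === 'x.com';
  if (isTweetHost && /\/status\/\d+/.test(pathname)) return 'tweet';

  // Anything else is treated as a generic website.
  return 'website';
}
```

The URL could then come from the command line, for example `detectSource(process.argv[2])`, so the script does not need to be edited between runs.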

Dependencies

Puppeteer - Headless Chrome automation

For any issues or feature requests, feel free to create an issue in the repository!
