🛡️ PhishGuard: BERT-Based Phishing Detection

PhishGuard is an NLP-powered phishing detection model built using BERT.
It classifies text (e.g., emails, URLs, or messages) as phishing or legitimate.

The model is trained and hosted on Hugging Face, while this repository contains the training pipeline, preprocessing scripts, and evaluation code.

🚀 Model

The fine-tuned model is available on Hugging Face:
👉 bert-phishing-detector

You can load it directly in Python:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Swathi37/bert-phishing-detector")
model = AutoModelForSequenceClassification.from_pretrained("Swathi37/bert-phishing-detector")

text = "Your account has been suspended. Click here to verify."
inputs = tokenizer(text, return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits)

📦 Installation

git clone https://github.com/SwathiPriya37/PhishGuard.git
cd PhishGuard
pip install -r requirements.txt

🏋️ Training

To fine-tune BERT on your phishing dataset:

python src/train_bert.py

📊 Evaluation

Evaluate the trained model:

python src/evaluate_model.py

📂 Using Your Own Dataset

To train on your own dataset, prepare a CSV file with the following format:

csv text,label "Your account is locked. Verify now.",phishing "Meeting is scheduled at 3 PM tomorrow.",legitimate Then run:

bash

python src/train_bert.py --data data/your_dataset.csv

🤝 Contributing

Pull requests are welcome! For major changes, please open an issue first to discuss what you’d like to add.

📜 License

This project is licensed under the MIT License.

👩‍💻 Author

Developed by Swathi Priya R Model: Swathi37/bert-phishing-detector

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
data		data
models		models
notebooks		notebooks
reports/figures		reports/figures
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛡️ PhishGuard: BERT-Based Phishing Detection

🚀 Model

📦 Installation

🏋️ Training

📊 Evaluation

📂 Using Your Own Dataset

🤝 Contributing

📜 License

👩‍💻 Author

About

Uh oh!

Releases

Packages

Languages

SwathiPriya37/PhishGuard

Folders and files

Latest commit

History

Repository files navigation

🛡️ PhishGuard: BERT-Based Phishing Detection

🚀 Model

📦 Installation

🏋️ Training

📊 Evaluation

📂 Using Your Own Dataset

🤝 Contributing

📜 License

👩‍💻 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages