chile2706/processing-large-files

Processing Large Files

Purpose of the Program

  • Compare the 10 most popular words in two large files from Kaggle, True.csv and Fake.csv, which contain real and fake news articles respectively, to see how the two files differ.

Design of the Program

  • Read the content of both files and count the occurrences of each word.
  • Ignore uninteresting words such as ‘the’, ‘is’, and ‘in’, which are listed in ignore_words.txt.
  • Create two new files, each containing the 10 most popular words of the corresponding input file.
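
The steps above could be sketched in Python roughly as follows. This is a minimal illustration, not the repository's actual code. It assumes that ignore_words.txt holds one word per line, that a "word" is a run of lowercase letters with an optional apostrophe part, and that the output is written as `word,count` lines. The function names are hypothetical. The file is read line by line so a large input never has to fit in memory at once.

```python
import re
from collections import Counter
from typing import Iterable, Set

# Assumed definition of a word: letters, optionally followed by an
# apostrophe part (e.g. "don't"). The original tokenization is not specified.
WORD_RE = re.compile(r"[a-z]+(?:'[a-z]+)?")


def load_ignore_words(path: str) -> Set[str]:
    """Load the words to skip, assuming one word per line."""
    with open(path, encoding="utf-8") as f:
        return {line.strip().lower() for line in f if line.strip()}


def count_words(lines: Iterable[str], ignore: Set[str]) -> Counter:
    """Count every word in the given lines, skipping ignored words."""
    counts: Counter = Counter()
    for line in lines:  # one line at a time, so memory use stays small
        for word in WORD_RE.findall(line.lower()):
            if word not in ignore:
                counts[word] += 1
    return counts


def write_top_words(counts: Counter, out_path: str, n: int = 10) -> None:
    """Write the n most common words as 'word,count' lines."""
    with open(out_path, "w", encoding="utf-8") as f:
        for word, count in counts.most_common(n):
            f.write(f"{word},{count}\n")


def process(csv_path: str, ignore_path: str, out_path: str, n: int = 10) -> None:
    """Run the full pipeline for one input file."""
    ignore = load_ignore_words(ignore_path)
    with open(csv_path, encoding="utf-8") as f:
        counts = count_words(f, ignore)
    write_top_words(counts, out_path, n)
```

With these helpers, a call such as `process("True.csv", "ignore_words.txt", "true_top10.txt")` followed by the same call for Fake.csv would produce the two result files; the output file names here are placeholders.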

Output of the Program

10 most popular words in True.csv

10 most popular words in Fake.csv
