alexradu04/JBG030-DrakeJet


JBG030-DrakeJet

Install the following packages before running our code:

pip install pysqlite3
pip install tqdm
pip install langdetect
pip install googletrans==4.0.0-rc1
pip install geopy==2.2.0
pip install pandas
pip install numpy
pip install vaderSentiment 
pip install fuzzywuzzy
pip install requests
pip install seaborn
pip install matplotlib
pip install spacy
python -m spacy download en_core_web_sm
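After installing, it can help to confirm that every package is visible to the interpreter you will run the code with. The following is a minimal sketch, not part of the project: the helper name `missing_modules` is hypothetical, and the import names are assumed to match the pip names listed above.

```python
import importlib.util

# Import names assumed from the pip commands above; adjust if an import
# name differs from its pip name in your environment.
REQUIRED_MODULES = [
    "pysqlite3", "tqdm", "langdetect", "googletrans", "geopy", "pandas",
    "numpy", "vaderSentiment", "fuzzywuzzy", "requests", "seaborn",
    "matplotlib", "spacy", "en_core_web_sm",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Not installed:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

Run it from the VSCode terminal so that it checks the same Python installation the pip commands used.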

To run our code, install the latest version of Python and the latest version of VSCode with the SQLite extension. After installing VSCode, you can find the extension by searching for SQLite in the Extensions tab; the one published as [alexcvzz.vscode-sqlite] is sufficient. Make sure the Python extension for VSCode is installed as well. With both extensions installed, the code can be run.

How to run our code: First, unzip the folder. Open VSCode, go to File, click 'Open Folder', and choose the folder you downloaded. Then open a terminal and install the libraries listed above, and add the SQLite extension if you have not done so already. You can then begin running the code.

  1. Run the data cleaning file [FINISHED DB.py]. Before running it, specify the path of your data directory in the file [path_to_data.txt].

  2. Run [main.py]. Errors are common at this step. If one of the files does not run, an error is shown and the code prints the file for which it could not find the directory. Run that file again manually, then continue running the remaining files manually from that point. The running time depends on your hardware and internet speed (translation and geopy), but it should be within 8 hours in total (we were able to run it in 4-5 hours; worst case 12).

  3. Run the map visualization code [Plotly_VIS_complete.py]. The interactive maps will open in a pop-up in your browser.

  4. After this is complete, you may run demo.py to load all the visualizations used on our poster. The boxplot, unique conversations, and vis distribution each have start and end dates in the code, which you can change as you please. If you run it as is, it shows the total statistics for all the data without restrictions.
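The run order above can also be driven from a small helper that stops at the first script that fails and reports where to resume. This is only a sketch under assumptions: `run_in_order`, `run_script`, and `main` are hypothetical names that do not exist in the repository, and the sketch assumes each script signals failure through a non-zero exit code.

```python
import subprocess
import sys

# Script names in the order given in the steps above.
PIPELINE = ["FINISHED DB.py", "main.py", "Plotly_VIS_complete.py", "demo.py"]


def run_script(path):
    """Run one script with the current interpreter; True if it exited cleanly."""
    return subprocess.run([sys.executable, path]).returncode == 0


def run_in_order(scripts, runner=run_script, start_at=0):
    """Run scripts from index start_at onward.

    Returns the index of the first script that failed, or None if all ran.
    """
    for index in range(start_at, len(scripts)):
        if not runner(scripts[index]):
            return index
    return None


def main(start_at=0):
    """Run the pipeline and print where to resume if a script fails."""
    failed = run_in_order(PIPELINE, start_at=start_at)
    if failed is None:
        print("All scripts finished.")
    else:
        print(f"Stopped at {PIPELINE[failed]!r}; resume with main(start_at={failed}).")
```

Call `main()` from the project folder to start from the first script, or `main(start_at=1)` to resume from [main.py] after a failure.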

A few possible differences

Because Geopy is rate limited at times, your database may end up with a slightly different number of locations (the difference is very minimal). This happens because Geopy can sometimes print an ERROR while the code is running. This is not a concern: after a while the error prints stop and the code continues to run.
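One common way to reduce the number of locations lost to rate limiting is to retry a failed lookup after a short pause. The sketch below illustrates that idea only and is not how our code handles errors: `geocode_with_retry` is a hypothetical helper, and `geocode` stands for whatever lookup callable is in use.

```python
import time


def geocode_with_retry(geocode, query, retries=3, wait_seconds=1.0):
    """Call geocode(query), retrying after a pause when it raises.

    Returns the lookup result, or None if every attempt failed.
    """
    for attempt in range(retries):
        try:
            return geocode(query)
        except Exception as error:
            print(f"Geocoding {query!r} failed (attempt {attempt + 1}): {error}")
            if attempt < retries - 1:
                time.sleep(wait_seconds)  # give the service time to recover
    return None
```

With geopy, `geocode` could be the `geocode` method of a geocoder object. geopy also ships its own `RateLimiter` class in `geopy.extra.rate_limiter`, which spaces out calls and retries on errors.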
