-
Notifications
You must be signed in to change notification settings - Fork 0
Preprocessor
Kai edited this page Mar 10, 2023
·
4 revisions
The preprocessor parses the data contained in words.csv. This file contains the Greek New Testament separated by verse and word variants (obtained from Alan Bunning). After taking in the data, it is constructed into gword (Greek word) objects and writes the list to word_list.pkl. gword objects contain the word and a dictionary with the key as a Verse, and the value as a list of [Verse, Occurences]. The word_list.pkl is one of two inputs to the Probabilistic Data Synthesis program.