Hi Matthew,
First of all, I'd like to thank you for sharing your knowledge with us. I appreciate it very much and have learned a lot.
I used the functions cosine_distance and fuzzy_score on my DataFrame, and they gave me very strange results.
For cosine_distance, I expected a result between -1 and 1, but for two identical names I got -2.220446049250313E-16. Please take a look:
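+-------------------+-----------------------------+----------------------+
|full_name1         |full_name2                   |cosine_distance       |
+-------------------+-----------------------------+----------------------+
|John Stevenson Due |John Stevenson Due           |-2.220446049250313E-16|
+-------------------+-----------------------------+----------------------+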
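One possible explanation for the value above is floating-point rounding. The Python sketch below is an illustration only and is not taken from the library. It assumes that cosine_distance is computed as 1 minus the cosine similarity of whitespace-separated word counts. The function names and the clamping helper are hypothetical.

```python
import math
from collections import Counter


def cosine_distance(left: str, right: str) -> float:
    """1 - cosine similarity of word-count vectors (assumed definition)."""
    a = Counter(left.split())
    b = Counter(right.split())
    # Dot product over the words the two strings share.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # no words to compare: treat as maximally distant
    # sqrt(n) * sqrt(n) is not always exactly n in floating point, so the
    # ratio can land slightly above 1 and the distance slightly below 0.
    return 1.0 - dot / (norm_a * norm_b)


def clamped_cosine_distance(left: str, right: str) -> float:
    """Hypothetical helper: clip tiny negative rounding errors to 0."""
    return max(0.0, cosine_distance(left, right))


d = cosine_distance("John Stevenson Due", "John Stevenson Due")
print(d)  # a value whose magnitude is on the order of 1e-16, i.e. effectively 0
print(clamped_cosine_distance("John Stevenson Due", "John Stevenson Due"))
```

Under that assumption, a result with a magnitude around 1e-16 for identical strings would simply mean a distance of 0, and clamping or rounding the column would hide the rounding noise.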
For fuzzy_score, the same example gave me a score of 52 instead of a result between 0 and 1.
Fuzzy Score
+-------------------+-----------------------------+-----------+
|full_name1         |full_name2                   |fuzzy_score|
+-------------------+-----------------------------+-----------+
|John Stevenson Due |John Stevenson Due           |52         |
+-------------------+-----------------------------+-----------+
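The score of 52 would be consistent with an unnormalized, additive scoring scheme. The Python sketch below is a guess at such a scheme, not the library's code. It assumes one point per matched character plus two bonus points whenever a match immediately follows the previous match, similar to the FuzzyScore class in Apache Commons Text. The function name and signature here are hypothetical.

```python
def fuzzy_score(term: str, query: str) -> int:
    """Additive fuzzy score (assumed scheme): +1 per match, +2 if consecutive."""
    term = term.lower()
    query = query.lower()
    score = 0
    term_index = 0
    previous_match = -2  # sentinel so the first match never earns the bonus
    for ch in query:
        found = False
        # Scan forward in the term for the next occurrence of this character.
        while term_index < len(term) and not found:
            if ch == term[term_index]:
                score += 1
                if previous_match + 1 == term_index:
                    score += 2  # bonus for a run of consecutive matches
                previous_match = term_index
                found = True
            term_index += 1
    return score


name = "John Stevenson Due"
print(len(name))                # 18
print(fuzzy_score(name, name))  # 52 = 18 matches + 2 * 17 consecutive bonuses
```

If the function works this way, the score grows with string length and is not bounded by 1. Dividing by the maximum possible score, 3 * len(query) - 2 under this scheme, would be one way to map it into the 0 to 1 range.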
Could you please check it out?
Thanks a lot,
Rodrigo