Cosine_distance and Fuzz_Score #13

@schammass-zz

Hi Matthew,

First of all, I'd like to thank you for sharing your knowledge with us. I appreciate it very much and have learned a lot.

I applied the functions cosine_distance and fuzzy_score to my DataFrame, and both gave me very strange results.

For cosine_distance, I expected results between -1 and 1, but for two identical names I got -2.220446049250313E-16. Please take a look:

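+-------------------+-----------------------------+----------------------+
|full_name1         |full_name2                   |cosine_distance       |
+-------------------+-----------------------------+----------------------+
|John Stevenson Due |John Stevenson Due           |-2.220446049250313E-16|
+-------------------+-----------------------------+----------------------+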
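
For reference, the magnitude of this value equals double-precision machine epsilon (2^-52), which suggests floating-point rounding rather than a logic error. The following is a minimal sketch, assuming the distance is computed as 1 minus the cosine similarity of two count vectors. The helper names and the toy vector are hypothetical and are not taken from the library.

```python
import math
import sys


def cosine_distance(a, b):
    """Hypothetical helper: 1 - cosine similarity of two numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def clamped_cosine_distance(a, b):
    """Same as above, but tiny negative rounding residue is clamped to 0."""
    return max(0.0, cosine_distance(a, b))


# Toy token-count vector for one name compared with itself (illustrative only).
v = [1.0, 1.0, 1.0]

raw = cosine_distance(v, v)
print(raw)                            # very close to 0; sign depends on rounding
print(clamped_cosine_distance(v, v))  # never negative
print(sys.float_info.epsilon)         # 2.220446049250313e-16
```

Under that assumption, identical inputs yield a result within a few multiples of epsilon of zero, and clamping at 0 is one common way to hide the residue.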

For fuzzy_score on the same example, I got a score of 52 instead of a result between 0 and 1:

+-------------------+-----------------------------+-----------+
|full_name1         |full_name2                   |fuzzy_score|
+-------------------+-----------------------------+-----------+
|John Stevenson Due |John Stevenson Due           |52         |
+-------------------+-----------------------------+-----------+
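
A score of 52 is what an unnormalized, additive scoring rule would produce for this input. The sketch below is a Python port written for illustration, assuming fuzzy_score follows a rule like the one documented for Apache Commons Text `FuzzyScore` (one point per matched character, two bonus points when a match directly follows the previous match). Whether the library actually delegates to that class is an assumption.

```python
def fuzzy_score(term: str, query: str) -> int:
    """Illustrative port of an additive fuzzy-score rule (an assumption)."""
    term_l, query_l = term.lower(), query.lower()
    score = 0
    term_index = 0       # the scan through `term` never moves backwards
    prev_match = None    # index in `term` of the previous matched character

    for q in query_l:
        while term_index < len(term_l):
            idx = term_index
            term_index += 1
            if term_l[idx] == q:
                score += 1  # one point for the match itself
                if prev_match is not None and prev_match + 1 == idx:
                    score += 2  # bonus for a consecutive match
                prev_match = idx
                break
    return score


name = "John Stevenson Due"
print(len(name))                # 18
print(fuzzy_score(name, name))  # 52 = 1 + 17 * 3
```

With that rule, the score grows with string length and is not bounded by 1, so an 18-character exact match yields 52.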

Could you please check it out?

Thanks a lot,

Rodrigo
