on 10-19-2020 11:04 AM
I am calculating the Match score using the Weight Method. To verify the result I am trying to reproduce the calculation manually, but for that the 'Similarity Score' needs to be calculated first.
For records where the number of characters is the same, it is quite easy to get the 'Similarity Score'. For example:
Record A | Record B | Similarity |
---|---|---|
Smith | Smith | 100% |
Smith | Smitt | 80% |
ms@ sap.com | msmith@ sap.com | 84% |

How is the similarity for the email addresses coming out as 84%? Please help me with the calculation behind it.
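To make the manual check concrete, here is a minimal Python sketch of two common, generic similarity measures applied to the examples above. Neither is confirmed to be what DS uses; the function names `levenshtein`, `edit_similarity`, and `matching_ratio` are only illustrative, and the email strings are taken exactly as typed in the post, including the space after the `@`.

```python
from difflib import SequenceMatcher


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes, or substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def edit_similarity(a: str, b: str) -> float:
    """1 - distance / length of the longer string (assumed normalization)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def matching_ratio(a: str, b: str) -> float:
    """2 * matched characters / total characters, as computed by difflib."""
    return SequenceMatcher(None, a, b).ratio()


pairs = [
    ("Smith", "Smith"),
    ("Smith", "Smitt"),
    ("ms@ sap.com", "msmith@ sap.com"),
]
for a, b in pairs:
    print(f"{a!r} vs {b!r}: "
          f"edit={edit_similarity(a, b):.3f} ratio={matching_ratio(a, b):.3f}")
```

Both measures give 100% and 80% for the two "Smith" rows. For the email row the edit-distance measure gives about 73%, while the matching-characters ratio gives about 84.6%. That the second one lands near 84% is suggestive, but it does not establish which algorithm DS actually runs.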
Hi there,
This is not something you can verify yourself. DS uses internal code/algorithms to compute such values.
Nevertheless, the string similarity is most likely calculated by a variation of the basic Smith-Waterman algorithm.
If you are really keen on testing its basic functionality, you could visit this site and see which method gets closest to the DS output:
https://asecuritysite.com/forensics/simstring
Regards,
Julian
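For anyone who wants to experiment offline, below is a minimal sketch of the basic Smith-Waterman local alignment score in Python. The scoring values (`match=2`, `mismatch=-1`, `gap=-1`) and the normalization in `sw_similarity` are assumptions chosen for illustration, not DS settings, so the percentages it produces should not be expected to match DS exactly.

```python
def smith_waterman(a: str, b: str, match: int = 2,
                   mismatch: int = -1, gap: int = -1) -> int:
    """Best local alignment score between a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            # A local alignment may restart anywhere, so scores never go below 0.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def sw_similarity(a: str, b: str, match: int = 2,
                  mismatch: int = -1, gap: int = -1) -> float:
    """Score divided by the best score the shorter string could reach."""
    shortest = min(len(a), len(b))
    if shortest == 0:
        return 0.0
    return smith_waterman(a, b, match, mismatch, gap) / (match * shortest)


for a, b in [("Smith", "Smith"), ("Smith", "Smitt"),
             ("ms@ sap.com", "msmith@ sap.com")]:
    print(f"{a!r} vs {b!r}: score={smith_waterman(a, b)} "
          f"similarity={sw_similarity(a, b):.3f}")
```

Changing the scoring values or the normalization shifts the result, which is one reason a manual calculation can differ from the DS output by a few points.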