You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I wish to calculate the normalized similarity rapidfuzz.distance.Indel.normalized_similarity(val1, val2)
I have a word to match in the second column of a large tab delaminated file. I wish to get the whole line along with the normalized similarity scores of anything matching over 85%
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I wish to calculate the normalized similarity
rapidfuzz.distance.Indel.normalized_similarity(val1, val2)
I have a word to match in the second column of a large tab delaminated file. I wish to get the whole line along with the normalized similarity scores of anything matching over 85%
For eg
The word to match "evtatinyn"
few lines from input file:
mui001 lewthtin 0.000007
xui008 levthatin 0.0010004
ui1 [vtatinyn 0.0000807
ul5 levthatin 0.000003
ppu5 gevtiktin 0.000000002
pip9 lewttin 0.00008
muix1 mewttingiants 0.0000002
ftk69 wttinoo[ys 0.00001
I wish to get lines having similarity score in the second column over 85% (something like 0.8888888888888888 ui1 [vtatinyn 0.0000807)
At present I am loading data using pandas
indata = pd.read_csv('input.tab', sep='\t', lineterminator='\n')
Any assistance will be helpful.
Thanks
Beta Was this translation helpful? Give feedback.
All reactions