petitminion I don't understant that part :s
Lets say we have a table with the lookup fields that have the important data, lower cased, punctuation removed, white space removed (along with a file_path that lets us know where the file is). Something like:
portisheadstrangers
u2wherethestreetshavenoname
queenwewillrockyou
And lets assume that there are 100k rows of this, one for each track. We must now find a search solution that if we query for something that is not quite 100% accurate. In the case below, the "s" at the end is missing:
portisheadstranger
should find
portisheadstrangers
Calculating the edit distance between two strings simply compares two strings. What we need is to implement a fuzzy search in a DB table (or document index).
Does this make more sense?
petitminion you have a table of recordings to look up for test purpose ? eg fuzzy data linked to clean data ?
How many GBs would you like? 🙂