Discussion on Stitch the Torn Wiki Challenge

3 years ago+ 0 comments

I won't post the code here, but here's how I solved it:

Setup (spliting words in spaces and special characters)
Create a dictionary counting the ammount of words for each text
Create a global dictionary for the total ammount of words
Normalize each local dictionary, considering the global frequency
Considering each word as a vector, calculate cosine distance between every text from setA versus setB.
Create similarity matrix
Find the permutation matrix to maximize PxS

Got 100% with this approach