I have a bunch of old JavaScript files that look very similar. I would like to implement a tool that detects copied-and-pasted code, but I could not find a description of an algorithm for it. I am already using Sonar with the JavaScript plugin to detect such code, but I would like finer control over the detection.
Is there a "standard" algorithm for this problem? Is there a library that performs this analysis (Python, Java, ...)?
Thanks.