refinr by Chris Muir
R package implementation of two algorithms from the open source software OpenRefine. These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The cluster methods used are key collision and ngram fingerprint (more info on these here). In addition, the function n_gram_merge uses approximate string matching applied to the ngram fingerprints to form clusters.