Caverphone, NYSIIS, and StatCan Added to Phonics Package

Monday September 28, 2015

•  data science •  free stuff •  linguistics •  mathematics •  Metaphone •  phonetics •  phonics •  R •  scientific computing •  software •  source code •  systems science •  text analysis • 

Over the last week, I have added Caverphone, Caverphone 2, the New York State Identification and Intelligence System, the modified New York State Identification and Intelligence System, and the Census Modified Statistics Canada phonetic algorithms to the phonics in R software package.

While Metaphone is written in C++, all of these algorithms could be implemented using regular expressions. And we all known how much I love a good regex. Each of these are written in pure R using regexes. So they are a bit slow, but they are almost certainly fast enough to get the job done.

Image by Arian Zwegers / Flickr.