java - Is there apache or other utility to get the ngrams ( character) frequency that I can use for cosinesimilarity -
i need cosine similarity of 2 strings. using lucene apis ngrams ( ngramtokenizer , chartermattribute).and create frequency hashmap. use cosinesimilarity of apache-commons-text.( not active not keen on using pkg)
i saw link on stackoverflow shows how cosinesimilarity using lucene. how use lucene
just wondering if best way or there other util can use.
Comments
Post a Comment