|
ngram_stemmer
ngram_stemmer (word_list, size, equality)
|
Added in advas.0.0.3
Reduces word_list according to the n-gram stemming method.
In: |
word_list |
a simple list of words |
|
size |
integer, length of n-grams |
|
equality |
float, degree of equality. |
Out: |
|
a list of words, already conflated if possible. |
Equality is a value between 0 and 1. The higher the value, the higher the degree of equality between two words so that they are conflated. A recommended value is between 0.8 and 0.9.
|
|