python 3.x - Custom stemming and stopwords in CountVectroizer sklearn with .txt dictionary -

python 3.x - Custom stemming and stopwords in CountVectroizer sklearn with .txt dictionary -

February 15, 2012

how can custom stemming , stopwords in countvectroizer sklearn .txt dictionary?

this how used countvectorizer:

from pandas import dataframe cv=countvectorizer(token_pattern=u'(?u)\\b\\w+\\b', min_df=0, max_df=1.0) post_textcv= cv.fit_transform(post_text) df=dataframe(post_textcv.a, columns=cv.get_feature_names()) print(df.head)

Comments