python 3.x - Custom stemming and stopwords in sklearn CountVectorizer with a .txt dictionary
How can I apply custom stemming and custom stopwords, both loaded from a .txt dictionary, in sklearn's CountVectorizer?
This is how I currently use CountVectorizer:
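from pandas import DataFrame
from sklearn.feature_extraction.text import CountVectorizer

# post_text is an iterable of raw document strings
cv = CountVectorizer(token_pattern=r'(?u)\b\w+\b', min_df=1, max_df=1.0)
post_textcv = cv.fit_transform(post_text)
df = DataFrame(post_textcv.toarray(), columns=cv.get_feature_names_out())
print(df.head())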
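
One possible way to plug both dictionaries in is to pass a callable as the analyzer argument of CountVectorizer. The sketch below is only an illustration under several assumptions that are not in the question: the stopword file holds one word per line, the stemming dictionary holds one whitespace-separated "word stem" pair per line, and the helper names (load_stopwords, load_stem_map, make_analyzer, build_vectorizer) are hypothetical. Stopwords are removed before the stem lookup, so the stopword file is assumed to list surface forms rather than stems.

```python
import re
from pathlib import Path

# Same token pattern as in the question: keeps single-character tokens too.
TOKEN_RE = re.compile(r"(?u)\b\w+\b")


def load_stopwords(path):
    """Read stopwords from a text file (assumed layout: one word per line)."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return {line.strip().lower() for line in lines if line.strip()}


def load_stem_map(path):
    """Read a stemming dictionary (assumed layout: 'word stem' per line)."""
    stem_map = {}
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        parts = line.split()
        if len(parts) == 2:  # silently skip blank or malformed lines
            stem_map[parts[0].lower()] = parts[1].lower()
    return stem_map


def make_analyzer(stem_map, stopwords):
    """Return a callable that turns a raw document into a list of stems."""
    def analyze(doc):
        tokens = TOKEN_RE.findall(doc.lower())
        kept = (tok for tok in tokens if tok not in stopwords)
        # Words missing from the dictionary are kept unchanged.
        return [stem_map.get(tok, tok) for tok in kept]
    return analyze


def build_vectorizer(stem_path, stopword_path):
    """Wire the custom analyzer into CountVectorizer (hypothetical helper)."""
    # Imported here so the helpers above stay usable without scikit-learn.
    from sklearn.feature_extraction.text import CountVectorizer
    analyzer = make_analyzer(load_stem_map(stem_path),
                             load_stopwords(stopword_path))
    return CountVectorizer(analyzer=analyzer)
```

With this in place, something like cv = build_vectorizer("stems.txt", "stopwords.txt") followed by cv.fit_transform(post_text) should work as before; both file names are placeholders. When analyzer is a callable, CountVectorizer hands it the raw document, so token_pattern, stop_words and lowercase on the vectorizer no longer apply and the callable has to do that work itself, which is why the sketch lowercases and tokenizes on its own.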