Corpus compact #112
-
What does the st.CorpusFromPandas().compact(st.AssociationCompactor(2000)) do exactly? and how does it function? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
It accumulates the highest scoring words (most associated) for each category, as selected by a set frequency rank in all classes, until no more than 2000 (or whatever the specified number is) words are collected. By default, Scaled F-Score is used, but any |
Beta Was this translation helpful? Give feedback.
It accumulates the highest scoring words (most associated) for each category, as selected by a set frequency rank in all classes, until no more than 2000 (or whatever the specified number is) words are collected.
By default, Scaled F-Score is used, but any
TermScorer
such asRankDifference
,DeltaJSDivergence
, etc. could be used,