Predictive Analytics II: Text, Web, and Social Media Analytics - Text Mining Process

4 important questions on Predictive Analytics II: Text, Web, and Social Media Analytics - Text Mining Process

What are the main steps in the text mining process?

1) Establish the Corpus
2) Create the Term-Document Matrix
3) Extract Knowledge

What is the reasons for normalizing word frequencies? What are the common methods for normalizing word frequencies?

To have a more consistent TDM for further analysis.
Common methods for normalizing: log frequencies, binary frequencies, inverse document frequencies

What is SVD? How is it used in text mining?

SVD = Singular value decomposition.
it reduces the overall dimensionality of the input matrix. Goal: have a more manageable matrix
  • Higher grades + faster learning
  • Never study anything twice
  • 100% sure, 100% understanding
Discover Study Smart

What are the main knowledge extraction methods from corpus?

Classification
Clustering
Association
Trend Analysis

The question on the page originate from the summary of the following study material:

  • A unique study and practice tool
  • Never study anything twice again
  • Get the grades you hope for
  • 100% sure, 100% understanding
Remember faster, study better. Scientifically proven.
Trustpilot Logo