Class 5 Notes

Page history last edited by Alan Liu 6 years, 1 month ago

 

Preliminary Business

 

Today's topic: Computational Text Analysis (Topic Modeling)

 

Today's readings: (red links are to the professor's annotated versions)

 

 

 

 

Building Blocks of Text Analysis

 

 

  1. Counting (frequency)
    AntConc Prelude word list

  2. Co-occurrence (collocation)
    AntConc Prelude bigram "I"

  3. Clustering
    Lexos Prelude hierarchical dendrogram clustering
    Lexos Prelude k-means Voronoi space clustering

  4. Comparison with a reference corpus (as "corpus" is understood in the field of corpus linguistics)
  5. Other important supporting or complementary methods of text analysis:
    • Parts-of-speech analysis (POS)
    • Named entity recognition (NER)
    • Sentiment analysis
  6. Visualization
  7. Current leading-edge methods of text analysis that build on top of the lower-level "building block" methods above:
    • topic modeling
    • word embedding 
    • social network analysis
    • GIS mapping 
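
The first two building blocks above (counting and co-occurrence) can be sketched in a few lines of Python. This is only an illustrative sketch, not how AntConc or Lexos work internally. The function names (`tokenize`, `word_frequencies`, `words_following`) and the sample sentence are made up for this example, and the tokenizer is deliberately crude.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and pull out runs of letters/apostrophes as tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(tokens):
    """Building block 1: count how often each token occurs."""
    return Counter(tokens)


def words_following(tokens, target):
    """Building block 2: count the words that immediately follow `target`,
    i.e. the bigrams whose first word is `target`."""
    return Counter(
        nxt for cur, nxt in zip(tokens, tokens[1:]) if cur == target
    )


# Invented sample text (an assumption for illustration, not a quotation).
sample = "I wandered and I saw; I saw the hills, and the hills saw me."

tokens = tokenize(sample)
freq = word_frequencies(tokens)
after_i = words_following(tokens, "i")

print(freq.most_common(3))    # the three most frequent tokens
print(after_i.most_common())  # words that follow "i", most frequent first
```

A word list like the one AntConc produces is essentially `freq` sorted by count, and a bigram search on "I" corresponds roughly to `after_i`. Clustering and reference-corpus comparison build on the same counts, computed per document and then compared across documents.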

 

 

 

 

Topic Modeling

 

 

 

 

Class Topic Modeling Practicums

 

 

 

 

 

 

 

 

 

 

 

 
