Room 215
Intermediate Textual Data Analysis: Get deeper into the language of your corpus.
Topics covered: spaCy, TextaCy, topic modeling, word2vec
As we investigate our textual data in more detail, the techniques for analyzing such unstructured data rely on new libraries and models provided by machine learning. In this workshop, we’ll look to the cutting edge of contemporary Python text analysis libraries to learn how to mobilize their potential.
Data Club meetups typically occur twice-monthly, on Thursdays, throughout the semester. Open to everyone in the Columbia community, these informal events will start with a presentation on a specific use case for Python, R, Julia, or JavaScript, then open up to questions, collaborative work, and discussion. Computation typically occurs within a Jupyter/Colab workflow, and participants of all skill levels are welcome.