Events

Past Event

Data Club: Extending pandas with Dask

February 23, 2023
4:30 PM - 6:00 PM
America/New_York
International Affairs Building, 420 W. 118 St., New York, NY 10027 215

Room 215

Extending pandas with Dask: Do you have too much data to process locally? Do you want tools to probe datasets bigger than your laptop can handle? This workshop, the second in a two-part series on pandas, will introduce participants to Dask Dataframe, a Python library that parallelizes pandas efficient computation and lazy processing. We will explore how Dask Dataframes operates, and use it to reduce large datasets to manageable size. No prior programming experience is required.

Topics covered: pandas, dask

 

Data Club meetups typically occur twice-monthly, on Thursdays, throughout the semester. Open to everyone in the Columbia community, these informal events will start with a presentation on a specific use case for Python, R, Julia, or JavaScript, then open up to questions, collaborative work, and discussion. Computation typically occurs within a Jupyter/Colab workflow, and participants of all skill levels are welcome.

Contact Information

Moacir P. de Sa Pereira