Workshop for PhD students and Postdocs

Python Basics for Data Analysis (DSC-2023-05) | 1,5 days

When?
26.09. - 29.09.2023

26 Sep.: 08:15 - 11:45AM
27. Sep.: no training
28 Sep.: 08:15 - 11:45AM
29 Sep.: 08:15 - 11:45AM

Where?
Online (Zoom)

Speaker:
Dr. Matthias Assenmacher

Language: English

The workshop is already fully booked.

Also, if you have any questions regarding our workshops, please feel free to write us an E-MAIL.

« Back

OBJECTIVES

The Python Basics course is intended for participants who want to learn basic Python skills as well as efficient handling of data preparation, data processing and data analysis in Python. In addition, general “best practices” in Python will be taught, including, writing simple, readable and modularly extensible code. All topics presented will be explained, demonstrated, and practiced in detail with the help of participant exercises under intensive instruction.

WORKSHOP CONTENT

Part 1: Introduction to Python

Introduction to the basics of Python
Installation and use of Python and useful Python modules
Creating and working with virtual environments
Explanation of the most important data types, operators, functions and help pages
Introduction to NumPy and Pandas
Importing and exporting data
Working with DataFrames and vectors (numeric, logical, character, factors), e.g. indexing, splitting and transforming variables or data sets
Calculating statistical ratios (e.g.: mean, quantiles, variance, etc.)

Part 2: Data Wrangling in Python

Review of Python basics: built-in structures, numpy, IPython, jupyter notebook, package management, jupytext
Series and DataFrames: generation, meaning of line index, filtering, pointer vs. copy
Data cleansing: Handling missing values, editing strings, removing duplicates
Transforming data by vectorized operations like map or apply
Merging different data sources and creating a "good" table structure of the data
Grouping of data and aggregations: Split-Apply-Combine

TARGET AUDIENCE

This course is suitable for participants with no knowledge of Python or to refresh the basics in Python. The workshop is very hands-on and thus limited to max. 15 participants.

TECHNICAL REQUIREMENTS

Use a laptop/PC with reliable internet access and install the following software:

Python (at least version 3.6)
Jupyter Notebook
Zoom https://zoom.us/download
Both Python and Jupyter Notebook are included e.g. in the software Anaconda https://www.anaconda.com/distribution
Make sure that you have sufficient permissions to install Python modules (e.g. via conda install)

ABOUT THE TRAINER

Dr. Matthias Assenmacher works as trainer for Data Science Essential GmbH and is postdoctoral researcher at the Chair of Statistical Learning and Data Science (LMU) and the NFDI Consortium for Business, Economic and Related Data (BERD@NFDI). He obtained his bachelor’s degree in Economics from LMU in 2014, afterwards I turned to Statistics (with a focus on social and economic studies) and obtained his Master’s degree in 2017 (also from LMU). In October 2021 he finished his PhD with a focus on Natural Language Processing.

His expertise revolves around the practical application of state-of-the-art NLP architectures to real-world problems from various disciplines, as well as open and reproducible science.

Python Basics for Data Analysis (DSC-2023-05) | 1,5 days

The workshop is already fully booked. Also, if you have any questions regarding our workshops, please feel free to write us an E-MAIL.

OBJECTIVES

WORKSHOP CONTENT

Part 1: Introduction to Python

Part 1: Introduction to Python

Part 2: Data Wrangling in Python

Part 2: Data Wrangling in Python

TARGET AUDIENCE

TECHNICAL REQUIREMENTS

ABOUT THE TRAINER

The workshop is already fully booked.

Also, if you have any questions regarding our workshops, please feel free to write us an E-MAIL.