PyData NYC 2024

Jeroen Janssens

Jeroen Janssens, PhD, is a polyglot data science consultant and certified instructor. His expertise lies in visualizing data, implementing machine learning models, and building solutions using Python, R, JavaScript, and Bash. Jeroen is passionate about open source and sharing knowledge. He is the author of Data Science at the Command Line (O’Reilly, 2021) and is currently writing Python Polars: The Definitive Guide (O’Reilly, 2025). Every now and then he blogs at https://jeroenjanssens.com.

Sessions

11-06
15:10
90min
Turning DataFrames into Pretty Pictures with Plotnine
Michael Chow, Thijs Nieuwdorp, Jeroen Janssens

Learn how Plotnine, a Python package inspired by R's ggplot2, lets you create sophisticated and effective data visualizations with minimal effort. This tutorial explains how Plotnine's grammar-of-graphics approach provides a flexible, intuitive way to visualize data, whether as ad-hoc plots or as fine-tuned graphs suited for communication.

Music Box
11-08
10:55
40min
What we learned by converting a large codebase from Pandas to Polars
Thijs Nieuwdorp, Jeroen Janssens

In this talk, we'll share our experience of converting a substantial data processing codebase from Pandas to Polars. We'll discuss the motivations behind the switch, the challenges we faced during the transition, and the significant performance gains we observed. Attendees will gain insight into when and why Polars can be a superior choice for data processing tasks, especially in performance-critical applications.

Music Box