Chapter 5: Data Visualization

Represent data and communicate information graphically.

1General information

2Getting started

3Visualizing amounts

4Visualizing distributions

5Visualizing multiple distributions

6Visualizing the relationship between numerical variables

7Visualizing time series

8ggplot2 extensions

9Data visualization: Exercises

About the course, the professor and the TAs

This course is intended to provide an introduction to data science using the R language. It will also provide students with notions of data management and manipulation, data analytics as well as of reproducible research, result-sharing and version control (using GitHub). At the end of the class, students should be able to automatically extract data from websites and create interactive web apps as well as dynamic reports to visualize and analyze them. Students should also be able to construct their own R packages, make them available on GitHub and document them using literate programming.

The material presented in this website was mainly developed by Stéphane Guerrier, Lionel Voirol and Yuming Zhang. Moreover, we thank Luca Insolia for his valuable contributions. Stéphane Guerrier is an Assistant Professor of Statistics and Data Science at the University of Geneva. Yuming Zhang and Lionel Voirol are PhD Candidates in Statistics at the University of Geneva. Luca Insolia is a postdoctoral researcher in Statistics at the University of Geneva.