[HTML][HTML] Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers

S Xu, M Chen, T Feng, L Zhan, L Zhou, G Yu - Frontiers in Genetics, 2021 - frontiersin.org
S Xu, M Chen, T Feng, L Zhan, L Zhou, G Yu
Frontiers in Genetics, 2021frontiersin.org
With the rapid increase of large-scale datasets, biomedical data visualization is facing
challenges. The data may be large, have different orders of magnitude, contain extreme
values, and the data distribution is not clear. Here we present an R package ggbreak that
allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting
area to deal with large datasets (especially for long sequential data), data with different
magnitudes, and contain outliers. The ggbreak package increases the available visual …
With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak).
Frontiers