1 Introduction

Nested loop plots are a data visualisation for tabular data, which contains or is dependent on several key parameters. A basic example is data coming from controlled experiments, in which
measurments are performed under pre-specified conditions. The results of such an experimental study are then stored along with the pre-specified conditions that gave rise to them (the key parameters). Often, such data is then presented in form of a table to make the performed measurements comparable across the variety of conditions. However, if there are many different experimental constraints, the resulting table easily becomes unintelligible. In such cases, a visual display can aid in the interpretation of the complex experimental data.

In statistical research, the equivalent of controlled experiments are simulation studies, which will be the focus of this vignette. Rücker and Schwarzer (2014) suggested the use of nested loop plots to facilitate transparent reporting of the results from such studies. In the following we briefly motivate statistical simulation studies, explain the basic layout of nested loop plots and demonstrate the use of the looplot package to create such visualisations.

1.1 Simulation studies in statistical research

Simulation studies in statistics or other methodological research fields are controlled experiments used to assess the properties of algorithms, gauge the performance of statistical or machine learning models, and gain insights into complex phenomena which are not readily understood analytically. While simulation studies by nature are simplifying the “true” underlying mechanisms of interest, they are useful because they allow the complete specification of the “ground truth” to which comparisons can be made and the experimental conditions are fully under control of the experimenter. Such studies typically consist of three major components:

A data generating mechanism
The methods or models to be evaluated
Evaluation criteria to assess the methods after application to the generated data

All three together make up what we will call the simulation study design.

The data generating mechanism usually simulates (i.e. creates) data according to pre-defined parameters, e.g. the number of variables (i.e. columns of the data matrix) to be generated, their correlation structure or the number of observations (i.e. rows of the data matrix). These parameters will be called design parameters in the following, and the unique combinations of their values define specific simulation scenarios. Usually, the data generation involves some uncertainty or noise when creating the data, to mimic the uncertainty when data is obtained through measurements in the real world or sampled from a population. Therefore, each simulation scenario can be conducted repeatedly to remove the effect of sampling variability.

The methods or models to be evaluated in the study are then applied to the datasets generated by the data generating mechanism and the evaluation criteria are computed, e.g. some measure of deviation from the ground truth. These are subsequently averaged to obtain performance measures for each method for a given simulation scenario.

1.2 Nested loop plots

Nested loop plots are a data visualisation for tabular data which is dependent on several design parameters. Their visual appearance is similar to trellis plots. Rücker and Schwarzer (2014) suggested nested loop plots for displaying the results accross the different scenarios of a simulation study. The key idea is to display the design parameters along with the results for each specific scenario they define and to arrange the display in a meaningful way to facilitate the clear distinction of patterns within the results. As suggested in the original publication, design parameters are drawn as step functions below the results. This is especially useful when the experimenter is conducting the simulation study and is looking for interesting ways to summarize the results. The plots also provide a good high level overview of the study suitable for a publication.

A challenge when using nested loop plots is their complexity and information density. For these reasons it is also often necessary to use large figure sizes to accurately display all of the details appropriately. It is not straightforward to design a good looking nested loop plot with clear layout and easily readable display of the data. Manual tweaking of the display is often necessary. That is the reason why this package exists to streamline the creation of nested loop plots and making them more accessible.

In short, nested loop plots work well for many possible simulation study designs, in particular if:

The simulation study design has a structure which can be captured by a few design parameters (e.g. 4 to 6), i.e. a full or partial factorial design in which simulation scenarios for each unique combination of the values of the design parameters are conducted.
The results from the simulation study can be quantitatively described by numeric performance measures.
The number of design parameters (e.g. 4 to 6) and methods evaluated is moderate.

1.3 Workflow with the `looplot` package

This package is based on ggplot2 and offers several functions which aid in the process of creating nested loop plots with facetting.

The starting point for the looplot package is a dataframe comprising the results for a single performance measure from the simulation study, very similar to how the results could be presented in a table. In fact, one could think of nested loop plots as a more intuitive, visual way to present such tabular data. The dataframe should contain:

One or several columns for the design parameters, such that each row of the dataframe corresponds to a specific simulation scenario.
One column per method containing in each row the results averaged over all repetitions per simulation scenario.

We will give an example of such a dataframe in the examples below.

The user then defines the arrangement of the data within the nested loop plot and how the data should be displayed. There are two ways to work with the looplot package: one with basic options for users who are less familiar with the underlying ggplot2 framework, and the other for users who require modular access to specific parts of the plot to fine-tune the display. In either case, the resulting plot object is a ggplot2 object and can therefore be edited using the functions from that package. Furthermore, many of the options have similar names and behaviour as comparable options in ggplot2, so some familiarity with the concepts is helpful to make the most out of this package. We will give an overview of many of the available options by demonstrating several aspects of the looplot package in the following examples.

2 Basic data example

This vignette makes use of the tidyverse environment of packages and specifically requires the dplyr, purr and magrittr packages, besides the looplot package to be available. Please refer to the R environment used to create this vignette for detailed information.

We generate an artificial example dataframe representing the output of a potential, actual simulation study which features a fully factorial design with 4 design parameters (samplesize, param1, param2, param3). For each combination of the values of the parameters we generate some artificial data that depends on these values. Normally, these would correspond to the perfomance measures of the methods which are evaluated in the study (method1, method2, method3). The resulting input dataframe for the nested loop plots is shown below. It encodes in each line the simulation design parameters for one specific simulation scenario, as well as the (summarised) results for each of the methods of interest. Note that the design parameters do not need to be numeric, but can be ordinal or even categorical.

set.seed(14)
params = list(
    samplesize = c(100, 200, 500),
    param1 = c(1, 2), 
    param2 = c(1, 2, 3), 
    param3 = c(1, 2, 3, 4)
)

design = expand.grid(params)

# add some "results"
design %<>% 
    mutate(method1 = rnorm(n = n(),
                           mean = param1 * (param2 * param3 + 1000 / samplesize), 
                           sd = 2), 
           method2 = rnorm(n = n(),
                           mean = param1 * (param2 + param3 + 2000 / samplesize), 
                           sd = 2), 
           method3 = rnorm(n = n(),
                           mean = param1 * (param2 + param3 + 3000 / samplesize), 
                           sd = 2))

knitr::kable(head(design, n = 10))

samplesize	param1	param2	param3	method1	method2	method3
100	1	1	1	9.676300	22.984443	33.793024
200	1	1	1	9.437908	12.710978	17.636887
500	1	1	1	7.243334	6.225153	6.659465
100	2	1	1	24.994307	46.274059	62.983457
200	2	1	1	11.927719	22.866110	36.833710
500	2	1	1	8.463890	13.105232	14.311812
100	1	2	1	11.870238	22.974996	35.277654
200	1	2	1	9.137987	11.857579	18.394986
500	1	2	1	3.246069	5.631446	9.701592
100	2	2	1	26.086366	48.338618	65.184775

3 Layout of the nested loop plot

The key elements of the original nested loop plot as suggested in Rücker and Schwarzer (2014) are the display of the results of each individual method and simulation scenario, as well as the display of the design parameters by virtue of plotting step functions representing the values of the parameters. In the looplot package a lot of emphasis is given to the aspect of facetting, to further divide the plot into small panels and make the plot visually less cluttered. Furthermore, facetting also allows to easily zoom into specific design parameter combinations and only drawing panels for specific parameter combinations. Each panel then stays the same, so the results can still be easily compared accross different panel configurations. Furthermore this package uses the idea of “smallest plottable units” (spu), which are defined as all results for simulation scenarios for which only a single design parameter changes (drawn on the x-axis of the plot), while all other design parameters remain at a fixed value. By separating such spus from each other, meaningful subgroups of the results can be visually identified and represented.

When designing a layout for a nested loop plot with facets it is often helpful to think of the context of the simulation study and the intended meaning of the simulation parameters. Do some of them have a natural ordering? Are some of them encoding situations with largely different results? Which of the parameters should define the facet grid and which are merely displayed in each panel as step function?

In our artificial dataset we have the parameter samplesize which has a natural order and is well suited to define an x-axis along which results can be arranged.

The design is heavily influenced by param1, which only has two categories and therefore ideally defines either the columns or the rows of the facetted plot.

The role of the other parameters is less clear and their influence on the display of the results can be easily changed in the looplot functions to interactively get a good understanding of the study results.

4 Basic plot examples

The basic plot interface of the package is a single function with a lot of parameters. This might be very intimidating, so this vignette goes through a lot of them step by step.

The most important settings are to define the x-axis of the plot, the facet grid layout and adapt the basic parameters for plotting the parameter step functions to ensure a proper display. This is hard to automate, so manual tweaking is required to get a nice visualisation. Usually this can be achieved with the following parameters:

We define the layout with the x, grid_rows, grid_cols and steps arguments by passing column names of the dataframe as strings.
We set the height of the parameter step functions in units of the y-axis to our liking using steps_y_height. We also define the top-most value of the step functions via steps_y_base.

Further details of the plot which often need to be taken care of:

We give proper names to the x- and y-axis via x_name and y_name.
We widen the gap between connected areas of the x-axis in x-axis units via spu_x_shift (spu stand for “smallest plottable” unit as outlined above).
We enable the drawing of the design parameter values alongside the parameter step functions via steps_values_annotate and adjust the font size of the annotations appropriately for the desired figure size via steps_annotation_size.
We add a horizontal line to indicate a target value of the performance measure on the y-axis (i.e. zero deviation from ground truth) via hline_intercept.
We adjust the lower display limits of the y-axis via y_expand_add. Note that this argument does not affect the y-axis data limits but expands the axis. For more details on this functionality, see the documentation of the adjust_ylim function.
We rotate the x-axis labels as they are overlapping due to the high information density. This is achieved via post-processing of the basic plot object using functionality from the ggplot2 package by adding custom theme arguments (add_custom_theme).

Many of the plot object manipulations could also be achieved by directly manipulating the resulting plot object and are incorporated into the nested_loop_plot function mainly for convenience or users with little familiarity of the ggplot2 framework.

p = nested_loop_plot(resdf = design %>% rename(`beta[LY]` = param1), 
                     x = "samplesize", steps = "param3",
                     grid_rows = "param2", grid_cols = "beta[LY]", 
                     steps_y_base = -10, steps_y_height = 5, 
                     x_name = "Samplesize", y_name = "Error",
                     spu_x_shift = 75,
                     steps_values_annotate = TRUE, steps_annotation_size = 2.5, 
                     hline_intercept = 0, 
                     y_expand_add = c(10, NULL), 
                     post_processing = list(
                         add_custom_theme = list(
                             axis.text.x = element_text(angle = -90, 
                                                        vjust = 0.5, 
                                                        size = 8) 
                         )
                     ))
print(p)

Once presented with such a plot, several patterns clearly emerge from the picture:

The similarity accross all settings is visually striking, indicating that there is no “breakdown” scenario in which one or several of the methods behave very differently. Looking for such outlying results is a good way to spot errors in the simulation or really interesting special cases.
With increasing samplesize, the error decreases, but does not reach zero.
With increasing samplesize, the methods become more similar.
Design parameter param1 has a strong influence on the data - if it has value 2, then the error generally increases compared to scenario with value 1.
With increasing param3 there seems to be a slight trend towards higher error.

While this is only an artificial example, detecting such patterns is one of the goals of a well designed simulation study and could benefit from display with nested loop plots.

4.1 Changing the layout

The layout of the plots can be easily changed as shown in the following examples. Most of these require manual tweaking of the parameter step functions.

4.1.1 Only rows

p = nested_loop_plot(resdf = design, 
                     x = "samplesize", steps = c("param2", "param3"),
                     grid_rows = "param1", 
                     steps_y_base = -10, steps_y_height = 3, steps_y_shift = 3,
                     x_name = "Samplesize", y_name = "Error",
                     spu_x_shift = 75,
                     steps_values_annotate = TRUE, steps_annotation_size = 2.5, 
                     hline_intercept = 0, 
                     y_expand_add = c(10, NULL), 
                     post_processing = list(
                         add_custom_theme = list(
                             axis.text.x = element_text(angle = -90, 
                                                        vjust = 0.5, 
                                                        size = 8) 
                         )
                     ))
print(p)

4.1.2 Only columns

p = nested_loop_plot(resdf = design, 
                     x = "samplesize", steps = c("param2", "param3"),
                     grid_cols = "param1", 
                     steps_y_base = -5, steps_y_height = 1, steps_y_shift = 3,
                     x_name = "Samplesize", y_name = "Error",
                     spu_x_shift = 75,
                     steps_values_annotate = TRUE, steps_annotation_size = 2.5, 
                     hline_intercept = 0, 
                     y_expand_add = c(10, NULL), 
                     post_processing = list(
                         add_custom_theme = list(
                             axis.text.x = element_text(angle = -90, 
                                                        vjust = 0.5, 
                                                        size = 8) 
                         )
                     ))
print(p)

4.1.3 “Classic” nested loop plots

These do not use facetting but only draw the design parameters as step functions. Furthermore, in the original publication of Rücker and Schwarzer (2014), the authors use step functions to draw the results as well.

The advantage of not using facetting is that the steps are drawn only once, wasting less screen space to repeated information. A disadvantage is the lack of visual distinction between the blocks of the data, possibly making it harder to spot patterns.

p = nested_loop_plot(resdf = design, 
                     x = "samplesize", steps = c("param1", "param2", "param3"),
                     steps_y_base = -5, steps_y_height = 1, steps_y_shift = 3,
                     x_name = "Samplesize", y_name = "Error",
                     spu_x_shift = 200,
                     steps_values_annotate = TRUE, steps_annotation_size = 2.5, 
                     hline_intercept = 0, 
                     y_expand_add = c(10, NULL), 
                     post_processing = list(
                         add_custom_theme = list(
                             axis.text.x = element_text(angle = -90, 
                                                        vjust = 0.5, 
                                                        size = 5) 
                         )
                     ))
print(p)

4.1.4 Zooming

By subsetting the data, one can easily “zoom” into specific panels of the nested loop plot. We use dplyr::filter to subset the data in the following example.

design_subset = design %>% 
    filter(param1 == 1)
p = nested_loop_plot(resdf = design_subset, 
                     x = "samplesize", steps = c("param2", "param3"),
                     grid_rows = "param1", 
                     steps_y_base = -3, steps_y_height = 1, steps_y_shift = 1,
                     x_name = "Samplesize", y_name = "Error",
                     spu_x_shift = 75,
                     steps_values_annotate = TRUE, steps_annotation_size = 2.5, 
                     hline_intercept = 0, 
                     y_expand_add = c(5, NULL), 
                     post_processing = list(
                         add_custom_theme = list(
                             axis.text.x = element_text(angle = -90, 
                                                        vjust = 0.5, 
                                                        size = 8) 
                         )
                     ))
print(p)

5 Gallery and advanced functionality

Further options and advanced usage of this package are demonstrated in the vignette “Annotated gallery of examples”.

6 R session information

## R version 4.4.2 (2024-10-31 ucrt)
## Platform: x86_64-w64-mingw32/x64
## Running under: Windows 10 x64 (build 17763)
## 
## Matrix products: default
## 
## 
## locale:
## [1] LC_COLLATE=C                    LC_CTYPE=German_Austria.1252   
## [3] LC_MONETARY=German_Austria.1252 LC_NUMERIC=C                   
## [5] LC_TIME=German_Austria.1252    
## 
## time zone: Europe/Vienna
## tzcode source: internal
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] looplot_0.5.0.9003 ggplot2_4.0.1      purrr_1.0.2        magrittr_2.0.3    
## [5] dplyr_1.1.4       
## 
## loaded via a namespace (and not attached):
##  [1] gtable_0.3.6       jsonlite_1.8.8     highr_0.10         compiler_4.4.2    
##  [5] Rcpp_1.0.12        tidyselect_1.2.1   stringr_1.5.1      jquerylib_0.1.4   
##  [9] scales_1.4.0       yaml_2.3.9         fastmap_1.1.1      plyr_1.8.9        
## [13] R6_2.6.1           labeling_0.4.3     generics_0.1.3     knitr_1.45        
## [17] forcats_1.0.0      tibble_3.2.1       bslib_0.6.1        pillar_1.9.0      
## [21] RColorBrewer_1.1-3 rlang_1.1.6        utf8_1.2.4         stringi_1.8.3     
## [25] cachem_1.0.8       xfun_0.42          sass_0.4.9         S7_0.2.0          
## [29] viridisLite_0.4.2  cli_3.6.5          withr_3.0.2        digest_0.6.35     
## [33] grid_4.4.2         rstudioapi_0.15.0  lifecycle_1.0.4    vctrs_0.6.5       
## [37] evaluate_1.0.5     glue_1.8.0         farver_2.1.1       fansi_1.0.6       
## [41] reshape2_1.4.4     rmarkdown_2.30     tools_4.4.2        pkgconfig_2.0.3   
## [45] htmltools_0.5.8.1

References

Rücker, Gerta, and Guido Schwarzer. 2014. “Presenting Simulation Results in a Nested Loop Plot.” Journal Article. BMC Medical Research Methodology 14.

looplot: A package for creating nested loop plots

Michael Kammer

2025-12-02

1 Introduction

1.1 Simulation studies in statistical research

1.2 Nested loop plots

1.3 Workflow with the `looplot` package

2 Basic data example

3 Layout of the nested loop plot

4 Basic plot examples

4.1 Changing the layout

4.1.1 Only rows

4.1.2 Only columns

4.1.3 “Classic” nested loop plots

4.1.4 Zooming

5 Gallery and advanced functionality

6 R session information

References

looplot: A package for creating nested loop plots

Michael Kammer

2025-12-02

1 Introduction

1.1 Simulation studies in statistical research

1.2 Nested loop plots

1.3 Workflow with the looplot package

2 Basic data example

3 Layout of the nested loop plot

4 Basic plot examples

4.1 Changing the layout

4.1.1 Only rows

4.1.2 Only columns

4.1.3 “Classic” nested loop plots

4.1.4 Zooming

5 Gallery and advanced functionality

6 R session information

References

1.3 Workflow with the `looplot` package