While the run_log()
, summary_log()
, and config_log()
are helpful for summarizing the model development process, model_tree()
provides a convenient way to visualize and track any of these tabulated parameters. You can create a tree diagram using a modeling directory or run log.
MODEL_DIR <- "../nonmem"
By default, models will be colored by their run number, and basic summary statistics will display as a tooltip when hovered over.
model_tree(MODEL_DIR)
If coloring by a logical column, FALSE
and TRUE
values will correspond to white and red coloring respectively. Numeric or character columns will be colored as a gradient. NA
values will appear grey regardless of the column type.
model_tree(MODEL_DIR, color_by = "star")
Specific columns can be added to the tooltip via the include_info
argument. Though you are limited to the default run_log()
columns when passing a model directory, you can pass any available columns when passing a run log dataframe (must inherit the class bbi_log_df
). The examples below to illustrate cases where you may want to do that.
log_df <- run_log(MODEL_DIR)
log_df
#> # A tibble: 9 × 10
#> absolute_model_path run yaml_md5 model_type description bbi_args
#> <chr> <chr> <chr> <chr> <chr> <list>
#> 1 /tmp/RtmpN3mqYr/temp_libpa… 1 6ccf206… nonmem original a… <named list>
#> 2 /tmp/RtmpN3mqYr/temp_libpa… 2 b5f8010… nonmem NA <named list>
#> 3 /tmp/RtmpN3mqYr/temp_libpa… 3 99d902b… nonmem NA <named list>
#> 4 /tmp/RtmpN3mqYr/temp_libpa… 4 5161777… nonmem NA <named list>
#> 5 /tmp/RtmpN3mqYr/temp_libpa… 5 690952d… nonmem NA <named list>
#> 6 /tmp/RtmpN3mqYr/temp_libpa… 6-bo… 9d66680… nmboot NA <named list>
#> 7 /tmp/RtmpN3mqYr/temp_libpa… 6 324dd95… nonmem final model <named list>
#> 8 /tmp/RtmpN3mqYr/temp_libpa… 7 661f9f1… nonmem NA <named list>
#> 9 /tmp/RtmpN3mqYr/temp_libpa… 8 994cb9f… nonmem NA <named list>
#> # ℹ 4 more variables: based_on <list>, tags <list>, notes <list>, star <lgl>
In this example we define a new column, out_of_date
, to denote whether the model or data has changed since the last run. We can color by this new column to determine if any of the models need to be re-run:
log_df %>% add_config() %>%
dplyr::mutate(out_of_date = model_has_changed | data_has_changed) %>%
model_tree(
include_info = c("model_has_changed", "data_has_changed"),
color_by = "out_of_date"
)
The model tree can also be helpful for quickly determine if any heuristics were found during any model submissions, as well as displaying specific model summary output in the tooltip.
log_df %>% add_summary() %>%
model_tree(
include_info = c("tags", "param_count", "eta_pval_significant"),
color_by = "any_heuristics"
)
Controlling the node size can be helpful for quickly determining the trend of a particular numeric column. Here, we use color_by
and size_by
to show the objective function value decreasing with each new model.
color_by
argument, only columns included in log_df
can be passed. We have to call add_summary()
to use "ofv"
even though this is included in the tooltip when add_summary = TRUE
.size_by
is not specified, the nodes are sized based on how many other models/nodes stem from it (i.e. the “final model” will be smaller than the “base model”).
log_df %>% add_summary() %>%
model_tree(color_by = "ofv", size_by = "ofv")