DescrTab2 is a powerful package with vast customization
options. With this, unfortunately, comes code that has to deal with
quite a bit of special cases an exceptions. This document aims to
describe the flow of control of the DescrTab2 package, so
that future generations may continue development and successfully fix
potential bugs.
The user interfaces mostly with the descr function.
descr does all the calculations, i.e. the evaluation of the
summary statistics on the data (mean, sd, median, etc. for continuous
variables and counts for categorical variables) and the calculation of
statistical tests. For this descr calls the
descr_cat and descr_cont functions, which
evaluate the list of summary statistics on the data.
descr_cat then calls test_cat and
descr_cont calls test_cont, which calculate
appropriate statistical tests. A detailed description for the choice of
test can be read in the “Test choice” vignette.
descr returns a DescrList object, which is
basically a named list containing all calculation results and the
formatting options.
To turn a DescrList object into pretty output, the
object has to be passed to the print function. print is a
generic function. This means that if a DescrList object is
passed to print, the specialized
print.DescrList function will be invoked automatically.
Since the proper output format is highly document type dependent,
print.DescrList creates output in two steps. The first step
is independent of the output format: The creation of a
DescrPrintObj by calling the create_printObj
function.
In this function, proper formatting is applied to the results in the
DescrList and the formatted values are saved inside a
tibble. Formatting in this case means converting numbers to characters,
reducing the number of decimal digits, combining variables like “Q1” and
“Q3” into “Q1 - Q3”, formatting small p values to display as “<0.001”
and adding “%” values to categorical variables.
Somewhat of an exception is the case
printFormat="numeric". Here, numbers are not converted
characters and consequently very little formatting can be applied.
The formatting in create_printObj is done by iterating
over all variables in the DescrList object and creating an
appropriate sub-table by calling one of
create_numeric_subtable.cat_summary,
create_numeric_subtable.cont_summary,
create_character_subtable.cat_summary or
create_character_subtable.cont_summary. Whether
create_numeric_subtable or
create_character_subtable is called is determined by the
printFormat option (all options lead to
create_character_subtable except
printFormat="numeric). Whether .cat_summary or
.cont_summary is called depends on the type of variable.
The sub-tables are then concatenated to a master table.
The DescrPintObj is the transformed into appropriate
output format by calling one of print_tex,
print_html, print_word,
print_console or print_numeric.
print_console basically prints the tibble that is
produces by create_printObj using a slightly modified
version of the default method for printing tibbles.
print_numeric basically prints the tibble produces by
create_printObj if printFormat="numeric" was
specified.
print_tex and print_html use kableExtra to
convert the tibble from create_printObj into raw tex or
html output. Some special formatting has to be applied to these outputs
to accomodate for superscripts and to escape special LaTeX
characters.
print_word produces a flextable object from the tibble
returned by create_printObj. flextables play relatively
nicely with word.