Data Analysis

Walk through a five-step workflow: import data, view descriptive statistics, check diagnostic plots, fit a statistical model, and run post-hoc comparisons.

1

Import Data

When the page loads, you see design selectors and a file import zone.

Empty analysis import state

Initial state with design selectors and import area.

Design and Type Selectors

FieldDescriptionDefault
Trial DesignCrossover, Parallel, Basket, N-of-1, UmbrellaCrossover
Data TypeCross-sectional, Longitudinal (crossover/parallel only)
Outcome TypeContinuous, BinaryContinuous

Loading Data

You can import a CSV via drag-and-drop, or choose from built-in example datasets.

Example dataset selection

Built-in example datasets including COPD crossover, arterial studies, and more.

After Import

Once data is loaded, a green badge confirms success with row/column counts and a data preview table.

Data loaded successfully

Green badge confirms successful import with data preview.

Column Mapping

Map your data columns to the required roles: Response (Outcome), Treatment, Subject ID, and Period. Crossover data may also map Carryover; umbrella data maps Biomarker.

Column mapping dropdowns

Map your CSV columns to the required roles.

2

Descriptive Statistics

Automatically computed when you enter this step. Includes participant flow, baseline characteristics, and outcome summary tables.

Descriptive statistics tables

Participant flow, baseline characteristics, and outcome summary.

3

Diagnostic Plots

Select which plot types to generate: cross-sectional, longitudinal, profiles, or period. Plots are rendered as images from the R backend.

Diagnostic plots

Visual diagnostics for model assumptions.

Reading residual plots

Look for points scattered randomly (good) in residual plots. Patterns, funnels, or clusters may indicate model assumption violations.
4

Fit Model

Configuration

FieldDescriptionDefault
Model TypeGLMM (recommended), GEEGLMM
CarryoverYes / No (crossover designs)No
Model configuration form

Select model type and options before fitting.

Results

After clicking Run Model, GLMM and GEE fits show coefficients and fit statistics. Basket models show posterior summaries; substudy-specific umbrella models show per-substudy drug-vs-control contrasts.

Model results

Coefficients table and model fit statistics.

5

Post-hoc Contrasts

Configuration

FieldDescriptionDefault
FactorModel terms dropdown
Contrast TypePairwise, Treatment vs Control, Custom ContrastsPairwise
P-Value AdjustmentNone, Bonferroni, Holm, TukeyNone
SideTwo-sided, Greater, LessTwo-sided
Post-hoc configuration

Configure contrast type and p-value adjustment.

Results

After clicking Run Post-hoc: estimated marginal means, contrasts table with p-values and confidence intervals, and EMM/contrast plots from R.

Post-hoc results

Estimated marginal means, contrasts, and plots.

P-value adjustment

When making multiple comparisons, use Bonferroni (simple), Holm (slightly more powerful), or Tukey (designed for pairwise). Use “None” only for a single pre-specified comparison.

Concepts

Data Preparation

Import a CSV file with one row per observation. The platform expects columns for the response variable, treatment group, subject ID, and (for crossover/longitudinal designs) a period column. Built-in example datasets let you explore the workflow before importing your own data.

Statistical Models

Toggle between GLMM and GEE to see subject-specific regression lines vs a single population-average line.

GLMM
Generalized Linear Mixed Model - accounts for both fixed effects (treatment, period) and random effects (patient-level variation). Recommended for most trial designs.
GEE
Generalized Estimating Equations - models population-average effects. More robust to misspecification of the correlation structure, but does not provide subject-level predictions.

Diagnostics

Select a pattern to see what good and problematic residuals look like in both plot types.

Diagnostic plots help you check whether model assumptions hold. Residual vs. fitted plots should show random scatter (no funnels or curves). QQ plots should follow the diagonal line. Violations may indicate the need for a different model or data transformation.

Post-hoc Comparisons

Adjust group means and see which comparisons reach significance under different correction methods.

Pairwise
Compares all pairs of treatment groups. Best when you have no pre-specified reference group.
Control
Compares each treatment against a selected reference control. This is more focused than all pairwise comparisons.

P-value adjustment (Bonferroni, Holm, Tukey) controls the family-wise error rate when making multiple comparisons.

Basket Models

Compare independent, BHM, and EXNEX borrowing behavior. EXNEX is available for binary outcomes only.

BHM
Bayesian Hierarchical Model - borrows strength across baskets, improving power when effects are similar.
IND
Independent model - fits each basket without borrowing information across baskets.
EXNEX
Exchangeability/non-exchangeability model - allows each basket to be either exchangeable or independent. It is currently available for binary outcomes only.

Bayesian Basics

Widen the prior SD to see the posterior follow the data; narrow it to see the prior dominate.

Bayesian methods combine prior beliefs with observed data to produce posterior distributions. In basket trial analyses, borrowing information across baskets can reduce standard errors when treatment effects are similar. Prior distributions are set automatically based on the selected borrowing method.