--- title: "DecisionDrift: Auditing Temporal Drift in Repeated Decision Systems" author: "Subir Hait" date: "`r Sys.Date()`" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{DecisionDrift: Auditing Temporal Drift in Repeated Decision Systems} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r setup, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>", fig.width = 7, fig.height = 4 ) library(DecisionDrift) set.seed(42) ``` ## Overview `DecisionDrift` provides tools for detecting, decomposing, and stress-testing **temporal drift** in repeated binary decision systems. It is designed for settings in which an institutional or algorithmic system issues a binary decision — approve or deny, flag or pass, intervene or not — repeatedly over time for the same or overlapping set of units. The package does not estimate causal effects and does not require full specification of the underlying decision rule. Instead, it treats the decision system as a **time-indexed stochastic process** and asks whether that process changed over time — and if so, how, when, and for whom. --- ## Formal framework ### 1. The decision system as a stochastic process Let: - $i = 1, \ldots, N$: units (individuals, cases, applications) - $t = 1, \ldots, T$: time points (waves, periods, rounds) - $D_{it} \in \{0, 1\}$: observed binary decision at time $t$ for unit $i$ - $G_i$: group membership (optional) The decision system is defined as the stochastic process: $$S = \{ D_{it} : i = 1, \ldots, N;\ t = 1, \ldots, T \}$$ Critically, $S$ is treated as a **generator of decisions** over time, not as a parametric model. This makes the framework applicable to hybrid human--AI systems where the decision rule $f_t$ is only partially observed. ### 2. The human--AI decision mechanism In many applied settings, decisions are produced by a joint human--AI process: $$D_{it} = f_t(X_{it},\ H_{it})$$ where $X_{it}$ are inputs (features, signals) available to the system, $H_{it}$ represents human influence (override, discretion, intervention), and $f_t$ is the time-varying decision mechanism emerging from the interaction of algorithmic and human components. Drift in this system means $f_t \neq f_{t'}$, or equivalently: $$\Pr(D_{it} = 1 \mid X_{it}, H_{it}) \text{ changes over time}$$ `DecisionDrift` audits changes in $f_t$ **without requiring its full specification**. This is the core methodological contribution. ### 3. Definition of temporal drift **Marginal drift** (level shift): a change in the marginal distribution of decisions across waves. $$\text{Drift exists if } \exists\ t_1 \neq t_2 \text{ such that } P_{t_1} \neq P_{t_2}$$ where $P_t(d) = \Pr(D_{it} = d)$. **Transition drift** (dynamic instability): a change in the conditional distribution given the prior decision. $$\text{Transition drift exists if } P_t(d_t \mid d_{t-1}) \text{ varies with } t$$ The distinction matters. A system can have **stable prevalence but unstable dynamics** (transition drift without marginal drift), or **shifting prevalence with stable dynamics** (marginal drift without transition drift). `DecisionDrift` separates these two phenomena. --- ## The four drift indices ### Decision Drift Index (DDI) The DDI estimates the linear trend in wave-level decision prevalence: $$p_t = \frac{1}{N} \sum_{i=1}^{N} D_{it}$$ $$\text{DDI} = \frac{\hat{\beta}}{\text{SD}(p_t)}$$ where $\hat{\beta}$ is the OLS slope of $p_t$ on $t$. Standardising by $\text{SD}(p_t)$ makes the DDI scale-invariant and comparable across datasets. **Properties:** - $\text{DDI} > 0$: system became more permissive over time - $\text{DDI} < 0$: system became more restrictive over time - $\text{DDI} = 0$: no linear trend in decision rate - Independent of $N$; stable under random reordering of units - Does not capture nonlinearity or abrupt shifts — this motivates the change-point module ### Transition Drift Index (TDI) Define the time-indexed transition probabilities: $$\pi_t^{ab} = \Pr(D_{it} = b \mid D_{i,t-1} = a),\quad a, b \in \{0, 1\}$$ The TDI measures average absolute change in persistence and activation probabilities: $$\text{TDI} = \frac{1}{T-1} \sum_{t=2}^{T} \frac{|\pi_t^{11} - \pi_{t-1}^{11}| + |\pi_t^{01} - \pi_{t-1}^{01}|}{2}$$ **Properties:** - $\text{TDI} = 0$ under stationarity (constant transition probabilities) - Sensitive to policy or model updates that alter switching dynamics - Independent of marginal prevalence — captures dynamics, not just levels - Captures change in the **decision rule itself**, not just its outputs ### Group Differential Drift (GDD) Let $\text{DDI}^{(g)}$ denote the DDI estimated within group $g$. For groups $A$ and $B$: $$\text{GDD}_{AB} = \text{DDI}^{(A)} - \text{DDI}^{(B)}$$ GDD measures **inequality in system evolution**, not static disparity. A positive $\text{GDD}_{AB}$ indicates the system became more permissive for group $A$ faster than for group $B$. A value near zero indicates parallel drift regardless of any baseline disparity. ### Cumulative Drift Burden (CDB) $$\text{CDB} = \sum_{t=2}^{T} |p_t - p_{t-1}|$$ CDB captures total accumulated volatility in the decision rate, regardless of direction. Unlike the DDI, CDB is non-zero even when drift oscillates (e.g., the system alternates between permissive and restrictive periods). A high CDB with a near-zero DDI indicates instability without a directional trend. --- ## Modules The five core modules each estimate one aspect of drift: | Module | Stochastic object estimated | Index | |---|---|---| | `dd_prevalence()` | $\Pr(D_t = 1)$ over time | DDI | | `dd_transition()` | $\Pr(D_t \mid D_{t-1})$ over time | TDI | | `dd_entropy_trend()` | Complexity of path distribution | --- | | `dd_group_drift()` | Heterogeneity in $\Pr(D_t)$ across $G$ | GDD | | `dd_changepoint()` | Structural break in process | --- | Together they answer: **Did the system change?** The robustness and sensitivity modules then ask: **Is that conclusion credible?** --- ## Worked example ```{r data} # Simulate a system that became more permissive after wave 4. # Group A drifts faster than Group B. n_units <- 80 n_waves <- 8 probs_A <- c(rep(0.20, 4), rep(0.60, 4)) # sharp increase at wave 5 probs_B <- c(rep(0.25, 4), rep(0.40, 4)) # modest increase at wave 5 dat <- data.frame( id = rep(seq_len(n_units), each = n_waves), time = rep(seq_len(n_waves), times = n_units), decision = c( rbinom(40 * n_waves, 1, rep(probs_A, 40)), rbinom(40 * n_waves, 1, rep(probs_B, 40)) ), group = rep(c("A", "B"), each = 40 * n_waves) ) dp <- dd_build(dat, id, time, decision, group = group, event_time = 5L) print(dp) ``` ### Module 1 — Prevalence drift ```{r prevalence} prev <- dd_prevalence(dp) print(prev) plot(prev) ``` The DDI is positive and the slope is statistically significant, confirming that the overall decision rate increased over time. ### Module 2 — Transition drift ```{r transition} tr <- dd_transition(dp) print(tr) plot(tr) ``` The TDI captures the instability in switching and persistence dynamics that accompanies the structural break at wave 5. ### Module 3 — Entropy and stability trend ```{r entropy} et <- dd_entropy_trend(dp, window = 3L) print(et) plot(et) ``` ### Module 4 — Group-differential drift ```{r group} gd <- dd_group_drift(dp) print(gd) plot(gd) ``` Group A experienced faster drift than Group B (positive GDD). The gap trajectory shows divergence over time. ### Module 5 — Change-point detection ```{r changepoint} cp <- dd_changepoint(dp) print(cp) plot(cp) ``` Both the CUSUM and segmented regression methods identify a structural break near wave 5, consistent with the known event time. ### All four indices in one call ```{r indices} idx <- dd_indices(dp) print(idx) ``` ### Full audit ```{r audit} aud <- dd_audit(dp, include_robustness = TRUE, verbose = FALSE) print(aud) ``` --- ## Robustness and sensitivity A key claim of the `DecisionDrift` framework is that observed drift is not an artefact of analytic choices or data problems. The robustness and sensitivity modules test this claim directly. **Robustness** (`dd_robustness()`) tests whether the DDI sign and magnitude are stable across: - Balanced vs. unbalanced panels - Leave-one-period-out (LOPO) - Leave-one-group-out (LOGO) - Alternative minimum follow-up requirements A **fragility index** summarises the proportion of analytic variants in which the slope changes sign. Values near zero indicate robust conclusions. **Sensitivity** (`dd_sensitivity()`) probes vulnerability to plausible data problems: - Decision miscoding - Missing-wave attrition - Threshold shifts (when decisions derive from scores) - Subgroup composition shifts A **tipping-point estimate** identifies the smallest perturbation level at which more than 50 per cent of resampled conclusions flip sign. ```{r robustness} rob <- dd_robustness(dp, variants = c("lopo", "min_waves")) print(rob) plot(rob) ``` --- ## Conceptual positioning `DecisionDrift` is designed to occupy a specific and currently underserved position in the audit toolbox: > We define temporal drift as a change in the decision-generating process > over time and introduce a set of summary indices that quantify level > shifts, dynamic instability, subgroup divergence, and cumulative change. > By implicitly treating decision systems as evolving stochastic processes, > the framework enables longitudinal auditing of hybrid human--AI systems > without requiring full specification of the underlying decision rule. This positions the package as **semi-parametric auditing of dynamic systems** — distinct from full Markov modelling, time-series estimation, and cross-sectional fairness testing. --- ## Relationship to `decisionpaths` `decisionpaths` constructs and summarises longitudinal decision paths, answering *what happened*. `DecisionDrift` tests whether the system that generated those paths changed over time, answering *did the system change*. A third package in this ecosystem, `AIBias`, will ask whether inequality accumulated or amplified over repeated decisions. --- ## References Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. *Psychometrika*, 16(3), 297--334. Hait, S. (2025). Artificial intelligence as decision infrastructure: Rethinking institutional decision processes. Preprint. Shannon, C. E. (1948). A mathematical theory of communication. *Bell System Technical Journal*, 27(3), 379--423.