A few tweaks from Paul's work on his real dataset by remlapmot · Pull Request #160 · CausalInference/SEQTaRget

remlapmot · 2026-05-29T10:45:33Z

This,

Speed up the hazard ratio calculation by fitting the Cox model with the survival C fitters directly on a prebuilt design matrix instead of coxph(formula, data), avoiding the model.frame/model.matrix rebuild on every bootstrap iteration: survival::coxph.fit() for the non-competing-event model and survival::agreg.fit() for the competing-event Fine-Gray (counting-process) model. The hazard ratio and CIs are unchanged.
Fix the competing-event Fine-Gray hazard fit to use the finegray() case weights (fgwt), which are required for a valid subdistribution-hazard estimate and were previously omitted. This is a no-op for the current hazard simulation (which has only administrative censoring, so all fgwt are 1) but corrects the estimate should the simulated data ever carry random censoring.
Report competing events per treatment arm in the @info slot as info$compevent.unique and info$compevent.nonunique, mirroring the structure of info$outcome.unique / info$outcome.nonunique. Both are grouped by baseline treatment; the non-unique table counts all competing event occurrences in the expanded data and the unique table counts distinct subjects who experienced the competing event. Both are NA when no compevent is specified.

Fit the non-competing-event univariate Cox model in the hazard step with survival::coxph.fit() on a prebuilt one-column design matrix instead of coxph(Surv(followup, event == 1) ~ get(tx_bas), data). The formula interface rebuilds the model.frame and model.matrix on every call, and handler() is run once for the full fit plus once per bootstrap replicate, so that overhead was paid on every iteration. On a 200k-row univariate fit with the heavy ties that integer follow-up produces, the formula path spends about 86% of its time in model.frame construction rather than the C fit, and the direct call is roughly 7x faster. The competing-event finegray path is unchanged. Results are identical: an ITT bootstrap hazard run (seed 42) agrees with the previous formula-based fit to ~1e-13 on the hazard ratio and CIs. coxph.fit uses method = "efron" and the default coxph.control() to match coxph()'s defaults. Add survival::coxph.fit and survival::coxph.control to the imports (roxygen @importFrom and NAMESPACE), a NEWS.md entry, and a test asserting coxph.fit and coxph(formula) give the same coefficient on tied data so a future survival change cannot silently break the equivalence.

Fit the competing-event Fine-Gray model with survival::agreg.fit() on a prebuilt design matrix instead of coxph(Surv(fgstart, fgstop, fgstatus) ~ get(tx_bas), data = hr.data). The counting-process formula fit rebuilds the model.frame and model.matrix on every call, and handler()

Pass the finegray() case weights (fgwt) to the competing-event subdistribution Cox fit. These inverse-probability-of-censoring weights are required for a valid Fine-Gray subdistribution-hazard estimate and had been omitted since the hazard function was first written (fgwt never appears anywhere in the git history). This is a no-op for the current hazard ratio: the model is fit on simulated data in which every subject-trial is followed across the full 0..followup.max grid until its first event, so the only censoring is administrative at a single time and all fgwt are exactly 1. Verified bit-for-bit in the pipeline - the competing-event bootstrap hazard (seed 123) is identical with weights = fgwt and with weights = NULL. The fix matters only if the simulated hazard data ever carries genuine random censoring (e.g. if IPCW/LTFU were folded into the simulation, or per-trial follow-up grids became non-uniform), in which case the unweighted estimate would be biased. Update the competing-event equivalence test to the weighted form (agreg.fit with weights = fgwt against coxph(formula, weights = fgwt)) so it validates the corrected call on data that does have random censoring, and add a NEWS.md entry.

Replace the simple table() counts with compevent.table(), a new helper mirroring outcome.table(), so competing event counts in @info are grouped by tx_init_bas rather than reported as a single total.

ryan-odea

Looks great!

remlapmot added 7 commits May 28, 2026 13:32

Break down @info$compevent.unique/nonunique by treatment arm

6b8e8bd

Replace the simple table() counts with compevent.table(), a new helper mirroring outcome.table(), so competing event counts in @info are grouped by tx_init_bas rather than reported as a single total.

Add compevent.unique and compevent.nonunique test

91d1fcb

Update NEWS.md

f89d79c

Bump version

b6f0231

remlapmot requested a review from ryan-odea May 29, 2026 10:45

ryan-odea approved these changes Jun 1, 2026

View reviewed changes

remlapmot merged commit 7a6e437 into main Jun 1, 2026
7 checks passed

remlapmot deleted the devel-2026-05-22-2 branch June 1, 2026 12:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A few tweaks from Paul's work on his real dataset#160

A few tweaks from Paul's work on his real dataset#160
remlapmot merged 7 commits into
mainfrom
devel-2026-05-22-2

remlapmot commented May 29, 2026

Uh oh!

ryan-odea left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

remlapmot commented May 29, 2026

Uh oh!

ryan-odea left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants