Bidirectional Pruning-Regrowth

This folder contains code and experiment artifacts for both one-shot magnitude pruning, iterative magnitude pruning. We implement our regrowth strategy applying to models (e.g., resnet20, vgg16, alexnet) and evaluated on CIFAR-10 dataset.

The two main regrowth implementations are:

Reference-based regrowth (rl_regrowth_nas.py): RL-based NAS allocates regrowth per-layer; within each layer weights are selected using SSIM-based layer priority + reference masks/weights.
Saliency-based regrowth (rl_saliency_regrowth.py): RL-based NAS allocates regrowth per-layer; within each layer weights are selected via: $s_i \propto \left(\frac{\partial \mathcal{L}}{\partial \theta_i}\right)^2,\theta_i^2$.

A timing benchmark comparing both approaches is provided in benchmark_regrowth_methods.py and quick_benchmark.py.

What “regrowth” means in this repo

You start from a highly sparse (pruned) network with masks (PyTorch pruning reparameterization):

Each prunable module has
- weight_orig (trainable dense weights)
- weight_mask (binary mask, 1=active, 0=pruned)
- effective weight is weight = weight_orig * weight_mask

Regrowth updates the mask by turning some pruned connections back on:

pick $K$ indices among currently pruned weights (weight_mask == 0)
set weight_mask[idx] = 1
initialize the newly regrown weights (copy from reference, zeros, Kaiming, etc.)
optionally mini-finetune / finetune to recover accuracy

Regrowth is usually done to move from 98% sparsity → 97% sparsity (i.e., regrow 1% of all weights).

Folder Overview

Training / pruning

main.py: CIFAR10 pretraining + iterative pruning loop (saves pruned checkpoints)
utils/model_loader.py: constructs models (resnet20, vgg16, alexnet, …)
utils/data_loader.py: CIFAR-10 dataloaders
utils/analysis_utils.py: pruning reparam helpers, SSIM feature extraction, mask stats

Regrowth

rl_regrowth_nas.py: RL allocation + reference-mask selection
rl_saliency_regrowth.py: RL allocation + saliency-based selection

Metrics / analysis

utils/analysis_utils.py:
- count_pruned_params(model) — counts total vs surviving parameters
- SSIM utilities: BlockwiseFeatureExtractor, compute_block_ssim
utils/saliency_analysis.py: FairPrune-style saliency computation + plots
single_layer_regrowth_analysis.py: runs “all budget on one layer” experiments (tests SSIM ↔ improvement correlation)

Benchmarking

benchmark_regrowth_methods.py: end-to-end timing benchmark (SSIM vs saliency preprocessing, selection, mask update, mini-finetune)
quick_benchmark.py: one-episode benchmark wrapper

Inspecting / selective finetuning

inspect_checkpoint.py: inspection + finetune flows for saved checkpoints

Setup

Environment

Plase check requirements.txt for details, and you can use pip install -r requirements.txt to install all required packages.

Data

utils/data_loader.py uses data_dir='./data' by default, which will download the required dataset if it is not provided.

Workflow overview

Most experiments follow this pipeline:

Pretrain a dense model (or resume from ./{model}/checkpoint/ckpt.pth)
Prune + finetune to a target sparsity (e.g., 0.99)
Regrow some weights (e.g., 2% of total, adjustable) to a less sparse target (0.97)
Finetune selectively or fully to recover accuracy

Artifacts are usually saved under:

./{model_name}/checkpoint/ckpt.pth (dense / baseline)
./{model_name}/ckpt_after_prune/pruned_finetuned_mask_{sparsity}.pth (sparse checkpoint)
./rl_regrow_savedir/... or ./rl_saliency_regrow_savedir/... (RL training outputs)

1) Pretraining + Pruning

Pretraining and pruning are combined in main.py.

Key args (from main.py):

--m_name: resnet20, vgg16, alexnet, ...
--pruner: pruning method (passed into weight_pruner_loader(args.pruner))
--iter_start, --iter_end: pruning iterations
--max_epochs, --patience: early stopping

Example (prune to 99% sparsity checkpoint expected by benchmarks):

python main.py --m_name resnet20 --pruner magnitude --iter_end 1

Prune rate note (ICLR2021 LAMP codepath)

If you run pruning via the ICLR2021 implementation under iclr2021_solution/ (used for LAMP-style layerwise sparsity), the prune rate is not controlled by main.py.

Instead, the per-iteration prune amount is returned by:

iclr2021_solution/tools/modelloaders.py → model_and_opt_loader(...) → amount

In iclr2021_solution/iterate.py, this value is loaded as:

_, amount_per_it, batch_size, opt_pre, opt_post = model_and_opt_loader(args.model, DEVICE)

and then applied each iteration as:

pruner(model, amount_per_it)

Illustration (what to edit):

# iclr2021_solution/tools/modelloaders.py
elif model_string == 'resnet20':
  model = ResNet20().to(DEVICE)
  amount = 0.985  # <-- adjust this prune rate

Practical guidance:

Larger amount means more weights pruned per iteration.
Total pruning depends on both amount and the number of prune iterations you run in iterate.py via --iter_end.

Example run:

python iclr2021_solution/iterate.py --model resnet20 --pruner lamp --iter_end 10

This should produce:

./resnet20/ckpt_after_prune/pruned_finetuned_mask_0.99.pth

2) Regrowth methods

A. Reference-based regrowth (SSIM + reference masks)

Implemented in rl_regrowth_nas.py.

Core ideas:

Layer priority is computed once using SSIM between feature maps from:
- pretrained model (./{m_name}/checkpoint/ckpt.pth)
- pruned model (typically pruned_finetuned_mask_0.99.pth)
Lower SSIM → more feature drift → higher priority for regrowth.
Allocation: an LSTM controller samples an allocation ratio per target layer.
Selection: within each layer, select weights to regrow using reference masks/weights (e.g., from 0.95 sparse checkpoint).
(Mini)Finetune to evaluate reward.

B. Saliency-based regrowth

Implemented in rl_saliency_regrowth.py.

Core ideas:

RL controller allocates the regrowth budget across layers.
A SaliencyComputer estimates per-weight importance using:

$$\text{saliency}(\theta_i) \approx \left(\frac{\partial L}{\partial \theta_i}\right)^2 \cdot \theta_i^2$$

This is a Fisher/Hessian-diagonal approximation plus magnitude scaling.

SaliencyBasedRegrowth.apply_regrowth(...) selects the top-K pruned weights by saliency and updates the mask.
Newly regrown weights can be initialized via --init_strategy.

3) Growth metrics (what to report)

This repo contains several “metrics” used to analyze regrowth decisions and results.

A. Sparsity and regrowth budget

utils/analysis_utils.py exposes count_pruned_params(model) which reports:

total parameters in prunable layers
surviving (unmasked) parameters
pruned parameters

Common derived metrics:

Sparsity: $s = \frac{\text{pruned}}{\text{total}}$
Regrowth budget (global): $$K = (s_{\text{start}} - s_{\text{target}})\cdot N_{\text{total}}$$

(In single_layer_regrowth_analysis.py this is described as “global budget”.)

B. Layer capacity

When regrowing, each layer has a capacity:

capacity(layer) = number of currently pruned weights = sum(weight_mask == 0)

This bounds how many weights you can regrow in that layer.

C. SSIM-based “growth metric” / priority

In the reference-based RL approach, each layer gets an SSIM score computed from features.

Feature extraction: BlockwiseFeatureExtractor (in utils/analysis_utils.py)
Similarity: compute_block_ssim(features_pretrained, features_pruned)

Interpretation:

Lower SSIM ⇒ larger feature drift from baseline ⇒ layer is more damaged by pruning ⇒ regrowing there is more likely to help.

single_layer_regrowth_analysis.py is designed to measure correlation between:

SSIM(layer)
accuracy improvement when regrowing only that layer

D. Saliency-based importance

Two code paths exist:

rl_saliency_regrowth.py uses SaliencyComputer (RigL-style accumulated gradients) with FairPrune formula.
utils/saliency_analysis.py provides a more general “per-class” saliency analyzer:
- supports second-order approximation (use_second_order=True)
- can compute per-class importance tensors and visualize distributions

Interpretation:

higher saliency ⇒ parameter is important for loss/accuracy
regrow highest-saliency weights among currently pruned positions (RigL-inspired)

4) Benchmark: timing comparison of regrowth methods

Use the benchmark to compare the major components:

preprocessing
- reference method: SSIM feature extraction + SSIM scores
- saliency method: gradient accumulation
selection
mask update
mini-finetuning

Quick (single episode)

python quick_benchmark.py --m_name resnet20

Full benchmark (multiple runs, plots)

python benchmark_regrowth_methods.py --m_name resnet20 --num_runs 3

Quickstart recipes

0. Produce the required sparse checkpoint(s)

Most regrowth and benchmark scripts assume the 99% sparse checkpoint exists:

python main.py --m_name resnet20 --pruner magnitude --iter_end 1

1. Run a single-episode timing comparison

python quick_benchmark.py --m_name resnet20

2. Test whether SSIM actually predicts “good layers to regrow”

single_layer_regrowth_analysis.py applies the full global regrowth budget to one layer at a time, then finetunes and measures the recovered accuracy.

python single_layer_regrowth_analysis.py --m_name resnet20

3. Compute and visualize FairPrune-style saliency (analysis-only)

Use utils/saliency_analysis.py if you want diagnostic plots (per-layer / per-class distributions). This is separate from the RL saliency regrowth code.

Experiment results (VGG16, CIFAR-10)

PART1: Regrowth

The table below summarizes the VGG16 results from your tracking sheet.

Setting A: Iterative magnitude pruning (10 iterations)

Sparsity (%)	Test acc (mean±std)
40.00	92.09 ± 0.28
64.00	92.43 ± 0.12
78.40	92.60 ± 0.15
87.04	92.68 ± 0.18
92.22	92.68 ± 0.19
95.33	92.49 ± 0.26
97.20	92.35 ± 0.02
98.32	91.92 ± 0.09
98.99	91.01 ± 0.05
99.40	90.31 ± 0.18

Setting B: One-shot magnitude pruning

Sparsity (%)	Test acc (mean±std)
95.0	92.43 ± 0.10
96.0	92.41 ± 0.22
97.0	92.27 ± 0.06
98.0	91.93 ± 0.17
99.0	91.03 ± 0.17
99.5	89.53 ± 0.10
99.9	81.08 ± 0.85

Regrowth Outcome (start from 99% sparsity)

Baseline	Step	Result	Improvement
91.93	step1	92.33	+0.40
92.27	step2	92.41	+0.14
92.41	step3	92.57	+0.16
92.43	step4	92.73	+0.30

PART2: Growth Metric Evaluation

Below is a single-episode timing breakdown (VGG16 @ 99% sparsity, regrow $K\approx 147{,}155$ weights, CUDA).

Weight selection time comparison

Method	Weight selection total (s)	Notes
Reference-based (SSIM + reference masks/weights)	0.0178	Includes loading reference masks/weights (0.0012s) and selecting 147,148 weights (0.0166s)
Saliency-based (gradient saliency)	0.0041	Selects 147,148 weights by direct saliency ranking

In this run, saliency-based selection is $\approx 4.38\times$ faster (absolute savings: 0.0137s).

Citation

If you use this codebase, results, or figures in your work, please refer:

@article{liu2025beyond,
  title={Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff},
  author={Liu, Junchen and Sheng, Yi},
  journal={arXiv preprint arXiv:2511.11675},
  year={2025}
}

We also build on the public magnitude-pruning implementation from LAMP (ICLR 2021):

@article{lee2020layer,
  title={Layer-adaptive sparsity for the magnitude-based pruning},
  author={Lee, Jaeho and Park, Sejun and Mo, Sangwoo and Ahn, Sungsoo and Shin, Jinwoo},
  journal={arXiv preprint arXiv:2010.07611},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.idea		.idea
iclr2021_solution		iclr2021_solution
models		models
result_draw		result_draw
utils		utils
.gitignore		.gitignore
README.md		README.md
apply_allocation.py		apply_allocation.py
apply_allocation_imagenet.py		apply_allocation_imagenet.py
baseline_greedy_random.py		baseline_greedy_random.py
baseline_greedy_saliency.py		baseline_greedy_saliency.py
baseline_proportional_random.py		baseline_proportional_random.py
baseline_proportional_saliency.py		baseline_proportional_saliency.py
batch_sparsity_accuracy.py		batch_sparsity_accuracy.py
batch_test_folder.py		batch_test_folder.py
benchmark_regrowth_methods.py		benchmark_regrowth_methods.py
data_distribution_inference.py		data_distribution_inference.py
feature_compare_.py		feature_compare_.py
feature_compare_3.py		feature_compare_3.py
feature_compare__.py		feature_compare__.py
feature_compare_diff.py		feature_compare_diff.py
feature_sigle_layer_prune.py		feature_sigle_layer_prune.py
feature_single_layer_pretrain.py		feature_single_layer_pretrain.py
feature_statistics.py		feature_statistics.py
final_full_finetue.py		final_full_finetue.py
full_finetune_imagenet.py		full_finetune_imagenet.py
full_finetune_structure.py		full_finetune_structure.py
heat_map.py		heat_map.py
inspect_checkpoint.py		inspect_checkpoint.py
inspect_checkpoint.sh		inspect_checkpoint.sh
inspect_checkpoint_flexible_init.py		inspect_checkpoint_flexible_init.py
inspect_ckpt_saliency.sh		inspect_ckpt_saliency.sh
live_channel.py		live_channel.py
main.py		main.py
parameter.py		parameter.py
parameter_anaysis.py		parameter_anaysis.py
pre_train.py		pre_train.py
pretrain_tiny_imageNet.py		pretrain_tiny_imageNet.py
prune_iterative.py		prune_iterative.py
prune_iterative_imagenet.py		prune_iterative_imagenet.py
prune_oneshot.py		prune_oneshot.py
prune_oneshot_imagenet.py		prune_oneshot_imagenet.py
prune_structure_iterative.py		prune_structure_iterative.py
prune_structure_oneshot.py		prune_structure_oneshot.py
quick_benchmark.py		quick_benchmark.py
regrow_rl.sh		regrow_rl.sh
regrow_rl_saliency.sh		regrow_rl_saliency.sh
regrowth_onetshot_structure.py		regrowth_onetshot_structure.py
requirements.txt		requirements.txt
rl_L1_regrowth_structure.py		rl_L1_regrowth_structure.py
rl_regrowth_nas.py		rl_regrowth_nas.py
rl_regrowth_structure_oneshot.py		rl_regrowth_structure_oneshot.py
rl_saliency_regrowth.py		rl_saliency_regrowth.py
rl_saliency_regrowth_iterative_imagent.py		rl_saliency_regrowth_iterative_imagent.py
rl_saliency_regrowth_iterative_new.py		rl_saliency_regrowth_iterative_new.py
rl_saliency_regrowth_iterative_uninstall .py		rl_saliency_regrowth_iterative_uninstall .py
rl_saliency_regrowth_oneshot.py		rl_saliency_regrowth_oneshot.py
rl_saliency_regrowth_oneshot_imagenet.py		rl_saliency_regrowth_oneshot_imagenet.py
run_dual_lottery_experiments.sh		run_dual_lottery_experiments.sh
single_layer_regrowth_analysis.py		single_layer_regrowth_analysis.py
single_layer_ssim_imagenet.py		single_layer_ssim_imagenet.py
struceture_final_model_test.py		struceture_final_model_test.py
structure_diff.py		structure_diff.py
test_final_model.py		test_final_model.py
test_pretrain.py		test_pretrain.py
test_structure_.py		test_structure_.py
top_k_final_model.py		top_k_final_model.py
topk-acc.py		topk-acc.py
train.sh		train.sh
visual_mask_model.py		visual_mask_model.py

Folders and files

Latest commit

History

Repository files navigation

Bidirectional Pruning-Regrowth

What “regrowth” means in this repo

Folder Overview

Training / pruning

Regrowth

Metrics / analysis

Benchmarking

Inspecting / selective finetuning

Setup

Environment

Data

Workflow overview

1) Pretraining + Pruning

Prune rate note (ICLR2021 LAMP codepath)

2) Regrowth methods

A. Reference-based regrowth (SSIM + reference masks)

B. Saliency-based regrowth

3) Growth metrics (what to report)

A. Sparsity and regrowth budget

B. Layer capacity

C. SSIM-based “growth metric” / priority

D. Saliency-based importance

4) Benchmark: timing comparison of regrowth methods

Quick (single episode)

Full benchmark (multiple runs, plots)

Quickstart recipes

0. Produce the required sparse checkpoint(s)

1. Run a single-episode timing comparison

2. Test whether SSIM actually predicts “good layers to regrow”

3. Compute and visualize FairPrune-style saliency (analysis-only)

Experiment results (VGG16, CIFAR-10)

PART1: Regrowth

Regrowth Outcome (start from 99% sparsity)

PART2: Growth Metric Evaluation

Weight selection time comparison

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages