
Feature/performance#448

Merged
essweine merged 17 commits into main from feature/performance
Apr 17, 2026

Conversation

@danfunk
Collaborator

@danfunk danfunk commented Mar 2, 2026

This adds a "copy-on-write" strategy that seems to improve performance and reduce the size of serialized data. This was done with Claude doing most of the work and making recommendations, and me just vetting code, asking questions, and keeping a very tight rein on changes, so that we are making as minimal a change to existing code as possible.
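The core copy-on-write idea can be sketched in a few lines. This is an illustrative mapping, not the actual SpiffWorkflow code: reads fall through to a shared base dict, and the first write creates a private overlay, so unchanged data is never duplicated and only the overlay needs serializing.

```python
class CopyOnWriteDict:
    """Minimal copy-on-write mapping sketch (hypothetical, not the
    PR's actual class): reads fall through to a shared base dict;
    the first write lazily creates a private overlay."""

    def __init__(self, base):
        self._base = base       # shared, treated as read-only
        self._overlay = None    # created lazily on first write

    def __getitem__(self, key):
        if self._overlay is not None and key in self._overlay:
            return self._overlay[key]
        return self._base[key]

    def __setitem__(self, key, value):
        if self._overlay is None:
            self._overlay = {}  # the "copy" happens only on write
        self._overlay[key] = value

    def changed(self):
        """Only the overlay would need to be serialized."""
        return dict(self._overlay or {})

shared = {"a": 1, "b": 2}
cow = CopyOnWriteDict(shared)
cow["b"] = 99
# shared stays untouched; only {'b': 99} needs serializing
```

The win during serialization comes from `changed()`: tasks that never wrote anything contribute nothing to the serialized payload.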

I am not expecting you to merge this, so much as have a hard look at it so we can talk about it tomorrow post scrum.

We still hit recursion limit on 1000 loops, so that particular problem isn't addressed yet.

Performance Tests Prior to Changes:

| Loop Count (Items) | Execution Time (s) | Serialize Time (s) | Deserialize Time (s) | Total Time (s) |
|--------------------|--------------------|--------------------|----------------------|----------------|
| 20                 | 0.045              | 0.061              | 0.045                | 0.152          |
| 100                | 0.662              | 1.402              | 1.109                | 3.173          |
| 200                | 2.451              | 6.203              | 4.439                | 13.093         |
| 300                | 5.558              | 16.018             | 10.378               | 31.955         |

Performance Tests after Changes:

| Loop Count (Items) | Execution Time (s) | Serialize Time (s) | Deserialize Time (s) | Total Time (s) |
|--------------------|--------------------|--------------------|----------------------|----------------|
| 20                 | 0.019654           | 0.021859           | 0.015446             | 0.056960       |
| 100                | 0.238948           | 0.435454           | 0.336278             | 1.010681       |
| 200                | 0.837055           | 1.894795           | 1.382747             | 4.114597       |
| 300                | 1.887173           | 4.150990           | 2.952501             | 8.990664       |

@essweine
Contributor

essweine commented Mar 3, 2026

I removed the custom class and replaced it with two one-line static methods for comparing two dictionaries, which the serializer now uses to maintain partial serializations.
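In the spirit of those one-liners, dict-comparison helpers for partial serialization might look like this. The names `DictDiff`, `diff`, and `removed` are hypothetical, for illustration only:

```python
class DictDiff:
    """Hypothetical one-line static helpers: compute which entries
    changed between two snapshots of a data dict, so the serializer
    can re-serialize only those."""

    @staticmethod
    def diff(new, old):
        # entries that are new or whose values changed
        return {k: v for k, v in new.items() if k not in old or old[k] != v}

    @staticmethod
    def removed(new, old):
        # keys present before but gone now
        return {k for k in old if k not in new}

previous = {"x": 1, "y": 2, "z": 3}
current = {"x": 1, "y": 5}
# DictDiff.diff(current, previous) -> {'y': 5}
# DictDiff.removed(current, previous) -> {'z'}
```

Keeping these as static methods over plain dicts, rather than a wrapper class, means the task data itself stays an ordinary dict with no custom type to serialize.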

Claude's "optimization" basically just replaced the deepcopy with a shallow copy, which has caused problems in the past with shared data references between tasks, and was the reason we were using deepcopy anyway (it certainly wasn't my personal preference!). I'm amazed none of the existing tests failed. I replaced it with DeepMerge which is not quite as aggressive about copying things, but nobody should be surprised if weird data problems crop up in six months.

I changed do_engine_steps (another method I just love) to a non-recursive version, which fixes the recursion issue.
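The shape of that fix is a standard one: replace recursion-per-completed-task with an explicit loop that drains a ready list. This is an illustrative sketch over a toy task structure, not the actual SpiffWorkflow method:

```python
def do_engine_steps(tasks):
    """Iterative engine-step sketch (hypothetical data model): keep
    completing READY tasks and waking their successors until nothing
    is ready, so loop count never grows the Python call stack."""
    while True:
        ready = [t for t in tasks if t["state"] == "READY"]
        if not ready:
            break
        for task in ready:
            task["state"] = "COMPLETED"
            # completing a task may make its successors ready
            for succ in task["next"]:
                if tasks[succ]["state"] == "WAITING":
                    tasks[succ]["state"] = "READY"

# a chain of 1000 tasks, which would blow the default recursion
# limit if each completion recursed into the next
tasks = [{"state": "WAITING", "next": [i + 1]} for i in range(1000)]
tasks[0]["state"] = "READY"
tasks[-1]["next"] = []
do_engine_steps(tasks)
```

With the recursion gone, the 1000-loop case mentioned earlier is bounded by heap memory rather than `sys.getrecursionlimit()`.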

I ran the 300 item test with the following results:

    Execution:      0.780830 seconds
    Serialization:  0.882112 seconds
    Deserialization: 1.717875 seconds
    Total:          3.380817 seconds

So either my code is a legit improvement or I have a faster computer.

danfunk and others added 9 commits March 4, 2026 09:29
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Instead of two passes (Python dict-building + json.dumps), serialize_json
now uses a SpiffEncoder that lets the C-level JSON encoder handle plain
data traversal natively, only calling back to Python for registered types.

Periodic serialization test (300 items, 180 serializations):

| Metric                         | Before   | After   | Improvement   |
|--------------------------------|----------|---------|---------------|
| Total time                     | 162.5s   | 45.0s   | 3.6x faster   |
| Total serialization            | 160.7s   | 43.3s   | 3.7x faster   |
| Avg per serialization          | 0.893s   | 0.241s  | 3.7x faster   |
| Final checkpoint (308 tasks)   | 2.07s    | 0.56s   | 3.7x faster   |

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
refactor serialize_json to use custom JSONEncoder for ~3.7x speedup
@essweine essweine merged commit b8a9aaf into main Apr 17, 2026
5 checks passed
@essweine essweine deleted the feature/performance branch April 17, 2026 15:28