v2 baseline — serialization performance (3-way)

Captured 2026-04-23 from branch spike/source-generator. Host: Windows 11, .NET 9.0.15 RyuJIT AVX2, .NET SDK 10.0.201, BenchmarkDotNet 0.14.0. Source: dotnet/benchmarks/SerializationPerformance (SerializationComparisonBenchmarks.cs).

Full numbers + CSV: SerializationComparison.md · SerializationComparison.csv.

Numbers reflect the post-optimization generator. After the initial 3-way capture, three incremental generator improvements landed (LINQ→for-loops, hoisted flag setup for list/dict iteration, elided HashSet allocation for cycle-safe type graphs). See opt-iterations/README.md for the step-by-step breakdown and raw logs. The numbers below are the final (post-#3) state.

Note on runtime drift. The earlier 2-way baseline (not preserved — superseded by this run) recorded host .NET 10.0.7. This 3-way run landed on .NET 9.0.15 because the benchmark project targets net8.0 and the local dotnet host rolled forward to the next available shared runtime. Ratios are internally consistent within this run; absolute numbers drift ~15–25% slower than the earlier .NET 10 numbers across every engine. Re-running under a pinned .NET 10 host is a future cleanup if we want to compare ratios across time.

What’s measured

One benchmark class, four data shapes (SimpleModel, SimpleModelList[100], ComplexNestedModel, ComplexNestedModelList[25]). For each shape, eight method variants:

Stj_Reflection — raw System.Text.Json, reflection-based resolver. The baseline everything else ratios against.
Stj_SourceGen — raw System.Text.Json, TypeInfoResolver = BenchmarkJsonContext.Default (STJ’s own source generator). What an AOT-targeting app would use without Popcorn.
PopcornDefault — Popcorn source-generated converters, ?include= empty. Emits only [Always] / [Default] members.
PopcornAll — Popcorn source-gen, ?include=[!all]. Emits everything (matches STJ output shape).
PopcornCustom — Popcorn source-gen, hand-built include list.
LegacyDefault — legacy PopcornNetStandard (runtime-reflection expander), empty includes. PopcornFactory.Expand(...) → Dictionary<string, object?> → reflection-STJ. Per-request CreatePopcorn() allocation included.
LegacyAll — legacy with an explicit full enumeration of non-Never properties (!all in the legacy engine has a latent ArgumentException collision with AlwaysInclude, so we pass the explicit list instead).
LegacyCustom — legacy with the same hand-built include list as PopcornCustom.

Both DefaultJob (out-of-process, more trustworthy) and Job-HLUJQF (InProcessEmitToolchain, configured in Program.cs) are reported by BDN; numbers below quote DefaultJob.

Headline results (DefaultJob, mean time and allocation ratio vs Stj_Reflection)

Scenario	Stj_SrcGen	Popcorn_Default	Popcorn_All	Popcorn_Custom	Legacy_Default	Legacy_All	Legacy_Custom
SimpleModel	0.95× / 1.00×	1.24× / 1.41×	1.77× / 2.03×	1.70× / 1.54×	5.12× / 5.81×	8.60× / 9.59×	4.44× / 5.73×
SimpleModelList (100)	0.96× / 1.00×	0.81× / 1.03×	1.40× / 1.67×	1.26× / 1.13×	4.50× / 5.34×	8.08× / 9.26×	4.29× / 5.25×
ComplexModel	1.02× / 1.00×	0.15× / 0.17×	1.21× / 1.16×	1.39× / 1.11×	0.49× / 0.55×	4.73× / 4.67×	2.79× / 2.96×
ComplexModelList (25)	1.02× / 1.00×	0.10× / 0.10×	0.87× / 0.93×	0.99× / 0.82×	0.57× / 0.67×	3.61× / 4.41×	2.20× / 2.59×

(Time ratio / allocation ratio. Values < 1 mean faster or less allocation than reflection-STJ.)

Findings

Popcorn source-gen is dramatically faster than the legacy reflection engine, everywhere. For the All case (fair fight — both engines emit the same fields), source-gen is 3–8× faster than legacy. For Default (engine’s own include-filtering win), source-gen on ComplexModelList is 4.7× faster than legacy-default (3.9µs vs 18.3µs) with 5× less allocation. This is the headline v2 migration claim validated end-to-end: moving from runtime reflection to the Roslyn generator buys real throughput.
Legacy’s reflection engine pays a heavy allocation tax. Even best-case LegacyDefault on SimpleModelList[100] allocates 152KB vs STJ’s 28KB — a 5× bloat from building the intermediate Dictionary<string, object?> projection before JSON-serialization. Source-gen skips that step entirely.
Popcorn’s selectivity thesis holds, and holds harder against legacy. ComplexModelList_PopcornDefault is 3,150 ns / 5,456 B: ~10× faster than STJ and ~5.8× faster than legacy-default for the same “mobile client asks for a subset” workload. The “pay less when you ask for less” claim is consistent across both comparison axes.
STJ source-gen remains a wash vs STJ reflection at these model sizes (0.98×–1.02× time, identical allocation). Metadata source-gen is about startup + AOT compatibility, not hot-path throughput. Popcorn’s gains do not come from the STJ source-gen it sits on top of.
Faster than STJ when emitting everything on nested data. Popcorn_All on ComplexModelList is 0.87× time / 0.93× alloc vs reflection-STJ — Popcorn is now faster than STJ even in the “pay full tax, emit everything” scenario for complex nested data. Legacy cannot say the same: LegacyAll on the same shape is 3.6× slower and 4.4× more alloc. (The pre-optimization run was at 0.97×/1.03× — parity. The three in-generator optimizations tipped it past.)
Flat simple data is where Popcorn pays a tax, but still beats legacy. SimpleModelList_PopcornAll is 1.40×/1.67× vs STJ — the per-property include-reference check overhead on flat scalars. But SimpleModelList_LegacyAll is 8.1×/9.3×, so even the “worst case for Popcorn” is ~5.8× faster than legacy for the identical workload. The v2 migration has no scenario in which legacy is faster than v2. (Pre-optimization was 1.80× — the hoisted-flags change closed half the gap.)
Single-object scalar overhead is the one real shrink. SimpleModel_PopcornAll is 1.77× time (292 ns vs 165 ns) and 2.03× alloc (600 B vs 296 B). Absolute terms: ~130 ns / ~300 B envelope tax per request. Unlikely to matter outside extreme throughput paths; even so, still ~5× faster than legacy’s 1,429 ns. (Pre-optimization: 2.03×/2.62× — elided HashSet knocked ~60 ns and ~180 B off.)

Legacy configuration caveats

The legacy PopcornNetStandard engine is configured in the benchmark to mirror the current models’ [Always] / [Default] / [Never] attribute semantics via PopcornFactory.ConfigureType<T>(...). Two concessions were made to keep the legacy engine runnable with the current model set:

AlwaysInclude is left empty and [Always] fields are moved into DefaultInclude instead. Legacy’s DeterminePropertyReferences adds AlwaysInclude entries without a ContainsKey guard; any include list (or !all enumeration) that already names that field throws ArgumentException. Putting the Always fields into DefaultInclude produces the same output for our three test include shapes (empty / all / custom) without tripping the collision.
!all is replaced in the LegacyAll benchmark with an explicit enumeration of every non-Never property. Same reason — !all’s internal enumeration path trips the same collision when combined with any AlwaysInclude fields. Behaviorally identical for the benchmark models; avoids the latent bug.

Both are workarounds for real defects in the legacy engine that production callers would have either dodged with careful config or accepted as “don’t combine these features.” The benchmark numbers above reflect the engine running under configuration it supports.

Gaps — what this baseline does NOT cover

Other benchmark classes. Only SerializationComparisonBenchmarks was run. The IncludeStrategyBenchmarks, ScalabilityBenchmarks, CircularReferenceBenchmarks, and AttributeProcessingBenchmarks classes exist but are not captured in this baseline and do not have legacy variants wired in.
AOT/trim-published numbers. JIT-compiled Release build only. Native AOT throughput belongs with the AOT CI job.
Runtime consistency with earlier 2-way baseline. This run is on .NET 9.0.15 (rollforward); the earlier 2-way was on .NET 10.0.7. See note at the top.

Reproducing

cd dotnet/benchmarks/SerializationPerformance
DOTNET_ROLL_FORWARD=Major dotnet run -c Release -- comparison

Output drops under dotnet/benchmarks/SerializationPerformance/BenchmarkDotNet.Artifacts/. The per-report markdown/CSV files are copied here manually.

Contents

Latest Post