S-23 — Statistical Validation Report

Statistical analysis of the complete backtest.

3,050 trades · MNQ · 5-minute bars · 2 contracts · $1.30 round-trip commission applied · January 2020 – May 2026 (6.4 years)

✓ Core performance figures on this page were verified directly from the raw trade log.

HYPOTHETICAL PERFORMANCE RESULTS HAVE MANY INHERENT LIMITATIONS, SOME OF WHICH ARE DESCRIBED BELOW. NO REPRESENTATION IS BEING MADE THAT ANY ACCOUNT WILL OR IS LIKELY TO ACHIEVE PROFITS OR LOSSES SIMILAR TO THOSE SHOWN; IN FACT, THERE ARE FREQUENTLY SHARP DIFFERENCES BETWEEN HYPOTHETICAL PERFORMANCE RESULTS AND THE ACTUAL RESULTS SUBSEQUENTLY ACHIEVED BY ANY PARTICULAR TRADING PROGRAM. ONE OF THE LIMITATIONS OF HYPOTHETICAL PERFORMANCE RESULTS IS THAT THEY ARE GENERALLY PREPARED WITH THE BENEFIT OF HINDSIGHT. IN ADDITION, HYPOTHETICAL TRADING DOES NOT INVOLVE FINANCIAL RISK, AND NO HYPOTHETICAL TRADING RECORD CAN COMPLETELY ACCOUNT FOR THE IMPACT OF FINANCIAL RISK IN ACTUAL TRADING. FOR EXAMPLE, THE ABILITY TO WITHSTAND LOSSES OR TO ADHERE TO A PARTICULAR TRADING PROGRAM IN SPITE OF TRADING LOSSES ARE MATERIAL POINTS WHICH CAN ALSO ADVERSELY AFFECT ACTUAL TRADING RESULTS. THERE ARE NUMEROUS OTHER FACTORS RELATED TO THE MARKETS IN GENERAL OR TO THE IMPLEMENTATION OF ANY SPECIFIC TRADING PROGRAM WHICH CANNOT BE FULLY ACCOUNTED FOR IN THE PREPARATION OF HYPOTHETICAL PERFORMANCE RESULTS AND ALL OF WHICH CAN ADVERSELY AFFECT ACTUAL TRADING RESULTS.

Hypothetical/backtested results — not indicative of future results. See the full Hypothetical Performance Disclosure.

Robustness profile

THE NINE ROBUSTNESS CRITERIA

The statistical analysis evaluated the system against nine robustness criteria. The backtest met all nine:

  • Statistically significant edge (p < 0.001)
  • Equity-curve linearity (R² = 0.985)
  • System quality (SQN = 6.85)
  • Stable performance across both halves of the backtest
  • 100% of 24-month windows closed above breakeven
  • Sample size of 3,050 trades over 6.4 years
  • Edge robust to fill noise (positive in 100% of runs at ±2 ticks)
  • Sequence-independent edge (100% of 5,000 Monte Carlo paths ended above breakeven)
  • Low risk of ruin (3.1%)

Caption: Cumulative net result across all 3,050 trades, with least-squares linear fit (R² = 0.985).

Performance metrics

NET RESULT

$41,342

Across 3,050 trades, after commission

TOTAL TRADES

3,050

1,752 long · 1,298 short

WIN RATE

48.8%

1,487 W · 1,563 L

PROFIT FACTOR

1.31

Gross gains / gross losses

EXPECTANCY

$13.55

Per-trade average, net

AVG WIN / AVG LOSS

$117.00 / −$84.86

Win/Loss ratio 1.38

MAX DRAWDOWN

$3,338

Over 220 trades (Apr 2021)

MAX LOSING STREAK

24

Average streak 4.1

SQN

6.85

System Quality Number

K-RATIO

8.09

Equity-curve smoothness

MAR RATIO

1.95

Return vs. max drawdown

EQUITY R²

0.985

Linearity of the equity curve

ULCER INDEX

21.63

Moderate — drawdowns can be prolonged

YEARS ABOVE BREAKEVEN

6 of 6

No losing calendar year

Caption: Drawdown from running equity peak. Maximum historical drawdown: $3,338 (April 2021), recovered over the following months.

Statistical edge

Expectancy test. Average net result per trade of $13.55 (standard deviation $109.30) yields a t-statistic of 6.849 and a p-value below 0.001 — a highly significant result. The probability this edge is due to chance is under 1 in 1,000.

Win-rate test — stated plainly. The win rate of 48.8% is not statistically above 50% (z = −1.376, p = 0.916). The edge does not come from how often it wins; it comes from average wins ($117.00) being larger than average losses ($84.86). A win rate below 50% can still produce a positive expectancy when winners outsize losers.

95% confidence intervals

Measure95% confidence intervalRead
Expectancy$9.65 → $17.80Entire range positive
Profit Factor1.22 → 1.42Entire range above 1.0
Win Rate47.0% → 50.5%Spans 50%
Max Drawdown$1,377 → $3,419Bounded

Sample-size confidence: with 3,050 trades, the 90%, 95% and 99% confidence thresholds are all achieved.

Consistency over time

First half vs. second half

MetricFirst half (1,525)Second half (1,525)Change
Win Rate49.9%47.6%−4.6%
Profit Factor1.301.32+1.4%
Expectancy$12.80$14.31+11.8%
Avg Win$110.43$123.89+12.2%
Avg Loss$84.45$85.27+1.0%
Equity R²0.9280.982+5.8%
Max Drawdown$3,338$2,352−29.5%
Total P&L$19,522$21,821

The second half held or improved on the first across nearly every measure — no sign of edge decay.

Year by year

YearTradesWin RateProfit FactorExpectancyEquity R²Net Result
202049451.2%1.33$13.090.920$6,466
202148852.5%1.40$15.810.768$7,715
202248046.3%1.18$8.480.653$4,069
202347849.8%1.32$13.230.908$6,322
202447648.5%1.37$15.790.930$7,518
202545445.6%1.30$14.340.722$6,508
2026 (partial)18044.4%1.30$15.240.110$2,743

Caption: Net result by calendar year. Every full year closed above breakeven; 2026 is a partial year.

Result by quarter (full period)

Q1 $8,142 (19.7%) · Q2 $11,472 (27.7%) · Q3 $11,455 (27.7%) · Q4 $10,273 (24.8%) — contribution spread across all four quarters.

Rolling-window analysis

WindowWindows% PositiveAvg P&LWorst window
1 month7775.3%$537−$1,734
3 months7586.7%$1,616−$1,609
6 months7293.1%$3,224−$2,290
12 months66100%$6,262+$1,254
18 months60100%$9,775+$6,502
24 months54100%$13,135+$8,436

As the holding window lengthens, the share of positive windows rises to 100% at 12 months and beyond — the pattern expected of a genuine edge rather than noise.

Caption: Monthly net result. 58 of 77 months closed positive across the test period.

Monte Carlo — sequence robustness

The trade sequence was randomized across 5,000 simulations to test whether the result depends on the specific order of trades. 100% of the 5,000 paths ended above breakeven, with a median ending equity of $37,227 and a median drawdown of $1,942.

PercentileFinal equityMax drawdown
Worst case$30,590$4,894
5th$34,140$2,934 (95th)
25th$35,970$1,693
Median$37,227$1,942
75th$38,473$1,693 (25th)
Best case$43,600$1,112

Risk of ruin

Across 10,000 randomized sequences, the probability of hitting a $2,000 drawdown threshold was 3.1% — classified Low. Median trades to a successful outcome: 160.

Slippage / noise robustness

Symmetric (mean-zero) price noise of up to ±5 ticks was applied to every fill across 1,000 simulations. Median result of $41,354 closely tracked the original $41,342 (drift 0.0%), with a positive-result share of 100% at every noise level tested and a 5th–95th percentile band of $41,183 → $41,513. The edge reflects structure, not lucky fills.

Trade activity

77 months of operation · 39.6 trades per month on average · 0 months without trades · peak month March 2021 (56 trades).

Risk disclosure. S-23 is a software tool. It does not constitute financial advice, an investment recommendation, or personalized advisory services. Seraphim Quant LLC is not a registered investment adviser, commodity trading advisor, or broker/dealer. The figures on this page are derived from a historical backtest and are hypothetical; hypothetical results have inherent limitations and do not reflect live execution. Trading futures involves substantial risk of loss and is not suitable for all investors. Past performance does not guarantee future results. See our complete Risk Disclosure Statement before making any purchase or operational decision.

Seraphim Quant LLC · Figures produced through standard statistical analysis; core performance metrics verified from the raw trade log · © 2026 Seraphim Quant LLC