Back to Methodology

Validation Report

Walk-forward backtest results with statistical significance analysis

OOS Sharpe Range
0.43 – 1.58
Across 4 non-overlapping periods
All OOS Positive?
4/4
P(4/4 by chance) = 6.25%
Individually Significant?
None
At 5% level (t > 1.96)
Data Span
384
Months (Jan 1994 – Dec 2025)

Methodology

Data Source: Kenneth French Data Library (Fama-French 5-Factor + Momentum)
Optimization: SLSQP with 300 random restarts per profile
ML Model: ElasticNet (alpha=0.01, l1_ratio=0.5) with 99 features
Bootstrap: 200 samples for weight confidence intervals
Factor Mapping:
Quality → RMW(Robust Minus Weak)Value → HML(High Minus Low)Stability → CMA(Conservative Minus Aggressive)Growth → Mkt-RF(Market excess return)Momentum → UMD(Up Minus Down)

Walk-Forward Validation

PeriodRangeMonthsAnn. ReturnAnn. VolSharpeSortinoMax DDSignificance
TrainIN-SAMPLE
1994–2015264+4.77%4.20%
1.14
1.62-9.48%t=4.16
Validate
2016–201724+4.30%2.72%
1.58
5.57-1.25%t=1.49
Test 1
2018–201924+2.28%3.03%
0.75
0.92-2.49%t=0.94
Test 2
2020–202124+8.65%6.54%
1.32
2.05-6.37%t=1.36
Test 3
2022–202548+2.52%5.80%
0.43
0.69-7.64%t=0.82
Statistical significance note: Using the Lo (2002) standard error formula SE(S) = √((1 + S²/2) / Nyears), none of the individual OOS periods reach t > 1.96. This is expected — 24-month windows simply don't have enough data to distinguish moderate Sharpe ratios from zero. The consistent positive direction across all 4 OOS periods (P = 6.25% by chance) is the stronger evidence.

Factor Weights

Quality (RMW)
36.6%
Value (HML)
2.9%
Stability (CMA)
47.7%
Growth (Mkt-RF)
2.9%
Momentum (UMD)
9.9%
ML Blend Weight20%
Fundamental Blend Weight80%

Bootstrap 90% CI

200 samples
Quality
[17.7%, 36.4%]
Value
[3.0%, 3.0%]
Stability
[24.3%, 42.3%]
Growth
[21.9%, 30.4%]
Momentum
[5.3%, 16.4%]

Regime Analysis

Bull Market147mo
+4.8%
Ann. Return
4.0%
Ann. Vol
1.23
Sharpe
Bear Market48mo
+1.6%
Ann. Return
6.8%
Ann. Vol
0.23
Sharpe
Sideways Market177mo
+5.3%
Ann. Return
4.2%
Ann. Vol
1.25
Sharpe

Known Limitations

Short OOS Windows

24-month out-of-sample windows have wide confidence intervals. A Sharpe of 0.75 over 24 months has SE ≈ 0.80 — the 95% CI includes zero.

Factor Proxy Gap

Fama-French factors are long-short academic portfolios, not directly tradable strategies. Our stock selection uses these factors differently than the backtest.

Survivorship Bias

Walk-forward validation prevents look-ahead bias but cannot fully eliminate survivorship bias in the underlying factor return data.

Aggressive Bear Collapse

The aggressive profile shows Sharpe of -0.78 in bear markets. This is a known structural weakness — growth and momentum factors reverse in drawdowns.

Bootstrap Instability

Aggressive profile bootstrap CIs are extremely wide — growth weight CI [3%, 60%] indicates the optimal weight is not well-determined.

Not Financial Advice

Past performance does not guarantee future results. This validation is a statistical exercise, not a promise of returns.

Overall Assessment

LIKELY REAL — Consistent Direction

The conservative profile shows positive Sharpe ratios across all 4 out-of-sample periods, with the weakest being 0.43 over the most recent 48 months. While no individual OOS period reaches statistical significance (the windows are too short), the consistent positive direction has only a 6.25% probability of occurring by chance.

The aggressive profile is higher-return but structurally fragile — it collapses in bear markets and its bootstrap CIs are extremely wide. Use it for growth-seeking portfolios where you accept the drawdown risk.

This is a statistical exercise, not financial advice. The backtest uses academic factor returns as proxies, not our exact stock selection methodology. Real-world performance will differ.