Draft:Deflated Sharpe Ratio
![]() | Review waiting, please be patient.
This may take 3 months or more, since drafts are reviewed in no specific order. There are 2,874 pending submissions waiting for review.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
Reviewer tools
|
The Deflated Sharpe Ratio (DSR) is a statistical method used to determine whether the Sharpe Ratio of an investment strategy is statistically significant, after correcting for selection bias, backtest overfitting, and non-normality in return distributions. It provides a more reliable test of financial performance, especially when many strategies are evaluated.[1]
Background
[edit]The Sharpe Ratio, developed by William F. Sharpe, is a widely used measure of risk-adjusted return, calculated as the ratio of excess return over the risk-free rate to the standard deviation of returns. While useful, the Sharpe Ratio has limitations, especially when applied to multiple strategy evaluations. Issues such as selection bias, where the best-performing strategy is chosen from a large set, and backtest overfitting, where a strategy is tailored to past data, can inflate the Sharpe Ratio, leading to misleading conclusions about a strategy's effectiveness. Additionally, the Sharpe Ratio assumes normally distributed returns, an assumption often violated in practice, and it does not take into account sample length.[2]
Mathematical Definition
[edit]The DSR is defined as:
Where:
- is the observed Sharpe Ratio (not annualized),
- is the threshold Sharpe Ratio that reflects the highest Sharpe Ratio expected from unskilled strategies,
- is the skewnewss of the returns,
- is the kurtosis of the returns,
- is the returns' sample length.
- is the standard normal cumulative distribution function.
The threshold is approximated by:
Where:
- is the cross-sectional variance of Sharpe Ratios across trials,
- is the Euler-Mascheroni constant (approx. 0.5772),
- is Euler's number,
- is the inverse standard normal CDF,
- is the number of independent strategy trials.[1]
False Strategy Theorem: Statement and Proof
[edit]The False Strategy Theorem provides the theoretical foundation for the Deflated Sharpe Ratio (DSR) by quantifying how much the best Sharpe Ratio among many unskilled strategies is expected to exceed zero purely due to chance. Even if all tested strategies have true Sharpe Ratios of zero, the highest observed Sharpe Ratio will typically be positive and statistically significant—unless corrected. The DSR corrects for this inflation.[3]
Statement
[edit]Let be Sharpe Ratios independently drawn from a normal distribution with mean zero and variance . Then the expected maximum Sharpe Ratio among these trials is approximately:
Where:
- is the quantile function (inverse CDF) of the standard normal distribution,
- is the Euler–Mascheroni constant,
- is Euler’s number,
- is the number of independent trials.
This value is the **expected maximum Sharpe Ratio** under the null hypothesis of no skill. It represents a benchmark that any observed Sharpe Ratio must exceed in order to be considered statistically significant.
Proof Sketch
[edit]Let be independent standard normal variables. The expected maximum of such variables is approximated by:
Now let for each . Then:
Combining the two expressions gives:
If is estimated as the cross-sectional variance of Sharpe Ratios , then:
This completes the derivation.
Implication for the DSR
[edit]The False Strategy Theorem shows that in large-scale testing, even unskilled strategies will produce apparently "significant" Sharpe Ratios. To correct for this, the DSR adjusts the observed Sharpe Ratio by subtracting the expected maximum from noise, , and scaling by its standard error:
This yields the probability that the observed Sharpe Ratio reflects true skill, not selection bias or overfitting.
Estimating the Effective Number of Trials (N)
[edit]In practice, many strategy trials are not independent due to overlapping features. To estimate the effective number of independent trials , López de Prado (2018) proposes clustering similar strategies using unsupervised learning techniques such as hierarchical clustering on the correlation matrix of their returns.[4]
This approach yields a conservative lower bound for . Alternatively, spectral methods (e.g. eigenvalue distribution of the correlation matrix) can also provide estimates of .[5]
Confidence and Power of the Sharpe Ratio under Multiple Testing
[edit]To assess the significance of Sharpe Ratios under multiple testing, López de Prado (2018) derives closed-form expressions for the Type I and Type II errors.
Confidence
[edit]The probability that a discovered strategy is not a false positive (i.e., the confidence) is:
Where:
- is the number of return observations,
- and are the skewness and kurtosis of returns,
- is the number of effectively independent trials.[6]
Power
[edit]The power of the test, i.e., the probability of correctly identifying a strategy with true Sharpe Ratio , is:
With:
These equations quantify the reliability of observed Sharpe Ratios under multiple testing and return non-normality.[6]
See also
[edit]References
[edit]- ^ a b Bailey, D. H., & López de Prado, M. (2014). The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting, and Non-Normality. The Journal of Portfolio Management, 40(5), 94–107.
- ^ Bailey, D. H., & Borwein, J. & López de Prado, M. (2014): "Pseudo-Mathematics and Financial Charlatanism: The Effects of Backtest Overfitting on Out-Of-Sample Performance". Notices of the American Mathematical Society, 61(5), pp. 458-471.
- ^ López de Prado, M., & Bailey, D. H. (2018). The False Strategy Theorem: A Financial Application of Experimental Mathematics. American Mathematical Monthly, Volume 128, Number 9, pp. 825-831.
- ^ López de Prado, M., & Lewis, M. J. (2019): Detection of False Investment Strategies Using Unsupervised Learning Methods. Quantitative Finance, 19(9), pp.1555-1565.
- ^ López de Prado, M. (2019): A Data Science Solution to the Multiple-Testing Crisis in Financial Research. Journal of Financial Data Science, 1(1), pp. 99-110.
- ^ a b López de Prado, M. (2022): "Type I and Type II Errors of the Sharpe Ratio under Multiple Testing", The Journal of Portfolio Management, 49(1), pp. 39 - 46