Draft:Deflated Sharpe Ratio

Review waiting, please be patient.

This may take 2 months or more, since drafts are reviewed in no specific order. There are 2,251 pending submissions waiting for review.

If the submission is accepted, then this page will be moved into the article space.
If the submission is declined, then the reason will be posted here.
In the meantime, you can continue to improve this submission by editing normally.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Reviewer tools

Instructions · What links here · Deflated Sharpe Ratio (talk: + · bio) · (log) · Copyvios report · reFill · Citation Bot · (Search: Google, Wikipedia) · Submitted 58 days ago by Dsr02014 (talk: D · +) · Last edited 15 hours ago by Ivanovich4321

The Deflated Sharpe Ratio (DSR) is a statistical method used to determine whether the Sharpe Ratio of an investment strategy is statistically significant, after correcting for selection bias, backtest overfitting, sample length, and non-normality in return distributions. It provides a more reliable test of financial performance, especially when many trials are evaluated.^[1] The application of the DSR, helps practitioners to detect false investment strategies^[2]^[3].

Relation to the Sharpe Ratio

One of the most important statistics for assessing the performance of an investment strategy is the Sharpe Ratio (SR). The Sharpe ratio was developed by William F. Sharpe and is a widely used measure of risk-adjusted return, calculated as the annualized ratio of excess return over the risk-free rate to the standard deviation of returns. While useful, the Sharpe Ratio has important limitations, especially when applied to multiple strategy evaluations. Issues such as selection bias, where the best-performing strategy is chosen from a large set, and backtest overfitting, where a strategy is tailored to past data, can inflate the Sharpe Ratio, leading to misleading conclusions about a strategy's effectiveness. Additionally, the Sharpe Ratio assumes normally distributed returns^[4], an assumption often violated in practice, and it does not take into account sample length.^[5]

Applying the Deflated Sharpe Ratio in Practice

1. Get a record of all the trials.

In order to apply the DSR, researchers need to record the investment performance in returns (%), for every backtest that they ran. This is in relation to the development of a single specific strategy. For example: when building a momentum based strategy that trades at the end-of-day, 100 historical simulations were run to evaluate the performance and the best set of parameters were selected for the final strategy. Here all 100 simulations need to be recorded, with the strategies daily returns in %.

2. Estimating the Effective Number of Trials N.

In practice, many trials are not independent due to overlapping features. To estimate the effective number of independent trials N, López de Prado (2018) proposes 3 techniques to clustering similar strategies using unsupervised learning techniques:

The Optimal Number of Clusters (ONC) algorithm^[2]^[6]^[7].
Hierarchical clustering could be used to get a conservative lower bound for N.
Alternatively, spectral methods (e.g. eigenvalue distribution of the correlation matrix) can also provide estimates of N.^[2]

Tip:

Multiple testing exercises should be carefully planned in advance, so as to avoid running an unnecessary large number of trials. Investment theory, not computational power, should motivate what experiments are worth conducting.^[1]

Steps to estimate N:

2.1. Convert the correlation matrix to a distance matrix.

In order to apply a clustering algorithm to the returns data, we need make use of a statistical association measure (such as a correlation matrix) and we need to transform it into a distance matrix (such as angular distance) so that elements that are very similar to each other will be close together in their higher-dimensional space.^[8]^[9]

2.2. Apply a clustering algorithm to estimate the number of independent trials.

The number of clusters N, are an estimate of the number of independent trials.

2.3 Plot the Block Correlation Matrix

In the figure below we can see a correlation matrix before and after clustering has been applied. Note how we can see blocks down the diagonal, each block corresponds to a cluster.^[7]

Tip: If you don't use the ONC algorithm to cluster, then you can have blocks with trials that don't match very closely. The ONC algorithm uses silhouette scores to make sure each trial is in the best cluster, at the expense of high computational complexity and longer run times.

3. Compute the Sharpe ratio variance, across clusters.

3.1 Calculate the Sharpe ratio for each cluster.

Each cluster will now form a collection of time series returns (in%), for each cluster you need to create a new time series which represents that cluster using the Inverse Variance Portfolio (IVP) and then compute the Sharpe Ratio for each IVP portfolio. One doesn't need to use the IVP - the goal is to form an aggregate cluster return time series. For this a weighting scheme needs to be used, another alternative could be the minimum variance portfolio.^[7]

3.2 Compute the variance of these Sharpe Ratios

$\mathbf {V} [{\hat {SR}}_{n}]$ is used in the next step, where we apply the False Strategy Theorem to determine the Expected Maximum Sharpe ratio.

4. Compute the Expected Maximum Sharpe ratio using the False Strategy Theorem.

Using the equation from the False Strategy Theorem (FST)^[10] we can compute $SR_{0}$ , which is the threshold Sharpe Ratio that reflects the highest Sharpe Ratio expected from $N$ unskilled strategies.

$SR_{0}={\sqrt {\mathbf {V} [{\hat {SR}}_{n}]}}\left((1-\gamma )\Phi ^{-1}\left[1-{\frac {1}{N}}\right]+\gamma \Phi ^{-1}\left[1-{\frac {1}{Ne}}\right]\right)$

Where:

$\mathbf {V} [{\hat {SR}}_{n}]$ is the cross-sectional variance of Sharpe Ratios across trials,
$\gamma$ is the Euler-Mascheroni constant (approx. 0.5772),
$e$ is Euler's number,
$\Phi ^{-1}$ is the inverse standard normal CDF,
$N$ is the number of independent strategy trials.^[1]

Note:

The FST highlights that the optimal outcome of an unknown number of historical simulations is right-unbounded, with enough trials, there is no Sharpe ratio sufficiently large enough to reject the hypothesis that a strategy is false, i.e., that it is over-fit and wont generalize in the out-of-sample data.^[5]^[7]

5. Compute the DSR for each cluster.

You now have all the variables you need to compute the DSR.

${\text{DSR}}=\Phi \left({\frac {({\hat {SR}}-SR_{0})\cdot {\sqrt {T-1}}}{\sqrt {1-{\hat {\gamma }}_{3}{\hat {SR}}+{\frac {{\hat {\gamma }}_{4}-1}{4}}{\hat {SR}}^{2}}}}\right)$

Where:

${\hat {SR}}$ is the observed Sharpe Ratio (not annualized),
$SR_{0}$ is the threshold Sharpe Ratio that reflects the highest Sharpe Ratio expected from $N$ unskilled strategies,
${\hat {\gamma }}_{3}$ is the skewnewss of the returns,
${\hat {\gamma }}_{4}$ is the kurtosis of the returns,
$T$ is the returns' sample length.
$\Phi$ is the standard normal cumulative distribution function.

Notes:

Readers may recognize that the DSR is the Probabilistic Sharpe Ratio (PSR)^[11], where $SR_{0}$ is the maximum expected Sharpe Ratio (estimated using the False Strategy Theorem) instead of a simple threshold SR (often 0).
The PSR assumes that only 1 trial was run and is often used to determine that the observed SR is greater than 0.
To account for multiple-testing, use the DSR.
The DSR will increase with:
- Greater observed SRs.
- Longer track records.
- Positively skewed returns.
The DSR decreases with:
- Fatter tails (Kurtosis).

6. Complete the Template for Disclosing Multiple Tests.

6.1 Aggregate statistics into a table.

Several peer reviewed papers recommend to aggregate the cluster statistics into a table format.^[6]^[12]^[13]

The table below is Exhibit 7 from "A Practitioner’s Guide to the Optimal Number of Clusters Algorithm"^[6].

Where:

Cluster is the index of the cluster; there are N clusters.
Strat Count is the number of strategies included in that cluster.
aSR is the annualized Sharpe Ratio of that cluster's inverse variance portfolio (IVP).
SR is the non-annualized Sharpe Ratio of that cluster's IVP.
Skew is the skew of the returns of that cluster's IVP.
Kurt is the kurtosis of the returns of that cluster's IVP.
T is the number of observations in the cluster's IVP.
sqrt(V[SR]) is the square root of the variance of Sharpe Ratios that was computed in step 3.
E[max SR] is the Expected Maximum Sharpe ratio ( $SR_{0}$ ), computed in step 4.
DSR is the Deflated Sharpe Ratio for that cluster's IVP.

6.2 Plot the Sharpe Ratios, for each cluster.

In the figure above, we can see a collection of non-annualized Sharpe ratios for the 26 independent trials that were tested in the development of this investment strategy. The bars are highlighted based on if they passed the DSR at a 95% confidence level.

Note that this bar chart doesn't correspond to table above in Exhibit 7 but shares the result that only 1 cluster passed the DSR. The goal with this analysis is to show that for all clusters, except 1 - all of them failed the DSR. This would indicate that the strategy is over-fit and is likely to be a false investment strategy.

6.3 Plot the cumulative returns of the strategies.

In the figure above the cumulative returns are plotted. On the y axis is the total return in% and the x axis are the time indexes. Do you see the very straight line (the strategy with an outlier performance)?

7. Derive a conclusion from these results.

As seen in the plot of cumulative returns, there is one outlier strategy which is likely a false investment strategy as this outlier has very high performance relative to its own cluster and others.

We can see in the bar plots that all the cluster portfolios failed to pass the DSR at a 95% confidence level, except for the one that included this outlier strategy.

Mathematical Definitions

The Deflated Sharpe Ratio (DSR)

{\text{DSR}}=\Phi \left({\frac {({\hat {SR}}-SR_{0})\cdot {\sqrt {T-1}}}{\sqrt {1-{\hat {\gamma }}_{3}{\hat {SR}}+{\frac {{\hat {\gamma }}_{4}-1}{4}}{\hat {SR}}^{2}}}}\right)

Where:

${\hat {SR}}$ is the observed Sharpe Ratio (not annualized),
$SR_{0}$ is the threshold Sharpe Ratio that reflects the highest Sharpe Ratio expected from $N$ unskilled strategies,
${\hat {\gamma }}_{3}$ is the skewnewss of the returns,
${\hat {\gamma }}_{4}$ is the kurtosis of the returns,
$T$ is the returns' sample length.
$\Phi$ is the standard normal cumulative distribution function.

The threshold $SR_{0}$ is approximated by:

SR_{0}={\sqrt {\mathbf {V} [{\hat {SR}}_{n}]}}\left((1-\gamma )\Phi ^{-1}\left[1-{\frac {1}{N}}\right]+\gamma \Phi ^{-1}\left[1-{\frac {1}{Ne}}\right]\right)

Where:

$\mathbf {V} [{\hat {SR}}_{n}]$ is the cross-sectional variance of Sharpe Ratios across trials,
$\gamma$ is the Euler-Mascheroni constant (approx. 0.5772),
$e$ is Euler's number,
$\Phi ^{-1}$ is the inverse standard normal CDF,
$N$ is the number of independent strategy trials.^[1]

False Strategy Theorem: Statement and Proof

The False Strategy Theorem provides the theoretical foundation for the Deflated Sharpe Ratio (DSR) by quantifying how much the best Sharpe Ratio among many unskilled strategies is expected to exceed zero purely due to chance. Even if all tested strategies have true Sharpe Ratios of zero, the highest observed Sharpe Ratio will typically be positive and statistically significant—unless corrected. The DSR corrects for this inflation.^[10]

Statement

Let $\{{\hat {SR}}_{1},{\hat {SR}}_{2},\dots ,{\hat {SR}}_{N}\}$ be $N$ Sharpe Ratios independently drawn from a normal distribution with mean zero and variance $\sigma ^{2}$ . Then the expected maximum Sharpe Ratio among these $N$ trials is approximately:

SR_{0}={\sqrt {\sigma ^{2}}}\cdot \left((1-\gamma )\Phi ^{-1}\left(1-{\frac {1}{N}}\right)+\gamma \Phi ^{-1}\left(1-{\frac {1}{Ne}}\right)\right)

Where:

$\Phi ^{-1}$ is the quantile function (inverse CDF) of the standard normal distribution,
$\gamma \approx 0.5772$ is the Euler–Mascheroni constant,
$e\approx 2.718$ is Euler’s number,
$N$ is the number of independent trials.

This value $SR_{0}$ is the **expected maximum Sharpe Ratio** under the null hypothesis of no skill. It represents a benchmark that any observed Sharpe Ratio must exceed in order to be considered statistically significant.

Proof Sketch

Let $X_{1},X_{2},\dots ,X_{N}\sim {\mathcal {N}}(0,1)$ be independent standard normal variables. The expected maximum of $N$ such variables is approximated by:

\mathbb {E} [\max(X_{1},\dots ,X_{N})]\approx (1-\gamma )\Phi ^{-1}\left(1-{\frac {1}{N}}\right)+\gamma \Phi ^{-1}\left(1-{\frac {1}{Ne}}\right)

Now let ${\hat {SR}}_{i}\sim {\mathcal {N}}(0,\sigma ^{2})$ for each $i$ . Then:

\mathbb {E} \left[\max({\hat {SR}}_{1},\dots ,{\hat {SR}}_{N})\right]=\sigma \cdot \mathbb {E} [\max(X_{1},\dots ,X_{N})]

Combining the two expressions gives:

SR_{0}=\sigma \cdot \left((1-\gamma )\Phi ^{-1}\left(1-{\frac {1}{N}}\right)+\gamma \Phi ^{-1}\left(1-{\frac {1}{Ne}}\right)\right)

If $\sigma ^{2}$ is estimated as the cross-sectional variance of Sharpe Ratios $\mathbf {V} [{\hat {SR}}_{n}]$ , then:

SR_{0}={\sqrt {\mathbf {V} [{\hat {SR}}_{n}]}}\cdot \left((1-\gamma )\Phi ^{-1}\left(1-{\frac {1}{N}}\right)+\gamma \Phi ^{-1}\left(1-{\frac {1}{Ne}}\right)\right)

This completes the derivation.

Implication for the DSR

The False Strategy Theorem shows that in large-scale testing, even unskilled strategies will produce apparently "significant" Sharpe Ratios. To correct for this, the DSR adjusts the observed Sharpe Ratio by subtracting the expected maximum from noise, $SR_{0}$ , and scaling by its standard error:

{\text{DSR}}=\Phi \left({\frac {{\hat {SR}}-SR_{0}}{\sigma _{\hat {SR}}}}\right)

This yields the probability that the observed Sharpe Ratio reflects true skill, not selection bias or overfitting.

Confidence and Power of the Sharpe Ratio under Multiple Testing

To assess the significance of Sharpe Ratios under multiple testing, López de Prado (2018) derives closed-form expressions for the Type I and Type II errors.

Confidence

The probability that a discovered strategy is not a false positive (i.e., the confidence) is:

{\text{Confidence}}=1-\alpha _{K}=\left(\Phi \left({\frac {{\hat {SR}}\cdot {\sqrt {T-1}}}{\sqrt {1-\gamma _{3}{\hat {SR}}+{\frac {\gamma _{4}-1}{4}}{\hat {SR}}^{2}}}}\right)\right)^{K}

Where:

$T$ is the number of return observations,
$\gamma _{3}$ and $\gamma _{4}$ are the skewness and kurtosis of returns,
$K$ is the number of effectively independent trials.^[15]

Power

The power of a test is the probability of correctly identifying a positive. This is also known in machine learning as the test's true positive rate or recall, and sensitivity in medicine. Given an alternative hypothesis $H_{1}:SR^{*}$ , the power of the test is:

{\text{Power}}=1-\beta _{K}=1-\left(\Phi \left(\Phi ^{-1}\left[(1-\alpha _{K})^{1/K}\right]-\theta \right)\right)^{K}

With:

\theta ={\frac {SR^{*}\cdot {\sqrt {T-1}}}{\sqrt {1-\gamma _{3}{\hat {SR}}+{\frac {\gamma _{4}-1}{4}}{\hat {SR}}^{2}}}}

These equations quantify the reliability of observed Sharpe Ratios under multiple testing and return non-normality.^[15]

Sample size

A related concept is the Minimum Track Record Length (MinTRL), which computes the minimum sample size needed such that a null hypothesis $SR_{0}$ is rejected with confidence $(1-\alpha _{K})$ and power 0.5, given an observed ${\hat {SR}}$ .^[11] For example, given an observed annualized ${\hat {SR}}=0.95$ , we need approximately 3 years worth of daily strategy returns in order to reject the null hypothesis $H_{0}:SR_{0}=0$ with confidence 95% and power 50%. This provides mathematical support to the common expectation among investors that a hedge fund must produce track records with a minimum length of 3 years, which may be reduced to 2 years for Sharpe ratios above 1.15. It is important to understand MinTRL as a minimum requirement, since for this sample size the power of the test is 50% (higher power thresholds will require longer track records).

References

^ ^a ^b ^c ^d Bailey, D. H., & López de Prado, M. (2014). The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting, and Non-Normality. The Journal of Portfolio Management, 40(5), 94–107.
^ ^a ^b ^c López de Prado, M., & Lewis, M. J. (2019): Detection of False Investment Strategies Using Unsupervised Learning Methods. Quantitative Finance, 19(9), pp.1555-1565.
^ Prado, Marcos López de (2018-07-02). "The 10 Reasons Most Machine Learning Funds Fail". The Journal of Portfolio Management. 44 (6): 120–133. doi:10.3905/jpm.2018.44.6.120. ISSN 0095-4918.
^ Lo, Andrew W. (2002-07-01). "The Statistics of Sharpe Ratios". Financial Analysts Journal. 58 (4): 36–52. doi:10.2469/faj.v58.n4.2453. ISSN 0015-198X.
^ ^a ^b Bailey, D. H., & Borwein, J. & López de Prado, M. (2014): "Pseudo-Mathematics and Financial Charlatanism: The Effects of Backtest Overfitting on Out-Of-Sample Performance". Notices of the American Mathematical Society, 61(5), pp. 458-471.
^ ^a ^b ^c Andrews, Michelle (2023-08-01). "A Practitioner's Guide to the Optimal Number of Clusters Algorithm". The Journal of Financial Data Science. 5 (3): 66–79. doi:10.3905/jfds.2023.1.133. ISSN 2640-3943.
^ ^a ^b ^c ^d López de Prado, Marcos M. (2020). Machine Learning for Asset Managers. Elements in Quantitative Finance. Cambridge: Cambridge University Press. ISBN 978-1-108-79289-9.
^ Lopez de Prado, Marcos (15 January 2020). "Statistical Association (Presentation Slides)". SSRN. SSRN 3512994.
^ Marti, Gautier; Nielsen, Frank; Bińkowski, Mikołaj; Donnat, Philippe (2021), Nielsen, Frank (ed.), "A Review of Two Decades of Correlations, Hierarchies, Networks and Clustering in Financial Markets", Progress in Information Geometry: Theory and Applications, Cham: Springer International Publishing, pp. 245–274, doi:10.1007/978-3-030-65459-7_10, ISBN 978-3-030-65459-7, retrieved 2025-05-21
^ ^a ^b López de Prado, M., & Bailey, D. H. (2018). The False Strategy Theorem: A Financial Application of Experimental Mathematics. American Mathematical Monthly, Volume 128, Number 9, pp. 825-831.
^ ^a ^b Bailey, David; Lopez de Prado, Marcos (Winter 2012). "The Sharpe Ratio Efficient Frontier". Journal of Risk. 15 (2): 36. doi:10.21314/JOR.2012.255. SSRN 1821643.
^ López de Prado, M. (2019): A Data Science Solution to the Multiple-Testing Crisis in Financial Research. Journal of Financial Data Science, 1(1), pp. 99-110.
^ Fabozzi, Frank J.; Prado, Marcos López de (2018-11-01). "Being Honest in Backtest Reporting: A Template for Disclosing Multiple Tests". The Journal of Portfolio Management. 45 (1): 141–147. doi:10.3905/jpm.2018.45.1.141. ISSN 0095-4918.
^ Andrews, Michelle (2023-08-01). "A Practitioner's Guide to the Optimal Number of Clusters Algorithm". The Journal of Financial Data Science. 5 (3): 66–79. doi:10.3905/jfds.2023.1.133. ISSN 2640-3943.
^ ^a ^b López de Prado, M. (2022): "Type I and Type II Errors of the Sharpe Ratio under Multiple Testing", The Journal of Portfolio Management, 49(1), pp. 39 - 46

[DSR-1] Bailey, D. H., & López de Prado, M. (2014). The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting, and Non-Normality. The Journal of Portfolio Management, 40(5), 94–107.

[Detection-2] López de Prado, M., & Lewis, M. J. (2019): Detection of False Investment Strategies Using Unsupervised Learning Methods. Quantitative Finance, 19(9), pp.1555-1565.

[3] Prado, Marcos López de (2018-07-02). "The 10 Reasons Most Machine Learning Funds Fail". The Journal of Portfolio Management. 44 (6): 120–133. doi:10.3905/jpm.2018.44.6.120. ISSN 0095-4918.

[4] Lo, Andrew W. (2002-07-01). "The Statistics of Sharpe Ratios". Financial Analysts Journal. 58 (4): 36–52. doi:10.2469/faj.v58.n4.2453. ISSN 0015-198X.

[AMS-5] Bailey, D. H., & Borwein, J. & López de Prado, M. (2014): "Pseudo-Mathematics and Financial Charlatanism: The Effects of Backtest Overfitting on Out-Of-Sample Performance". Notices of the American Mathematical Society, 61(5), pp. 458-471.

[:0-6] Andrews, Michelle (2023-08-01). "A Practitioner's Guide to the Optimal Number of Clusters Algorithm". The Journal of Financial Data Science. 5 (3): 66–79. doi:10.3905/jfds.2023.1.133. ISSN 2640-3943.

[:1-7] López de Prado, Marcos M. (2020). Machine Learning for Asset Managers. Elements in Quantitative Finance. Cambridge: Cambridge University Press. ISBN 978-1-108-79289-9.

[8] Lopez de Prado, Marcos (15 January 2020). "Statistical Association (Presentation Slides)". SSRN. SSRN 3512994.

[9] Marti, Gautier; Nielsen, Frank; Bińkowski, Mikołaj; Donnat, Philippe (2021), Nielsen, Frank (ed.), "A Review of Two Decades of Correlations, Hierarchies, Networks and Clustering in Financial Markets", Progress in Information Geometry: Theory and Applications, Cham: Springer International Publishing, pp. 245–274, doi:10.1007/978-3-030-65459-7_10, ISBN 978-3-030-65459-7, retrieved 2025-05-21

[FST-10] López de Prado, M., & Bailey, D. H. (2018). The False Strategy Theorem: A Financial Application of Experimental Mathematics. American Mathematical Monthly, Volume 128, Number 9, pp. 825-831.

[:2-11] Bailey, David; Lopez de Prado, Marcos (Winter 2012). "The Sharpe Ratio Efficient Frontier". Journal of Risk. 15 (2): 36. doi:10.21314/JOR.2012.255. SSRN 1821643.

[DataScience-12] López de Prado, M. (2019): A Data Science Solution to the Multiple-Testing Crisis in Financial Research. Journal of Financial Data Science, 1(1), pp. 99-110.

[13] Fabozzi, Frank J.; Prado, Marcos López de (2018-11-01). "Being Honest in Backtest Reporting: A Template for Disclosing Multiple Tests". The Journal of Portfolio Management. 45 (1): 141–147. doi:10.3905/jpm.2018.45.1.141. ISSN 0095-4918.

[14] Andrews, Michelle (2023-08-01). "A Practitioner's Guide to the Optimal Number of Clusters Algorithm". The Journal of Financial Data Science. 5 (3): 66–79. doi:10.3905/jfds.2023.1.133. ISSN 2640-3943.

[Power-15] López de Prado, M. (2022): "Type I and Type II Errors of the Sharpe Ratio under Multiple Testing", The Journal of Portfolio Management, 49(1), pp. 39 - 46

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

Relation to the Sharpe Ratio

Applying the Deflated Sharpe Ratio in Practice

1. Get a record of all the trials.

2. Estimating the Effective Number of Trials N.

3. Compute the Sharpe ratio variance, across clusters.

4. Compute the Expected Maximum Sharpe ratio using the False Strategy Theorem.

5. Compute the DSR for each cluster.

6. Complete the Template for Disclosing Multiple Tests.

7. Derive a conclusion from these results.

Mathematical Definitions

The Deflated Sharpe Ratio (DSR)

False Strategy Theorem: Statement and Proof

Statement

Proof Sketch

Implication for the DSR

Confidence and Power of the Sharpe Ratio under Multiple Testing

Confidence

Power

Sample size

See also

References