The Backstage of Backtesting

December 15th, 2021 3 minutes read

In our previous article, “Digital Data: Gold mine or Garbage Dump”, we introduce the potential that digital data holds for traders. But how can traders utilize this information?

From a signal standpoint and given all the traps, traders must focus on nowcasting instead of the usual forecasting models. Through nowcasting, traders can get

  • Direct measurements that always hold true because they do not rely on a statistical lead-lag relationship
  • Short-range predictions are statistically more reliable than long-range ones, which also implies that most published discoveries or signals in finance are false after a while

As a smart trader, you know the importance of backtesting your trading signals to develop or refine your trading system. The idea is that your trading strategies that may have worked well in the past may stop doing so in the future. If you are smart but less experienced with what machine learning may have brought to the market, you may not have paid attention to how backtesting must change to truly contribute to alpha creation and trading in general. There are two related issues here: (a) machine learning has the power to democratize access to data relationships but on the other hand, (b) financial data has a very low signal to noise ratio.

We define the basic framework in two ways:

  1. We will identify at least two statistical sets, then run the result on one and compare it to the other.
  2. We will rely on rational observation: if the two sets are very similar, the results are likely to be very similar. When this happens, the backtesting is not informative.

 From this, we now know that we should differentiate the data sets. As we focus on market realities (thanks to the no-arbitrage principle), we should also note that there are no parallel universes in financial markets. From here, we can conclude that financial market data is limited.

5 Common Traps When Backtesting

The most important trap you’ll find when backtesting has to do with economic reasoning. Here are some of the other ones that I’ve found through the years:

Trap No. 1: Using the same historical data or finding combinations that yield statistically significant results (i.e., creating false positives or data snooping)

Trap No. 2: Ignoring rational economic theory--or expecting the same results to hold in the future even if not all conditions are the same.  

Trap No. 3: Knowing that the larger the dataset, the more behaviors you can expect, but ignoring research that indicates that Sharpe ratios quickly deteriorate when applying strategies backed by the same historical data.

Trap No. 4: When more and more traders are equipped with the same signals, their trades become very crowded and their signals less potent.

Trap No. 5: Not having guardrails in place to protect you and your trading system. Traders must now keep backtesting and adjusting old signals as often as possible.

Can This Age Of Big Data Predict the Future?

The answer is no. With machine learning, it's tempting to think that traders can perform miracles and predict the future with all these zettabytes of data and their computing resources. Unfortunately, while you can certainly learn a lot through signals, there is a limit to what you can gain from the data you process. More data does not always mean better forecasts. 

In addition, there are no accurate out-of-sample data sets in the financial world. In fact, the only true out-of-sample is live trading. This is why you must backtest dynamically and continuously. Combine this with cross-validation, and you keep track of various combinations of variables.

Are We Ignoring Economic Reasoning?

Economics, unlike many other fields (e.g., physics, medicine), don't have the luxury of being able to carry out extensive out-of-sample tests. When you do, it’s critical to consider individual behaviors (humanity's preferences, needs, and attitudes) that change over time. You can develop tests covering different economic regimes and see the direct impact of changes in individual and collective behaviors and their impact on models and data sets.


For a deeper insight, please see our article “Backtesting in the Age of Financial Machine Learning”.



Related Blogs
Generating Alpha: Machine Learning Helps Traders
Investors are adopting machine learning as a strategy to identify alpha and gain market advantage through sentiment analysis, alternative data insights and maximizing gains. However, the implementation of a professional’s analytical skills and moral values are important in the decision-making processes.

October 1st, 2021

Key Benefits of Machine Learning
Market professionals' main goal is producing unique and fresh alpha. Machine learning is one of the most valuable tools. We have listed its key benefits below.

October 15th, 2021

What is Machine Learning?
Machine Learning is the most prominent topic in tech right now. However, it can be greatly beneficial to traders if rightly incorporated into their strategies.

November 1st, 2021