RT Journal Article SR Electronic T1 Mitigating Overfitting on Financial Datasets with Generative Adversarial Networks JF The Journal of Financial Data Science FD Institutional Investor Journals SP 76 OP 85 DO 10.3905/jfds.2019.1.019 VO 2 IS 1 A1 Fernando De Meer Pardo A1 Rafael Cobo López YR 2020 UL https://pm-research.com/content/2/1/76.abstract AB Overfitting is an inevitable phenomenon when applying deep learning techniques to financial data, given the relative scarcity of available historical data and the ever-changing nature of financial series. This seemingly unavoidable pitfall has heavily impaired the application of many machine learning techniques, such as deep or reinforcement learning, to financial settings. Data augmentation can be an option to circumvent this, specifically generative adversarial networks (GANs). GANs are a type of neural network architecture that focuses on sample generation. Through adversarial training, the GAN can implicitly learn the underlying structure inherent to the dynamics of financial series and acquire the capacity to generate scenarios that share many similarities to those seen in the historic time series. In this article, the authors propose a data augmentation technique using the Wasserstein GAN with gradient penalty and show how training deep models on synthetic data mitigates overfitting, improving their performance on test data when compared to models trained solely on real data.TOPICS: Statistical methods, simulations, big data/machine learningKey Findings• Generative adversarial networks can capture the complex dynamics that govern many financial assets and produce realistic synthetic scenarios based on historic data.• Synthetic financial scenarios can be used to enlarge training datasets in order to improve the accuracy and robustness of other deep learning models.• Synthetic financial series may also be applied in other tasks such as model backtesting, risk analysis, or option pricing.