PT  - JOURNAL ARTICLE
AU  - Jay Cao
AU  - Jacky Chen
AU  - John Hull
AU  - Zissis Poulos
TI  - Deep Hedging of Derivatives Using Reinforcement Learning
AID  - 10.3905/jfds.2020.1.052
DP  - 2020 Dec 22
TA  - The Journal of Financial Data Science
PG  - jfds.2020.1.052
4099  - https://pm-research.com/content/early/2020/12/21/jfds.2020.1.052.short
4100  - https://pm-research.com/content/early/2020/12/21/jfds.2020.1.052.full
AB  - This article shows how reinforcement learning can be used to derive optimal hedging strategies for derivatives when there are transaction costs. The article illustrates the approach by showing the difference between using delta hedging and optimal hedging for a short position in a call option when the objective is to minimize a function equal to the mean hedging cost plus a constant times the standard deviation of the hedging cost. Two situations are considered. In the first, the asset price follows a geometric Brownian motion. In the second, the asset price follows a stochastic volatility process. The article extends the basic reinforcement learning approach in several ways. First, it uses two different Q-functions to track both the expected value of the cost and the expected value of the square of the cost. This approach increases the range of objective functions that can be used. Second, it uses a learning algorithm that allows for continuous state and action space. Third, it compares the accounting profit and loss (P&amp;amp;L) approach and the cash flow approach. The authors find that a hybrid approach involving the use of an accounting P&amp;amp;L approach that incorporates a relatively simple valuation model works well.TOPICS: Big data/machine learning, derivatives, simulationsKey Findings▪ The authors show how reinforcement learning can be used to derive optimal hedging strategies for derivatives in the presence of transaction costs.▪ They extend the standard reinforcement learning approach by using multiple Q-functions to increase the range of objective functions that can be used and by using algorithms that allow the state space and action space to be continuous.▪ For valuing the hedging position at each step, the authors compare three approaches: the accounting P&amp;amp;L, cash flow, and a hybrid involving the use of an accounting P&amp;amp;L approach that incorporates a relatively simple valuation model.