TY - JOUR T1 - Fund Asset Inference Using Machine Learning Methods: <em>What’s in That Portfolio?</em> JF - The Journal of Financial Data Science DO - 10.3905/jfds.2019.1.005 SP - jfds.2019.1.005 AU - David Byrd AU - Sourabh Bajaj AU - Tucker Hybinette Balch Y1 - 2019/06/12 UR - https://pm-research.com/content/early/2019/05/21/jfds.2019.1.005.abstract N2 - Given only the historic net asset value of a large-cap mutual fund, which members of some universe of stocks are held by the fund? Discovering an exact solution is combinatorially intractable because there are, for example, C(500, 30) or 1.4 × 1048 possible portfolios of 30 stocks drawn from the S&amp;P 500. The authors extend an existing linear clones approach and introduce a new sequential oscillating selection method to produce a computationally efficient inference. Such techniques could inform efforts to detect fund window dressing of disclosure statements or to adjust market positions in advance of major fund disclosure dates. The authors test the approach by tasking the algorithm with inferring the constituents of exchange-traded funds for which the components can be later examined. Depending on the details of the specific problem, the algorithm runs on consumer hardware in 8 to 15 seconds and identifies target portfolio constituents with an accuracy of 88.2% to 98.6%.TOPICS: Big data/machine learning, statistical methods, portfolio management/multi-asset allocation ER -