RT Journal Article
SR Electronic
T1 Fund Asset Inference Using Machine Learning Methods: What’s in That Portfolio?
JF The Journal of Financial Data Science
FD Institutional Investor Journals
SP 98
OP 107
DO 10.3905/jfds.2019.1.005
VO 1
IS 3
A1 David Byrd
A1 Sourabh Bajaj
A1 Tucker Hybinette Balch
YR 2019
UL https://pm-research.com/content/1/3/98.abstract
AB Given only the historic net asset value of a large-cap mutual fund, which members of some universe of stocks are held by the fund? Discovering an exact solution is combinatorially intractable because there are, for example, C(500, 30) or 1.4 × 10^48 possible portfolios of 30 stocks drawn from the S&P 500. The authors extend an existing linear clones approach and introduce a new sequential oscillating selection method to produce a computationally efficient inference. Such techniques could inform efforts to detect fund window dressing of disclosure statements or to adjust market positions in advance of major fund disclosure dates. The authors test the approach by tasking the algorithm with inferring the constituents of exchange-traded funds for which the components can be later examined. Depending on the details of the specific problem, the algorithm runs on consumer hardware in 8 to 15 seconds and identifies target portfolio constituents with an accuracy of 88.2% to 98.6%. TOPICS: Big data/machine learning, statistical methods, portfolio management/multi-asset allocation
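
The portfolio count quoted in the abstract can be checked with a few lines of Python. This is only an arithmetic sketch of the C(500, 30) figure, not part of the authors' method; the names `UNIVERSE_SIZE`, `PORTFOLIO_SIZE`, and `count_portfolios` are illustrative assumptions and do not appear in the article.

```python
import math

# Assumed for illustration: a 500-stock universe (S&P 500) and a
# 30-stock portfolio, matching the example in the abstract.
UNIVERSE_SIZE = 500
PORTFOLIO_SIZE = 30


def count_portfolios(universe_size: int, portfolio_size: int) -> int:
    """Number of distinct unordered portfolios of the given size."""
    return math.comb(universe_size, portfolio_size)


count = count_portfolios(UNIVERSE_SIZE, PORTFOLIO_SIZE)

# Print in scientific notation for comparison with the abstract's figure.
print(f"C({UNIVERSE_SIZE}, {PORTFOLIO_SIZE}) = {count:.1e}")
```

The count covers only which stocks are held, not their weights, so the space of full portfolios is larger still. That is why the abstract treats exhaustive search as intractable.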