Katsafados, Apostolos G. and Androutsopoulos, Ion and Chalkidis, Ilias and Fergadiotis, Manos and Leledakis, George N. and Pyrgiotakis, Emmanouil G. (2020): Textual Information and IPO Underpricing: A Machine Learning Approach.
Preview |
PDF
MPRA_paper_103813.pdf Download (684kB) | Preview |
Abstract
This study examines the predictive power of textual information from S-1 filings in explaining IPO underpricing. Our empirical approach differs from previous research, as we utilize several machine learning algorithms to predict whether an IPO will be underpriced, or not. We analyze a large sample of 2,481 U.S. IPOs from 1997 to 2016, and we find that textual information can effectively complement traditional financial variables in terms of prediction accuracy. In fact, models that use both textual data and financial variables as inputs have superior performance compared to models using a single type of input. We attribute our findings to the fact that textual information can reduce the ex-ante valuation uncertainty of IPO firms, thus leading to more accurate estimates.
Item Type: | MPRA Paper |
---|---|
Original Title: | Textual Information and IPO Underpricing: A Machine Learning Approach |
English Title: | Textual Information and IPO Underpricing: A Machine Learning Aapproach |
Language: | English |
Keywords: | Initial public offerings; First-day returns; Machine learning; Natural language processing |
Subjects: | G - Financial Economics > G0 - General > G02 - Behavioral Finance: Underlying Principles G - Financial Economics > G1 - General Financial Markets > G14 - Information and Market Efficiency ; Event Studies ; Insider Trading G - Financial Economics > G3 - Corporate Finance and Governance > G30 - General G - Financial Economics > G3 - Corporate Finance and Governance > G32 - Financing Policy ; Financial Risk and Risk Management ; Capital and Ownership Structure ; Value of Firms ; Goodwill |
Item ID: | 103813 |
Depositing User: | Dr George Leledakis |
Date Deposited: | 28 Oct 2020 11:33 |
Last Modified: | 28 Oct 2020 11:33 |
References: | Ambrose, B. W., & Megginson, W. L. (1992). The role of asset structure, ownership structure, and takeover defenses in determining acquisition likelihood. Journal of Financial and Quantitative Analysis, 27, 575-589. Arnold, T., Fishe, R. P. H., & North, D. (2010). The effects of ambiguous information on initial and subsequent IPO returns. Financial Management, 39, 1497-1519. Balakrishnan, R., Qiu X. Y., & Srinivasan P. (2010). On the predictive ability of narrative disclosures in annual reports. European Journal of Operational Research, 202, 789-801. Banerjee, S., Dai, L., & Shrestha, K. (2011). Cross-country IPOs: What explains differences in underpricing?. Journal of Corporate Finance, 17, 1289-1305. Barnes, P. (1998). Can takeover targets be identified by statistical techniques? Some UK evidence. Journal of the Royal Statistical Society: Series D (The Statistician), 47, 573-591. Baron, D. P. (1982). A model of the demand for investment banking advising and distribution services for new issues. Journal of Finance, 37, 955-976. Baron, D. P., & Holmstrom, B. (1980). The investment banking contract for new issues under asymmetric information: Delegation and the incentive problem. Journal of Finance, 35, 1115-1138. Bastı, E., Kuzey, C., & Delen, D. (2015). Analyzing initial public offerings' short-term performance using decision trees and SVMs. Decision Support Systems, 73, 15-27. Beatty, R. P., & Ritter, J. R. (1986). Investment banking, reputation, and the underpricing of initial public offerings. Journal of Financial Economics, 15, 213-232. Beatty, R. P., & Welch, I. (1996). Issuer expenses and legal liability in initial public offerings. Journal of Law and Economics, 39, 545-602. Benveniste, L. M., Ljungqvist, A., Wilhelm Jr., W. J., & Yu, X. (2003). Evidence of information spillovers in the production of investment banking services. Journal of Finance, 58, 577-608. Bhabra, H. S., & Pettway, R. H. (2003). IPO prospectus information and subsequent performance. Financial Review, 38, 369-397. Bodnaruk, A., Loughran, T., & McDonald, B. (2015). Using 10-K text to gauge financial constraints. Journal of Financial and Quantitative Analysis, 50, 623-646. Brau, J. C., Cicon, J., & McQueen, G. (2016). Soft strategic information and IPO underpricing. Journal of Behavioral Finance, 17, 1-17. Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123-140. Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32. Brown, S. V, & Tucker, J. W. (2011). Large-sample evidence on firms' year-over-year MD&A modifications. Journal of Accounting Research, 49, 309-346. Butler, A. W., Keefe, M. Ο. C., & Kieschnick, R. (2014). Robust determinants of IPO underpricing and their implications for IPO research. Journal of Corporate Finance, 27, 367-383. Cao, L. (2003). Support vector machines experts for time series forecasting. Neurocomputing, 51, 321-339. Carter, R., & Manaster, S. (1990). Initial public offerings and underwriter reputation. Journal of Finance, 45, 1045-1067. Carter, R. B., Dark, F. H., & Singh, A. K. (1998). Underwriter reputation, initial returns, and the long-run performance of IPO stocks. Journal of Finance, 53, 285-311. Chahine, S. (2008). Underpricing versus gross spread: New evidence on the effect of sold shares at the time of IPOs. Journal of Multinational Financial Management, 18, 180-196. Chen, G., Firth, M., & Kim, J. B. (2004). IPO underpricing in China’s new stock markets. Journal of Multinational Financial Management, 14, 283-302. Chen, Y. S., Chen, J. S., & Cheng, C. H. (2010). An alternate method of examining IPO returns. Intelligent Automation and Soft Computing, 16, 151-161. Cheng, C.H., Chen, Y.S., & Chen, J.S. (2007). Classifying initial returns of electronic firm’s IPOs using entropy based rough sets in Taiwan trading systems. Second International Conference on Innovative Computing, Information and Control (ICICIC). Cohen, L., Malloy, C., & Nguyen, Q. (2020). Lazy prices. Journal of Finance, 75, 1371-1415. Ding, R. (2016). Disclosure of downside risk and investors’ use of qualitative information: Evidence from the IPO prospectus’s risk factor section. International Review of Finance, 16, 73-126. Doumpos, M., Andriosopoulos, K., Galariotis, E., Makridou, G., & Zopounidis, C. (2017). Corporate failure prediction in the European energy sector: A multicriteria approach and the effect of country characteristics. European Journal of Operational Research, 262, 347-360. Doumpos, M., Kosmidou, K., & Pasiouras, F. (2004). Prediction of acquisition targets in the UK: A multicriteria approach. Operational Research: An International Journal, 4, 191-211. Engelen, P. J., & Van Essen, M. (2010). Underpricing of IPOs: Firm-, issue-and country-specific characteristics. Journal of Banking and Finance, 34, 1958-1969. Espahbodi, H., & Espahbodi, P. (2003). Binary choice models for corporate takeover. Journal of Banking and Finance, 27, 549-574. Ferris, S. P., Hao, Q., & Liao, M. Y. (2013). The effect of issuer conservatism on IPO pricing and performance, Review of Finance, 17, 933-1027. Gandhi, P., Loughran, T., & McDonald, B. (2019). Using annual report sentiment as a proxy for financial distress in U.S. banks. Journal of Behavioral Finance, 20, 424-436. Gao, X., Ritter, J. R., & Zhu, Z. (2013). Where have all the IPOs gone? Journal of Financial and Quantitative Analysis, 48, 1663-1692. Geng, R., Bose, I., & Chen, X. (2015). Prediction of financial distress: An empirical study of listed Chinese companies using data mining. European Journal of Operational Research, 241, 236-247. Gogas, P., Papadimitriou, T., & Agrapetidou, A. (2018). Forecasting bank failures and stress testing: A machine learning approach. International Journal of Forecasting, 34, 440-455. Goldberg, Y. (2017). Neural network methods for natural language processing. Morgan & Claypool Publishers. Hagenau, M., Liebmann, M., & Neumann, D. (2013). Automated news reading: Stock price prediction based on financial news using context-capturing features. Decision Support Systems, 55, 685-697. Hanley, K. W., & Hoberg, G. (2010). The information content of IPO prospectuses. Review of Financial Studies, 23, 2821-2864. Hanley, K. W., & Hoberg, G. (2012). Litigation risk, strategic disclosure and the underpricing of initial public offerings. Journal of Financial Economics, 103, 235-254. Hasbrouck, J. (1985). The characteristics of takeover targets q and other measures. Journal of Banking and Finance, 9, 351-362. Hauser, S., Yaari, U., Tanchuma, Y., & Baker, H. (2006). Initial public offering discount and competition. Journal of Law and Economics, 49, 331-351. Howland, P., Jeon, M., & Park, H. (2003). Structure preserving dimension reduction for clustered text data based on the generalized singular value decomposition. SIAM Journal on Matrix Analysis and Applications, 25, 165-179. Huang, W., Nakamori, Y., & Wang, S. Y. (2005). Forecasting stock market movement direction with support vector machine. Computers & Operations Research, 32, 2513-2522. Ibbotson, R. G. (1975). Price performance of common stock new issues. Journal of Financial Economics, 2, 235-272. Ibbotson, R. G., Sindelar, J. L., & Ritter, J. R. (1994). The market’s problems with the pricing of initial public offerings. Journal of Applied Corporate Finance, 7, 66-74. Imbens, G. W. (1992). An efficient method of moments estimator for discrete choice models with choice-base sampling. Econometrica, 60, 1187-1214. Jain, B. A., & Nag, B. N. (1995). Artificial neural network models for pricing initial public offerings. Decision Sciences, 26, 283-302. Katsafados, A. G., Androutsopoulos, I., Chalkidis, I., Fergadiotis, E., Leledakis, G. N., & Pyrgiotakis, E. G. (2020). Using textual analysis to identify merger participants: Evidence from U.S. banking industry. Working Paper. Available at SSRN: https://ssrn.com/abstract=3474583. Kearney, C., & Liu, S. (2014). Textual sentiment in finance: A survey of methods and models. International Review of Financial Analysis, 33, 171-185. Kim, H., Howland, P., & Park, H. (2005). Dimension reduction in text classification with support vector machines. Journal of Machine Learning Research, 6, 37-53. Kim, J., Shin, S., Lee, H. S., & Oh, K. J. (2019). A machine learning portfolio allocation system for IPOs in Korean markets using GA-rough set theory. Sustainability, 11, 6803. Kumar, B. S., & Ravi, V. (2016). A survey of the applications of text mining in financial domain. Knowledge-Based Systems, 114, 128-147. Kumar, R. B., Kumar, B. S., & Prasad, C. S. S. (2012). Financial news classification using SVM. International Journal of Scientific and Research Publications, 2, 1-6. Laitinen, T., & Kankaanpaa, M. (1999). Comparative analysis of failure prediction methods: The Finnish case. European Accounting Review, 8, 67-92. Leone, A. J., Rock, S., & Willenborg, M. (2007). Disclosure of intended use of proceeds and underpricing in initial public offerings. Journal of Accounting Research, 45, 111-153. Ljungqvist, A. (2007). IPO Underpricing. In B.E. Eckbo, (Eds.), Handbook of Corporate Finance: Empirical Corporate Finance (pp. 375-422). Amsterdam: Elsevier-North Holland. Ljungqvist, A., & Wilhelm Jr, W. J. (2003). IPO pricing in the dot‐com bubble. Journal of Finance, 58, 723-752. Ljungqvist, A., & Wilhelm Jr, W. J. (2005). Does prospect theory explain IPO market behavior? Journal of Finance, 60, 1759-1790. Logue, D. E. (1973). On the pricing of unseasoned equity issues: 1965-1969. Journal of Financial and Quantitative Analysis, 25, 133-141. Loughran, T., & McDonald, B. (2011). When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. Journal of Finance, 66, 35-65. Loughran, T., & McDonald, B. (2013). IPO First-day returns, offer price revisions, volatility, and form S-1 language. Journal of Financial Economics, 109, 307-326. Loughran, T., & McDonald, B. (2016). Textual analysis in accounting and finance: A survey. Journal of Accounting Research, 54, 1187-1230. Loughran, T., & Ritter, J. (2004). Why has IPO underpricing changed over time? Financial Management, 33, 5-37. Ly, T. H., & Nguyen, K. (2020). Do words matter: Predicting IPO performance from prospectus sentiment. 2020 IEEE 14th International Conference on Semantic Computing. Mai, F., Tian, S., Lee, C., & Ma, L. (2019). Deep learning models for bankruptcy prediction using textual disclosures. European Journal of Operational Research, 274, 743-758. Min, J. H., & Lee, Y. C. (2005). Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters. Expert Systems with Applications, 28, 603-614. Mitsdorffer, R., Diederich, J., & Tan, C. (2002). Rule extraction from the technology IPOs in the US stock market. Proceedings of the 9th International Conference on Neural Information Processing (ICONIP’02) (IEEE Press, Piscataway), 2328-2334. Nardo, M., Petracco‐Giudici, M., & Naltsidis, M. (2016). Walking down wall street with a tablet: A survey of stock market predictions using the web. Journal of Economic Surveys, 30, 356-369. Nassirtoussi, A. K., Aghabozorgi, S., Wah, T. Y., & Ling Ngo, D. C. (2014). Text mining for market prediction: A systematic review. Expert Systems with Applications, 41, 7653-7670. Neophytou, E., & Mar Molinero, C. (2004). Predicting corporate failure in the UK: A multidimensional scaling approach. Journal of Business Finance and Accounting, 31, 677-710. Nguyen, T. H., Shirai, K., & Velcin, J. (2015). Sentiment analysis on social media for stock movement prediction. Expert Systems with Applications, 42, 9603-9611. Pai, P. F., & Lin, C. S. (2005). A hybrid ARIMA and support vector machines model in stock price forecasting. Omega, 33, 497-505. Palepu, K. G. (1986). Predicting takeover targets: A methodological and empirical analysis. Journal of Accounting and Economics, 8, 3-35. Pasiouras, F., & Tanna, S. (2010). The prediction of bank acquisition targets with discriminant and logit analyses: Μethodological issues and empirical evidence. Research in International Business and Finance, 24, 39-61. Pasiouras, F., Gaganis, C., & Zopounidis, C. (2010). Multicriteria classification models for the identification of targets and acquirers in the Asian banking sector. European Journal of Operational Research, 204, 328-335. Pasiouras, F., Gaganis, C., Tanna, S., & Zopounidis, C. (2008). An application of support vector machines in the prediction of acquisition targets: Evidence from the EU banking sector. In C. Zopounidis, M. Doumpos, & P. Pardalos (Eds.), Handbook of financial engineering (pp. 431-456). Boston: Springer. Pasiouras, F., Tanna, S., & Zopounidis, C. (2007). The identification of acquisition targets in the EU banking industry: An application of multicriteria approaches. International Review of Financial Analysis, 16, 262-281. Pestov, V. (2013). Is the k-NN classifier in high dimensions affected by the curse of dimensionality? Computers & Mathematics with Applications, 65, 1427-1437. Powell, R. G. (2001). Takeover prediction and portfolio performance: A note. Journal of Business Finance and Accounting, 28, 993-1011. Quintana, D., Chávez, F., Luque Baena, R. M., & Luna, F. (2018). Fuzzy techniques for IPO underpricing prediction. Journal of Intelligent & Fuzzy Systems, 35, 367-381. Quintana, D., Sáez, Y., & Isasi, P. (2017). Random forest prediction of IPO underpricing. Applied Sciences, 7, 636. Reber, B., Berry, B., & Toms, S. (2005). Predicting mispricing of initial public offerings. Intelligent Systems in Accounting, Finance and Management, 13, 41-59. Ritter, J. R. (1984). The “hot issue” market of 1980. Journal of Business, 57, 215-240. Ritter, J. R., & Welch, I. (2002). A review of IPO activity, pricing, and allocations. Journal of Finance, 57, 1795-1828. Rock, K. (1986). Why new issues are underpriced. Journal of Financial Economics, 15, 187-212. Routledge, B. R., Sacchetto, S., & Smith, N. A. (2017). Predicting merger targets and acquirers from text. Working Paper, Carnegie Mellon University. Schumaker, R. P., & Chen, H. (2009). Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions of Information Systems, 27, 1-19. Shin, K. S., Lee, T. S., & Kim, H. (2005). An application of support vector machines in bankruptcy prediction model. Expert Systems with Applications, 28, 127-135. Sun, S., Luo, C., & Chen, J. (2017). A review of natural language processing techniques for opinion mining systems. Information Fusion, 36, 10-25. Vapnik, V. N. (1998). Statistical learning theory. (1st ed.). New York: Wiley. Veganzones, D., & Severin, E. (2018). An investigation of bankruptcy prediction in imbalanced datasets. Decision Support Systems, 112, 111-124 Wang, D., Qian, X., Quek, C., Tan, A. H., Miao, C., Zhang, X., Ng, G. S., & Zhou, Y. (2018). An interpretable neural fuzzy inference system for predictions of underpricing in initial public offerings. Neurocomputing, 319, 102-117. Wu, C. H., Tzeng, G. H., Goo, Y. J., & Fang, W. C. (2007). A real-valued genetic algorithm to optimize the parameters of support vector machine for predicting bankruptcy. Expert Systems with Applications, 32, 397-408. |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/103813 |