Next Article in Journal
The role of problem solving ability on innovative behavior and opportunity recognition in university students
Previous Article in Journal
Dynamics from open innovation to evolutionary change
Article Menu

Article Versions

Export Article

Open AccessArticle
J. Open Innov. Technol. Mark. Complex. 2018, 4(1), 3; doi:10.1186/s40852-018-0080-9 (registering DOI)

About relationship between business text patterns and financial performance in corporate data

1
Korea Institue of Science and Technology Information, Seoul, Republic of Korea
2
The Graduate School of the University of Seoul, Seoul, Republic of Korea
*
Author to whom correspondence should be addressed.
Received: 15 October 2017 / Accepted: 4 January 2018 / Published: 2 February 2018
Download PDF [2998 KB, uploaded 8 February 2018]

Abstract

This study uses text and data mining to investigate the relationship between the text patterns of annual reports published by US listed companies and sales performance. Taking previous research a step further, although annual reports show only past and present financial information, analyzing text content can identify sentences or patterns that indicate the future business performance of a company. First, we examine the relation pattern between business risk factors and current business performance. For this purpose, we select companies belonging to two categories of US SIC (Standard Industry Classification) in the IT sector, 7370 and 7373, which include Twitter, Facebook, Google, Yahoo, etc. We manually collect sales and business risk information for a total of 54 companies that submitted an annual report (Form 10-K) for the last three years in these two categories. To establish a correlation between patterns of text and sales performance, four hypotheses were set and tested. To verify the hypotheses, statistical analysis of sales, statistical analysis of text sentences, sentiment analysis of sentences, clustering, dendrogram visualization, keyword extraction, and word-cloud visualization techniques are used. The results show that text length has some correlation with sales performance, and that patterns of frequently appearing words are correlated with the sales performance. However, a sentiment analysis indicates that the positive or negative tone of a report is not related to sales performance.
Keywords: Corporate annual report, 10-k, Text mining, Business keyword, Financial performance, Keyword trends, Word cloud, Sentiment analysis, Correlation coefficient, Hierarchical clustering, Dendrogram Corporate annual report, 10-k, Text mining, Business keyword, Financial performance, Keyword trends, Word cloud, Sentiment analysis, Correlation coefficient, Hierarchical clustering, Dendrogram
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Lee, B.; Park, J.-H.; Kwon, L.; Moon, Y.-H.; Shin, Y.; Kim, G.; Kim, H.-J. About relationship between business text patterns and financial performance in corporate data. J. Open Innov. Technol. Mark. Complex. 2018, 4, 3.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
J. Open Innov. Technol. Mark. Complex. EISSN 2199-8531 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top