Do SEC filings indicate any trends? Evidence from the sentiment distribution of forms 10-K and 10-Q with FinBERT

DOIhttps://doi.org/10.1108/DTA-05-2022-0215
Published date21 April 2023
Date21 April 2023
Pages293-312
Subject MatterLibrary & information science,Librarianship/library management,Library technology,Information behaviour & retrieval,Metadata,Information & knowledge management,Information & communications technology,Internet
AuthorHyogon Kim,Eunmi Lee,Donghee Yoo
Do SEC lings indicate any trends?
Evidence from the sentiment
distribution of forms 10-K and 10-Q
with FinBERT
Hyogon Kim
Management of Technology, Gyeongsang National University, Jinju,
Republic of Korea
Eunmi Lee
Textile and Apparel Management, University of Missouri, Columbia, Missouri,
USA, and
Donghee Yoo
Management Information Systems, BERI, Gyeongsang National University, Jinju,
Republic of Korea
Abstract
Purpose This study quantied companiesviews on the COVID-19 pandemic with sentiment analysis of US
public companiesdisclosures. The study aims to provide timelyinsights to shareholders, investors and consumers
by exploring sentiment trends and changes in the industry and the relationship with stock price indices.
Design/methodology/approach From more than 50,000 Form 10-K and Form 10-Q published between
2020 and 2021, over one million texts related to the COVID-19 pandemic were extracted. Applying the
FinBERT ne-tuned for this study, the texts were classied into positive, negative and neutral sentiments.
The correlations between sentiment trends, dierences in sentiment distribution by industry and stock price
indices were investigated by statistically testing the changes and distribution of quantied sentiments.
Findings First, there were quantitative changes in texts related to the COVID-19 pandemic in the US
companiesdisclosures. In addition, the changes in the trend of positive and negative sentiments were found.
Second, industry patterns of positive and negative sentiment changes were similar, but no similarities were
found in neutral sentiments. Third, in analyzing the relationship between the representative US stock indices
and the sentiment trends, the results indicated a positive relationship with positive sentiments and a negative
relationship with negative sentiments.
Originality/value Performing sentiment analysis on formal documents like Securities and Exchange
Commission (SEC) lings, this study was dierentiated from previous studies by revealing the quantitative
changes of sentiment implied in the documents and the trend over time. Moreover, an appropriate data
preprocessing procedure and analysis method were presented for the time-series analysis of the SEC lings.
Keywords Sentiment analysis, Trend, FinBERT, SEC, Disclosure, COVID-19
Paper type Research paper
1. Introduction
Corporate disclosures, such as annual and quarterly reports, provide useful information to
shareholders, investors and so on, as ocial documents that contain a companysviewson
overall management. In the United States, most public companies must prepare Form 10-K
(annual reports) and Form 10-Q (quarterly reports) and submit them regularly to the United
States Securities and Exchange Commission (SEC). The SEC lings are considered valuable data
for corporate analysis because they contain quantitative data such as nancial statements and
qualitative data described from companiesperspectives, such as business overview, business
conditions, perceived risks and future plans (Stephany et al.,2020).
ThecurrentissueandfulltextarchiveofthisjournalisavailableonEmeraldInsightat:
https://www.emerald.com/insight/2514-9288.htm
293
Received 24 May 2022
Revised 26 September 2022
4 November 2022
Accepted 10 November 2022
Data Technologies and
Applications
Vol. 57 No. 2, 2023
pp. 293-312
© Emerald Publishing Limited
2514-9288
DOI 10.1108/DTA-05-2022-0215
Sentiment
analysis of
SEC ling with
FinBERT
Companies experiencing major management changes both internally and externally
have an ethical responsibility to provide the latest information to support investors
correct decision-making (Yuthas et al., 2002). In response to the COVID-19 pandemic that
shocked the entire industry, multiple companies quickly included their views on the
COVID-19 pandemic in corporate disclosures. According to a survey by Larcker et al.
(2020), in the 3,644 SEC lings collected in 2020, COVID-19 mentions rose sharply to
4.8 per cent in January, 35.5 per cent in February, 76.6 per cent in March, 100 per cent in
April and 99.9 per cent in May 2020. Therefore, through the analysis of the SEC lings
released in 20202021, it is possible to examine the views of US companies on the COVID-19
pandemic over time.
However, SEC lings are unstructured documents and consist of massive numerical and
textual data.Text mining is gaining insightsand discovering meaningfulinformation such as
patterns and important links from unstructured text data like SEC lings, newsand internet
postings. Sentiment analysis, one of the text mining techniques, aims to systematically
extract, identify, categorize and quantify emotional states, subjective information and so on
from texts. Being popular and widely used in corporate management, sentiment analysis
classies produced data like consumersexperiences and opinions on social media platforms
into positive and negative (Capuano et al.,2021;Tian et al.,2021). However, sentiment
analysis is also being attempted on documents in which authors express opinions and
sentiments less explicitly, such as SEC lings and corporate social responsibility (CSR)
reports. For example, SEC lings have been utilized in sentiment analysis studies for
corporate evaluation, including performance prediction (Azimi and Agrawal, 2021),
nancial risk prediction (Wang et al.,2013) and identication of insolvent companies
(Gandhi et al., 2019). In addition, through sentiment analysis of CSR reports, the values like
return on assets were predicted (Che et al.,2020;Myšková and Hájek, 2018).
Text mining techniques including sentiment analysis are gradually evolving for
practical data analysis. In particular, the development of deep learning and natural
language processing (NLP) has made remarkable achievements in various classication
and prediction tasks. Studies are being conducted to challenge more complex sentiment
classication using a language model such as Bidirectional Encoder Representations from
Transformers (BERT) that considers contextual meanings and bidirectional sentences.
Although sentiment analysis using text mining and NLP techniques is being actively
used, there are relatively few studies on sentiment analysis on formal documents such as
SEC lings. Moreover, few studies reveal how sentiment implied in documents changes
over time. To ll the gap in the literature, this study aims to quantitatively measure the
views and attitudes of US companies to the COVID-19 pandemic related to sentiment and to
explore changes and trends in the companiessentiment over time. The results of this study
may contribute to various investors, consumers and administrators in exploring corporate
responses and changes in their responses to the uncontrollable events aecting the entire
society like the COVID-19 pandemic and getting a better understanding of the individual
company, as well as all industries.
The detailed tasks of this study to achieve the study purpose are as follows. First,
sentiment analysis of texts related to the COVID-19 pandemic in the SEC lings was
performed. Several signicant issues, such as the WHOs pandemic declaration, increase
in conrmed COVID-19 cases and deaths, vaccine development and vaccination and the
emergence of COVID-19 variants, aected corporate disclosures. This would result in
a quantitative change in texts related to the COVID-19 pandemic and indicate certain
trends. Second, it was examined in the study whether the distribution of sentiment due to
the COVID-19 pandemic diered by industry. The COVID-19 pandemic aected industries
as a whole, but the magnitude and intensity of the impact varied by industry (Szczygielski
et al., 2022). Accordingly, it was predicted that the sentiment distribution and trend would
DTA
57,2
294

To continue reading

Request your trial

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT