A corpus-based analysis of potential linguistic indicators of corporate deception in tobacco industry documents
- Published
- Accepted
- Subject Areas
- Algorithms and Analysis of Algorithms, Computational Linguistics, Natural Language and Speech
- Keywords
- deception, verbs, tobacco, corporate fraud, function words, public health, linguistic analysis
- Copyright
- © 2015 Brown-Johnson et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
- Cite this article
- 2015. A corpus-based analysis of potential linguistic indicators of corporate deception in tobacco industry documents. PeerJ PrePrints 3:e1079v1 https://doi.org/10.7287/peerj.preprints.1079v1
Abstract
Introduction: To more fully understand the impact of specific language attributes on deception in corporate communication, we used a stratified random sample of tobacco industry documents to test a small number of individual potential automated linguistic indicators of corporate deception: cognitive-emotional verbs, allness and superlative terms, nonbinding verbs, and group mentality. Methods: Texts from the Tobacco Documents Corpus were categorized by audience addressed (industry internal/external) and company of origin, and compared for incidence of the deceptive language indicators. Results: Cognitive-emotional verbs were strongly associated with likely deception and most prevalent for external audiences and documents produced by the Tobacco Institute, an industry front group. Cognitive-emotional verbs include believe, think, seem, feel and realize, in opposition to action verbs (e.g. throw). Discussion: Linguistic measurements of cognitive-emotional verbs may help pinpoint deceptive and misleading corporate communication. Verb choice and the use of different classes of verbs may reflect corporate linguistic deception.
Author Comment
This is a submission to PeerJ Computer Science for review.