Deep learning-based dimensional emotion recognition for conversational agent-based cognitive behavioral therapy

View article
PeerJ Computer Science

Main article text

 

Introduction

  1. Development of a transformer-based model: We introduce a transformer-based model for dimensional text-based emotion recognition that captures a broad spectrum of emotional complexities. This model significantly outperforms existing state-of-the-art models in recognizing emotional states’ dimensions, specifically valence, arousal, and dominance (VAD), based on individual text messages.

  2. Creation of a new dimensional emotion dataset: We created a novel dimensional emotion dataset using a dataset transformation schema that integrated both publicly available categorical and dimensional data sources into one data pool. This new dataset, comprising 75,503 samples, features a more balanced distribution of VAD and was crucial for fine-tuning our model.

  3. Application to CA-based CBT: The advanced emotion recognition capabilities of our model were integrated into a CA-based CBT system. A feasibility study involving 20 participants evaluated not only the model’s technical effectiveness but also its usability, acceptance, and ability to convey empathy, which are key factors for applications in therapeutic contexts such as iCBT.

Dimensional Emotion Recognition for CA-based CBT

Overall conceptual design

Transformer-based model for dimensional emotion recognition

Emotion dataset merging and training

  • The Emobank dataset (Buechel & Hahn, 2022) consists of over 10,000 annotated English sentences, sourced from previously categorically annotated datasets (the manually annotated sub-corpus of the American National Corpus (Ide et al., 2008) and the SemEval-2007 Task 14 AffectiveText Corpus (Strapparava & Mihalcea, 2007). Each sentence in the dataset was rated on its VAD value by five distinct human judges.

  • The GoEmotions dataset (Demszky et al., 2020) consists of 58,000 annotated comments of the social media platform reddit.com (https://www.reddit.com/, accessed 23.11.2023), making it the largest emotion dataset annotated by humans currently. Each sentence in the dataset was labeled by three, and in cases of indecision, five human judges with one of 28 emotional categories.

  • The International Survey on Emotion Antecedents and Reactions (ISEAR) (ISEAR dataset, https://paperswithcode.com/dataset/isear, accessed 22.08.2023) is an annotated dataset that comprises 7,503 sentences, annotated for the emotional categories of joy, fear, anger, sadness, disgust, shame, and guilt by psychology students and non-psychology students.

  • The CrowdFlower dataset (Sentiment Analysis in Text - Dataset by crowdflower, https://data.world/crowdflower/sentiment-analysis-in-text, accessed 07.08.2022) is comprised of over 7,500 tweets, annotated by the emotional categories: empty, sadness, enthusiasm, neutral, worry, sadness, love, fun, hate, happiness, relief, boredom, surprise or anger.

Implementation

Evaluation

Technical evaluation

Methodology

Results

User study

Methodology

Participants

Results

Discussion

Implications of deep learning-based dimensional emotion recognition for iCBT

Limitations

System design

Emotion recognition approach and model training

Biases in data and annotation

Significance and comparability of user study

Conclusion

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Julian Striegl conceived and designed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Jordan Wenzel Richter conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Leoni Grossmann analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Björn Bråstad analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Marie Gotthardt analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Christian Rück analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

John Wallert analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Claudia Loitsch conceived and designed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The model is available at GitHub and Zenodo:

- https://github.com/JulianStriegl/dimensional-er-cbt.

- Julian Striegl. (2024). JulianStriegl/dimensional-er-cbt: Initial Release (v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.11091285.

Funding

The authors received no funding for this work.

1 Citation 1,163 Views 73 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more