Architecting an enterprise financial management model: leveraging multi-head attention mechanism-transformer for user information transformation
Author and article information
Abstract
Financial management assumes a pivotal role as a fundamental information system contributing to enterprise development. Nonetheless, prevalent methodologies frequently encounter challenges in proficiently overseeing diverse information streams inherent to financial management. This study introduces an innovative paradigm for enterprise financial management centered on the transformation of user information signals. In its initial phases, the methodology augments the Transformer network and self-attention mechanism to extract features pertaining to both users and financial data, fostering a more cohesive integration of financial and user information. Subsequently, a reinforcement learning-based alignment method is implemented to reconcile disparities between financial and user information, thereby enhancing semantic alignment. Ultimately, a signal conversion technique employing generative adversarial networks is deployed to harness user information, elevating financial management efficacy and, consequently, optimizing overall financial operations. The empirical validation of this approach, achieving an impressive mAP score of 81.9%, not only outperforms existing methodologies but also underscores the tangible impact and enhanced execution prowess that this paradigm brings to financial management systems. As such, this work not only contributes to the state of the art but also holds promise for revolutionizing the landscape of enterprise financial management.
Cite this as
2024. Architecting an enterprise financial management model: leveraging multi-head attention mechanism-transformer for user information transformation. PeerJ Computer Science 10:e1928 https://doi.org/10.7717/peerj-cs.1928Main article text
Introduction
The great progress of development requires enterprises to have efficient financial management capabilities to adapt to quick business transaction processing. A good financial management system can help enterprises quickly meet the market demand and quickly deploy various resources throughout the enterprise (Sukenti, 2023). Therefore, the study of efficient and fast enterprise financial management systems has extremely high application value.
In addition, studying enterprise financial management helps assess and manage all kinds of risks faced by enterprises, including market risks, credit risks, liquidity risks, etc., to ensure steady enterprise development. Further analysis of financial data can provide decision support for enterprise leadership, help formulate strategic planning, optimize business models, and choose appropriate development directions (Winarno, Agustina & Vinola, 2020). Studying financial management helps enterprises make wise investment and financing decisions, choose the most suitable financing methods and investment projects for enterprise development, and protect shareholders’ rights and interests to the greatest extent (Bustani, Khaddafi & Ilham, 2022). Studying financial management helps enterprises adapt to global competition, understand the characteristics of international markets, formulate global financial strategies, and expand international markets (Klapper & Lusardi, 2020). Therefore, the study of enterprise financial management has extremely high theoretical value.
Enterprise financial management is a complex and multi-level research in which there are some difficulties and challenges (Mazur et al., 2021; Cui, 2023; Chen & Metawa, 2020). (1) The quality of financial data is uneven and diverse: Different enterprises may have large differences in the format, structure and specification of financial data, which leads to difficulties in data integration and analysis. (2) Finance is multi-layered: enterprise financial management involves multi-level decisions. Carrying out reasonable information transmission and decision coordination at different levels to support the realization of the enterprise’s overall financial goals is a complex issue. (3) Financial risk and uncertainty: Enterprises are faced with various and complex types of risks regarding market, credit, liquidity, etc. How to effectively quantify, identify and manage these risks and reduce uncertainty through financial management research is a challenging research direction (Abad-Segura et al., 2020; Aifuwa & Embele, 2019).
Around the above difficulties, many scholars have carried out a series of studies. Chen & Metawa (2020) posited that the swift advancement of IT has the potential to bolster organizational performance within the AIS and enhance the competitive edge of both enterprises and institutions. Nazah et al. (2022) organized and revamped the accounting table, enabling an efficient display of accounting codes and product-related data for information subjects using system queries. This enhancement aims to optimize budget management for enterprises. Ren (2022) asserted that American colleges had implemented comprehensive management and information systems encompassing budgeting, funding, analysis and decision-making. With the development of signal processing, signal conversion algorithms have become an important way to quantify financial information. Gao (2022) proposed an enterprise financial information system based on cloud technology, which helps enterprises build a powerful, simple operation and strong business expansion information system at low cost through cloud computing, deep learning and other technologies. Sitinjak et al. (2023) perfected the traditional financial information system by using the big data model based on the Meacher model.
In spite of the extensive research undertaken by the scholars above on financial information systems, persistent limitations and challenges remain. Primarily, current methodologies predominantly concentrate on singular factors in financial management, presenting challenges when dealing with multimodal information. Additionally, these approaches fall short of establishing swift connections and models between users and financial information, potentially impeding the flexibility and responsiveness of information systems. Furthermore, while innovative technologies have been introduced, further refinement is essential in certain areas, such as signal conversion algorithms and the utilization of big data models, to enhance the overall efficiency of financial information systems.
To address these issues, we propose an enterprise financial management method grounded in user information signal conversion. Recognizing that financial management involves diverse types of information, including multimodal data such as digital data, text, and images, we have employed a comprehensive approach to enhance both the Transformer network and the self-attention mechanism. The aim is to more efficiently extract features related to users and financial data, fostering a more comprehensive comprehension of various information types by the system. Subsequently, a reinforcement learning-based alignment method is introduced to swiftly reconcile disparities between financial and user information, enhancing semantic alignment. This method exhibits increased flexibility in adapting to different information modalities, ensuring coherence between the two sets of information. Lastly, a signal conversion method based on generative adversarial networks has been incorporated, allowing user information to play a more effective role in financial management. The objective of this phase is to enhance the efficiency of financial operations and elevate the decision-making process’s intelligence and accuracy by optimizing the utilization of user information.
-
Introducing a novel multimodal feature extraction method based on an enhanced Transformer to quantify both financial and user information.
-
Proposing a feature alignment method founded on reinforcement learning and a signal conversion method utilizing generative adversarial networks to model the interrelationships between users and finance.
-
Achieving superior performance when compared with other competitive methods on the Enterprise Finance Dataset.
Experiment and Analysis
Dataset and implement details
We utilized the Enterprise Finance Dataset to assess the effectiveness of the enterprise financial management method based on user information signal conversion. This dataset was sourced from attachments within company financial reports submitted to the commission and is accessible on Zenodo. The dataset primarily includes financial statements, offering a more concise representation compared to the comprehensive financial statements and the Notes dataset. The latter comprises both numerical and narrative disclosures for all financial statements and accompanying notes. The information aligns consistently with the financial reports “filed” by each registrant. The data is organized in a clear format, facilitating users in the analysis and comparison of company disclosures over time and among registrants.
Moreover, the dataset incorporates supplementary fields, such as standardized industry classifications for companies, streamlining the utilization of data. Given that Transformers, reinforcement learning, and GANs are widely utilized for big data training, we intend to establish the environment and conduct model training using CPU: Xeon(R) E5-2640 v4 and GPU: 4*Nvidia Tesla V100. Tensorflow will serve as the deep learning framework for these endeavours. The specifics of the experimental parameters can be found in Table 1. We set the learning rate decay term of the model to 0.095 and the initial learning rate to 8×10ˆ(−4), epoch set to 80, batch size set to 40, and SGD is used as the optimizer.
Parameters | Value |
---|---|
Initial learning rate | 8 × 10−4 |
Epoch | 80 |
Batch-size | 40 |
Decay | 0.95 |
Gradient descent method | SGD |
Image input size | 380 × 380 |
Image feature dimension | 1024 |
Since the enterprise financial management method is a multimodal task, we adopt the mean square Error (mAP) and F-measure as the evaluation criteria of the method, which are calculated as follows:
(11)VP=gt⋂prpr (12)VR=gt⋂prgt (13)F=2×VP×VRVP+VR (14)mAP=1N×∑VP×VR.
In the equation, pr refers to the result of the method and gt denotes the true value present in the dataset. In addition, we evaluate the performance of the enterprise financial management by the amount of calculation, the number of model parameters, and the operation time.
Compare our detection method with others
First, we conduct experiments on the improved Transformer-based multimodal feature extraction method on the Enterprise Finance dataset. We select some excellent feature models, such as Transformer (Vaswani, Shazeer & Parmar, 2017), Bert (Deepa, 2021), Oscar (Li, Yin & Li, 2020) and, VinVL (Zhang, Li & Hu, 2021) and DFT (Zhang, Xie & Ding, 2023), and compare the performance. The results are presented in Fig. 6 and Table 2. We can conclude that our method obtains the highest value in all evaluation metrics, which is 0.942 for recall, 0.915 for precision, 0.936 for F-measure, and 0.924 for mAP, while comparing with other algorithms. Compared with the Transformer, our method improves the mAP value of the model by more than 7%, mainly because we improve the Transformer and optimize the self-attention mechanism. Compared with BERT, our method has a lead of more than 6%. BERT is almost the same as a Transformer in principle, so the performance is comparable between the two. Oscar and VinVL are models that are pre-trained with big data and have better adaptability to multimodal tasks, and our method still obtains more than a 3% improvement in the mAP score. Compared with the latest DFT method, which can obtain more than 90% of the mAP value by excellent model performance, our method still obtains about a 2% advantage. For the extraction of enterprise financial features and user features, the multimodal feature extraction by the improved Transformer proposed in this article has great advantages. Through the improved self-attention mechanism and recurrent Transformer structure, the financial features and user features can be effectively extracted and aligned.
Figure 6: Compare our method with others.
Methods | Recall | Precision | F | mAP |
---|---|---|---|---|
Transformer | 0.878 | 0.839 | 0.856 | 0.851 |
Bert | 0.889 | 0.846 | 0.868 | 0.857 |
Oscar | 0.898 | 0.863 | 0.882 | 0.881 |
VinVL | 0.914 | 0.886 | 0.895 | 0.892 |
DFT | 0.923 | 0.899 | 0.912 | 0.905 |
Ours | 0.942 | 0.915 | 0.936 | 0.924 |
Then, we implemented the performance test of the reinforcement learning-based feature alignment method on the dataset. The alignment yield mainly evaluates feature alignment. Therefore, the conversion rate of feature alignment was used as the index in this experiment. We still compare our method to Transformer (Vaswani, Shazeer & Parmar, 2017), Bert (Deepa, 2021), Oscar (Li, Yin & Li, 2020), VinVL (Zhang, Li & Hu, 2021) and DFT (Zhang, Xie & Ding, 2023). Figure 7 shows that the feature alignment method by RL achieves the highest feature alignment yield, that is, 78.6%, which reaches the most advanced level in the world. In addition, the training process is recorded, and the convergence graph is generated. In Fig. 7, the training process of the proposed method is very smooth, and the lowest loss value can be obtained, which fully demonstrates the stability and scalability of the proposed method to data.
Figure 7: The performance of our signal process method.
After validating the multimodal feature extraction method based on the improved Transformer and the signal conversion method grounded in reinforcement learning, we further assessed our proposed signal conversion method using generative adversarial networks (GAN) on the dataset. Our method was benchmarked against notable models, including ActFormer (Xu, Song & Wang, 2023), Git (Wang, Yang & Hu, 2022), CP-GAN (Skariah & Thomas, 2023), and NF-ResNet (Khan et al., 2022). Evaluation metrics encompassed recall, precision, F-measure, and mAP, with results presented in Fig. 8 and Table 3.
Figure 8: The results of our method.
Methods | Recall | Precision | F | mAP |
---|---|---|---|---|
ActFormer | 0.767 | 0.782 | 0.776 | 0.768 |
Git | 0.788 | 0.795 | 0.792 | 0.785 |
CP-GAN | 0.792 | 0.804 | 0.800 | 0.796 |
NF-ResNet | 0.807 | 0.819 | 0.814 | 0.805 |
Ours | 0.823 | 0.837 | 0.826 | 0.819 |
Our method demonstrated exceptional performance, achieving the highest values of 0.823 in recall, 0.837 in precision, and 0.826 in F-measure across all evaluation metrics. In comparison with ActFormer, our method showcased a more than 5% improvement in mAP score and a 5% boost in the F-measure. Against Git, our method exhibited over a 3% F-measure lead and a 3.4% mAP value increase. In contrast to CP-GAN, our method outperformed in all aspects, with all evaluation indexes surpassing 2%. Finally, when compared with NF-ResNet, our method enhanced the mAP score by 1.4% and the F measure score by 1.2%. Leveraging a GAN model based on Unet with fewer layers and feature processing steps, our proposed signal transformation method demonstrated notably superior performance compared to other methods.
The amalgamation of the three methods above culminated in the successful implementation of an end-to-end enterprise financial management system. The system evaluation metrics encompassed model parameter quantity, inference time, Flops (floating-point operations), and training time. As depicted in Fig. 9, the approach proposed in this article exhibited optimal performance, particularly in terms of training and testing time, as well as model parameter quantity. Our method not only streamlined the model by reducing the number of parameters but also significantly shortened inference and training times, ultimately enhancing the system’s efficiency. This results in expedited decision-making and heightened responsiveness in practical applications, providing enterprises with more efficient financial management services.
Figure 9: Model efficiency comparison with other methods.
Moreover, the reduction in Flops and model parameters enhances the computational efficiency of the system, opening up adaptable deployment options for resource-constrained environments. The comprehensive approach presented in this article demonstrates superior performance across various dimensions, offering practical solutions for the development and application of enterprise financial management systems. Figure 9 visually underscores the excellence of our method across diverse evaluation metrics, serving as a valuable reference for future research and development endeavours in the realm of financial management systems.
Discussion
In this article, we introduce a novel enterprise financial management approach centred around user information signal conversion. The primary goal is to address the challenge of handling multimodal information in financial management. We progressively enhance the stages of feature extraction, feature alignment, and signal conversion by incorporating an improved Transformer-based multimodal feature extraction method, a reinforcement learning-based feature alignment method, and a generative adversarial network-based signal conversion method. These methodological refinements aim to bridge the semantic gaps between financial information and user information, establish a reciprocal relationship between the two, and ultimately optimize financial management systems.
The experimental findings underscore the effectiveness of our proposed method, yielding an mAP score of 81.9%. This outcome substantiates the significant impact of our multi-faceted and comprehensive approach to managing multimodal information and refining financial management processes. This innovative method holds promise in furnishing enterprises with more streamlined and intelligent financial management solutions, fostering adaptability in the ever-evolving business landscape.
Despite the enhancements made to enterprise financial management methods, there exist notable shortcomings that require urgent attention and improvement. The substantial demand for computing resources in our proposed approach poses limitations on its feasibility in certain environments. The heightened complexity of the Transformer network and self-attention mechanism necessitates more advanced skills for effective management and maintenance.
Moving forward, we will persist in refining and expanding this approach to better cater to the evolving demands of enterprises in the realm of financial management.
While the introduced enterprise financial management approach, centred around user information signal conversion and leveraging advanced techniques, demonstrates promising results in optimizing financial systems, its practical implementation faces notable challenges. The substantial demand for computing resources poses feasibility issues in resource-constrained environments, necessitating a thorough exploration of resource-efficient alternatives. The heightened complexity of the Transformer network and self-attention mechanism highlights the importance of addressing potential skill gaps among personnel. Additionally, integration challenges with existing financial management systems, scalability concerns, regulatory compliance, and user acceptance are vital aspects that require careful consideration. A comprehensive cost-benefit analysis and a clear roadmap for future refinements will be crucial in enhancing the method’s overall viability and addressing the identified shortcomings for broader industry adoption.
Conclusion
To cater to the dynamic requirements of sophisticated enterprise information systems, this article proposes an innovative paradigm for enterprise financial management. At the core of this approach lies the transformation of user information signals, effectively addressing the challenge of managing diverse modalities of information within the financial domain. Through enhancements such as multimodal feature extraction utilizing an optimized Transformer, a reinforcement learning-based feature alignment method, and a signal conversion method employing generative adversarial networks, we sequentially refine feature extraction, feature alignment, and signal conversion processes. These enhancements play a pivotal role in bridging the semantic gap between financial information and user data, enabling comprehensive modelling of their interrelation. Ultimately, these advancements pave the way for the optimization of the financial management system. Experimental results underscore the effectiveness of our method, attaining an mAP score of 81.9%, highlighting its potential to significantly elevate the performance of enterprise financial management systems. In our future endeavours, we aspire to delve into advanced research and implement cutting-edge algorithms and technologies to reduce dependence on hardware resources. This strategic approach aims to enhance the scalability and adaptability of our methods. Through meticulous refinement of feature extraction, alignment, and signal conversion steps, our primary objective is to elevate the overall performance of the system.
Supplemental Information
Additional Information and Declarations
Competing Interests
The authors declare that there are no conflicts of interest.
Author Contributions
Wan Yu conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.
Habib Hamam performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, and approved the final draft.
Data Availability
The following information was supplied regarding data availability:
The data is available at Zenodo: Tom. (2023). Enterprise Financial Dataset [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10013148.
The code is available in the Supplementary File.
Funding
The study was supported by the Henan Federation of Social Sciences under the project name “Research on Functional Evaluation and Improvement of Zhengzhou National Central City” (project number: SKL-2021-2633). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.