Detecting temporal asymmetry after epilepsy surgery: a 3D MRI-based comparative outcome study of clinicians and lay observers

Rafael Denadai; Marina Koutsodontis Machado Alvim; Yeonah Kang; Junior Chun-Yu Tu; Brunno M. de Campos; Enrico Ghizoni; Helder Tedeschi; Clarissa Yasuda; Fernando Cendes

doi:10.7717/peerj.20201

Rafael Denadai^1, Marina Koutsodontis Machado Alvim^1,2, Yeonah Kang^3, Junior Chun-Yu Tu^4, Brunno M. de Campos^1,5, Enrico Ghizoni^1,2, Helder Tedeschi^1,2, Clarissa Yasuda^1,2,5, Fernando Cendes^1,2,5

^1 Brazilian Institute of Neuroscience and Neurotechnology (BRAINN), Universidade Estadual de Campinas, Campinas, São Paulo, Brazil

^2 Department of Neurology, School of Medical Sciences, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil

^3 Department of Radiology, Seo II Medical Group Clinic, Busan, Republic of South Korea

^4 Department of Plastic and Reconstructive Surgery and Craniofacial Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan

^5 Neuroimaging Laboratory, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil

Published: 2025-10-30
Accepted: 2025-09-16
Received: 2025-03-06

Academic Editor: Nikolaos Gkantidis

Subject Areas: Anatomy and Physiology, Neurology, Psychiatry and Psychology, Radiology and Medical Imaging
Keywords: Epilepsy, Asymmetry, Outcome study, Clinicians, Laypeople, Three-dimensional, Diagnostic imaging, Temporal asymmetry

Copyright: © 2025 Denadai et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Denadai R, Alvim MKM, Kang Y, Tu JC, Campos BMd, Ghizoni E, Tedeschi H, Yasuda C, Cendes F. 2025. Detecting temporal asymmetry after epilepsy surgery: a 3D MRI-based comparative outcome study of clinicians and lay observers. PeerJ 13:e20201 https://doi.org/10.7717/peerj.20201

The authors have chosen to make the review history of this article public.

Abstract

Background

Resective surgery through the pterional approach is an effective treatment for drug-resistant temporal lobe epilepsy, but it may lead to temporal asymmetry in the craniofacial region. Nonetheless, recent systematic reviews have shown that there is no gold standard method for discriminating a clinically visible abnormal temporal asymmetry from a normal fluctuating asymmetry. This study compares the ability of trained and untrained observers to detect temporal asymmetry and establishes a threshold for clinically detecting abnormal asymmetry.

Methods

Standardized magnetic resonance imaging (MRI)-derived three-dimensional (3D) frontal views of adult patients who underwent temporal lobe epilepsy surgery were used to create a continuum spectrum of preoperative (n = 96) and 12-month postoperative (n = 96) craniofacial images. A panel of 32 untrained lay observers (family members and laypeople) and 32 trained clinicians (surgeons and clinical specialists) independently appraised randomly displayed 3D craniofacial soft-tissue images to assess the presence or absence of temporal asymmetry. Objective linear quantifications of temporal thickness differences were obtained from each preoperative and postoperative MRI scan to identify a potential threshold for subjective asymmetry detection. Temporal asymmetry was further categorized into severity levels I to IV based on incremental asymmetry values.

Results

The temporal thickness difference was significantly (P < 0.001) greater in postoperative images (18.3% ± 9.6%) compared to preoperative images (4.6% ± 1.9%). As temporal asymmetry increased from Level I to IV, a significantly higher proportion of 3D craniofacial images were perceived as asymmetrical by all observer categories (P < 0.001). Both trained clinicians and untrained observers—including surgeons, clinical specialists, family members, and laypeople—demonstrated increased (P < 0.001) detection rates with increasing asymmetry severity. A temporal thickness difference exceeding 10% was clinically detected with over 90% accuracy across all observer categories (P < 0.001), establishing a 10% threshold for the clinical perception of temporal asymmetry.

Introduction

Perfect symmetry does not exist in humans; instead, craniofacial asymmetry is viewed as a continuum, from normal craniofacial fluctuating asymmetry in healthy individuals (Crins-de Koning et al., 2025) to pronounced congenital, developmental, and acquired deformities (Cheong & Lo, 2011; Wenger, Gallagher & Bhoj, 2019). Among craniofacial subunits, the frontally visible temporal region plays a key role in facial symmetry, as its contour—positioned above the zygomatic arch and behind the lateral orbital rim—defines the upper facial width (Vaca et al., 2017; Shay et al., 2022). Unilateral changes in this area can result in noticeable asymmetry (Santiago et al., 2018; Thiensri, Limpoka & Burusapat, 2020).

In this setting, resective epilepsy surgery via the pterional approach is a notable contributor to asymmetrical temporal deformities, due to epilepsy’s global prevalence (Yang et al., 2024), the need for surgery in drug-resistant epilepsy (Pang et al., 2025), and potential postoperative complications (Yasuda et al., 2010; Gonzalez-Martinez, 2025). These deformities often lead to appearance-related preoccupation, disrupting quality of life, self-perception, daily functioning, and social interactions (Klassen et al., 2021), underscoring the need for greater clinical and research attention.

The amount of craniofacial asymmetry reported in both healthy individuals and affected patients varies depending on the assessment method used (Nguyen et al., 2024; Harripershad et al., 2025; Lin et al., 2025; Nishimura et al., 2025). Panel assessment, a common indirect method for evaluating craniofacial asymmetry, relies on subjective judgments from lay observers and clinicians (Zhu, Jayaraman & Khambay, 2016; Tan et al., 2021), with clinicians generally considered more sensitive, though studies report mixed findings on their relative accuracy (Lee, Dumrongwongsiri & Lo, 2019; Tan et al., 2021; Zhang et al., 2023). This approach is widely used to detect visible asymmetries and establish recognition thresholds across various craniofacial subunits, which differ by region (Wang et al., 2017; Lee, Dumrongwongsiri & Lo, 2019). For instance, occlusal canting under 3° often goes unnoticed, while deviations over 4° are detected by 90% of observers (Padwa, Kaiser & Kaban, 1997). However, formal data specific to the temporal region is lacking.

Imaging modality can also influence asymmetry-focused analyses (Lo & Lin, 2023). Subjective and objective craniofacial assessments are limited when based on two-dimensional images, which are prone to distortion, magnification errors, and lack of depth representation (Hsu et al., 2020). The rise of three-dimensional (3D) imaging has improved accuracy, offering undistorted, lifelike representations of the complex craniofacial structure (Wu et al., 2019). Among various 3D reconstruction methods, magnetic resonance imaging (MRI) offers non-invasive, radiation-free, high-contrast, high-resolution soft tissue imaging without visual distractions such as skin color, hair, or facial features that could bias perception (Vander Pluym et al., 2007; Villavisanis et al., 2024). MRI is essential for diagnosing and managing epilepsy, offering high-quality data storage (Biagioli et al., 2025). Neuroimaging repositories such as the Brazilian Institute of Neuroscience and Neurotechnology (BRAINN) house MRI datasets that have supported epilepsy research and can also contribute to craniofacial studies using images originally acquired for other neuroscience purposes (De Souza et al., 2020; Giacomini et al., 2020).

Recent systematic reviews confirm that no standardized method for analyzing the temporal region has been established (Wang et al., 2017; Gonçalves et al., 2021; Shay et al., 2022; Nasim et al., 2024). Although several grading systems have been proposed, their validity and consistency remain insufficiently examined (Vaca et al., 2017; Wang et al., 2017; Choi et al., 2018; Kim et al., 2018; Huang et al., 2018; Laloze et al., 2019; Gonçalves et al., 2021; Shay et al., 2022; Nasim et al., 2024). Moreover, reliable assessments of unmanipulated 3D images for detecting temporal asymmetry after epilepsy surgery remain lacking. Better understanding how laypeople and clinicians perceive such asymmetry could improve deformity detection, treatment planning, and research design.

Using MRI-derived 3D craniofacial soft-tissue image datasets from adult patients treated with resective surgery for temporal lobe epilepsy, this study aimed to assess the subjective recognition of temporal asymmetry by comparing evaluations from trained clinicians and lay observers. Additionally, the study aimed to determine the threshold value at which a temporal asymmetry is clinically recognized as abnormal.

Materials & Methods

This methodological cross-sectional investigation (Fig. 1) analyzed a standardized craniofacial image dataset obtained from the BRAINN database following approval by the Institutional Review Board (CAAE: 93412318.0.0000.5404; University of Campinas (UNICAMP), Campinas, São Paulo, Brazil) and in compliance with the Declaration of Helsinki. All patients provided written informed consent prior to undergoing MRI scans. The study included MRI-derived images from Brazilian adult patients (n = 96; 53 females; aged 42.6 ± 11.2 years; 58 left-side temporal lobe epilepsy) who consecutively underwent unilateral resective surgery via a modified pterional approach for drug-resistant epilepsy between 2016 and 2022, with a minimum postoperative period of 12 months. The 12-month postoperative evaluation was selected to ensure complete scar maturation, resolution of soft-tissue swelling, and stabilization of surgery-induced change (Bond et al., 2008). All included patients had a confirmed diagnosis of temporal lobe epilepsy, established by one of two experienced epileptologists at the Epilepsy Clinic, UNICAMP, Brazil. Exclusion criteria (n = 11) included the presence of other pathological conditions affecting the head or face (e.g., dento-skeletofacial deformities, prior reconstructive procedures, craniofacial trauma, or surgery) and incomplete datasets.

Figure 1: Flowchart of the cross-sectional study design, including pre- and post-epilepsy surgery magnetic resonance imaging-derived three-dimensional (3D) images.
These images were used for subjective panel assessments by trained clinicians (clinical and surgical observers) and untrained observers (laypeople and family members), objective linear quantification of temporal thickness differences, and the threshold-based detection of asymmetry (represented by the dotted arrow).

DOI: 10.7717/peerj.20201/fig-1

The imaging databank was compiled for diagnostic and follow-up purposes, with written patient consent (Alvim et al., 2016; De Souza et al., 2020; Giacomini et al., 2020). All pre- and postoperative MRI images were acquired under standardized conditions previously described by our team (3 Tesla Achieva-Intera Philips®: T1-weighted images with isotropic voxels of 1 mm, slice thickness 1 mm, no gap, flip angle = 8°, TR = 7.0 ms, TE = 3.2 ms, matrix = 240 × 240, and field of view = 240 × 240) (Fonseca et al., 2012; Coan et al., 2014; Campos et al., 2015; Alvim et al., 2016; De Campos et al., 2016). The images were then preprocessed using a multi-software workflow, with intensity inhomogeneity corrected by the N4 bias field correction algorithm in 3D Slicer and intensity normalization achieved through global scaling and alignment to tissue probability maps during spatial normalization in SPM8 (Ashburner & Friston, 2005; Ashburner, 2007; Avants, Nick & Gang, 2009; Tustison et al., 2010; Fedorov et al., 2012; Chakravarty et al., 2011; Madan, 2015; Alvim et al., 2016; Yushkevich et al., 2016; Villavisanis et al., 2024). Postoperative MRI images were rigidly co-registered to the preoperative images using the General Registration (BRAINS) module in 3D Slicer (Johnson, Harris & Williams, 2007; Kikinis, Pieper & Vosburgh, 2014), ensuring anatomical correspondence for asymmetry assessment without altering local tissue morphology (Friston et al., 1995; Ashburner & Friston, 1997; Ashburner & Friston, 2005).

A combination of advanced image analysis and engineering software was utilized for image processing and measurement: Avizo (FEI, Mérignac, France), Geomagic (3D Systems, Rock Hill, S.C.), and SimPlant O&O (Materialise, Leuven, Belgium). All anatomical landmarks, head orientation reference frames, measurement reference planes, severity stratification levels (I to IV), potential cutoff value for detecting asymmetrical deformities of the temporal region, subjective panel assessments (including 3D image frontal views, task time, and binomial scale system), and objective linear measurement (temporal soft tissue thickness difference in percentage) were standardized according to prior studies (Padwa, Kaiser & Kaban, 1997; Vander Pluym et al., 2007; Stephan & Devine, 2009; Kim et al., 2018; Wu et al., 2019; Hsu et al., 2020; Wan, Tsai & Lo, 2021; Morandi et al., 2022; Kurniawan et al., 2024).

Objective analysis

Using predefined anatomical landmarks and structures of interest—including the lateral orbital rim, temporal bone surface, and external skin surface—identified and interactively verified in the axial, coronal, and sagittal imaging planes, objective measurements of temporal thickness differences (Fig. 2A) were collected from each preoperative and postoperative MRI scan (Vander Pluym et al., 2007; Stephan & Devine, 2009; Bu et al., 2010; Welling et al., 2015; Wysong et al., 2013; Wysong et al., 2014; Kim et al., 2018; Morandi et al., 2022).

Figure 2: Magnetic resonance imaging (MRI)-derived images.
(A) Axial magnetic resonance imaging (MRI) view showing the internal (yellow) and external (green) reference lines used for objective linear distance measurements on the operated (blue) and non-operated (red) sides. The internal line (yellow) was defined from the surface of the non-operated lateral orbital rim to the tangential point on the surface of the non-operated temporal bone. The external line (green) was drawn parallel to the internal reference line along the skin surface. Temporal thickness was measured perpendicularly (blue and red lines) as the distance between the internal reference line (yellow) and the skin surface line (green). (B) Three-dimensional MRI-based craniofacial model in a frontal view, illustrating an epilepsy surgery-induced temporal asymmetry deformity on the left side.

DOI: 10.7717/peerj.20201/fig-2

The degree of temporal asymmetry was calculated as the percentage difference in thickness between the right and left temporal sides (or between the operated and non-operated sides) using the formula: (longer side−shorter side)/longer side×100. A total of 192 MRI scans (96 pre-epilepsy surgery and 96 post-epilepsy surgery) were linearly ranked based on incremental asymmetry values. Temporal asymmetry was subsequently categorized into four severity levels (I, II, III, and IV), each containing 48 images, ranging from the minimum to the maximum asymmetry percentage values.
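The percentage formula and the ranking into four equal-sized severity levels can be illustrated with a minimal Python sketch. The function names (`asymmetry_percent`, `assign_levels`) and the equal-split rule by rank are illustrative assumptions, not the software workflow used in the study; thickness values are assumed to be in millimetres.

```python
def asymmetry_percent(side_a_mm: float, side_b_mm: float) -> float:
    """Percentage thickness difference: (longer - shorter) / longer * 100."""
    longer = max(side_a_mm, side_b_mm)
    shorter = min(side_a_mm, side_b_mm)
    if longer <= 0:
        raise ValueError("thickness values must be positive")
    return (longer - shorter) / longer * 100.0


def assign_levels(values: list[float], n_levels: int = 4) -> list[int]:
    """Rank values in ascending order and split them into equal-sized levels.

    Returns one level (1..n_levels) per input value, in the input order.
    Assumption: the number of images is divisible by n_levels
    (e.g., 192 images -> 4 levels of 48 images each).
    """
    if len(values) % n_levels != 0:
        raise ValueError("number of images must be divisible by n_levels")
    per_level = len(values) // n_levels
    # Indices of the images sorted by increasing asymmetry value.
    order = sorted(range(len(values)), key=lambda i: values[i])
    levels = [0] * len(values)
    for rank, idx in enumerate(order):
        levels[idx] = rank // per_level + 1
    return levels
```

Under these assumptions, a scan with thicknesses of 8 mm and 4 mm yields a 50% difference regardless of which side is thicker, and 192 ranked values are split into four groups of 48.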

Using the temporal thickness difference data, a previously described but not validated cutoff value of 10% for temporal asymmetry (Kim et al., 2018) was applied to classify included preoperative and postoperative images into “no visible temporal asymmetry” (≤cut-off value) versus “at risk of clinically obvious temporal asymmetry deformity” (>cut-off value). This cutoff value was further utilized for comparative analysis of subjective asymmetry detection by various observer categories, including trained clinicians and untrained observers, as well as subcategories such as family members, laypeople, clinical observers, and surgical observers.

Stimuli processing

The stimulus materials used to assess the detection rate of temporal asymmetry with the subjective panel assessment instrument consisted of 3D frontal views, generated from preoperative and 12-month postoperative MRI scans. Native-space images were head-oriented using horizontal, coronal, and sagittal planes, and the aligned patient-specific 3D craniofacial soft tissue images were automatically adjusted to convert all right-sided operated regions to be consistently viewed as left-sided operated regions. The 3D models were also cropped to remove the lower craniofacial portion, particularly beneath the level of the mouth corners. Corresponding preoperative 3D craniofacial images were similarly flipped and cropped, ensuring consistency between the preoperative and postoperative images for subjective assessment. These horizontally flipped and cropped images (Fig. 2B) attenuated the unconscious bias on the part of the human observer, as the adjustments ensured that any potential visible asymmetry could be assessed without the influence of side-specific differences and confounding external facial features, such as the lower lip, oral commissures, lower cheek, mandible, and chin regions, in the images (Deall et al., 2016; Schwirtz et al., 2018).

All preoperative and postoperative 3D craniofacial images were presented at full brightness on a 15-inch MacBook Pro (Apple, Inc., Cupertino, CA, USA) using a timed Microsoft PowerPoint presentation (Microsoft Corporation, Redmond, WA, USA). The stimuli were randomly organized with varying levels of asymmetry and surgical statuses (pre- and postoperative) to minimize sequence effects. Each image was displayed for 6 s to allow observers to evaluate the asymmetry and respond to the questionnaire (Padwa, Kaiser & Kaban, 1997; Rhodes et al., 2005; Chatrath et al., 2007; Carvalho et al., 2012; Jackson et al., 2013; Lewis, 2017; Chou et al., 2019; Wan, Tsai & Lo, 2021). A blank slide was inserted between images for 2 s to reduce direct comparisons between consecutive stimuli. To mitigate fatigue during the rating task, automated breaks were incorporated into the presentation: a 1-minute break after every 13 slides and a 5-minute break after every 39 slides (Wood, Fisher & Andres, 1997; Larese Filon et al., 2019; Taylor-Phillips & Stinton, 2019; Hsiao et al., 2023).
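The timing rules above can be sketched as a small duration calculator. This is an illustrative sketch only: the function name `session_seconds` is hypothetical, and it assumes that breaks are counted per image slide, that the 5-minute break replaces (rather than adds to) the 1-minute break at every 39th slide, and that nothing follows the final image.

```python
def session_seconds(
    n_images: int,
    image_s: int = 6,       # display time per image
    blank_s: int = 2,       # blank slide between images
    short_every: int = 13,  # 1-minute break after every 13 slides
    short_s: int = 60,
    long_every: int = 39,   # 5-minute break after every 39 slides
    long_s: int = 300,
) -> int:
    """Total duration (seconds) of a timed rating session under the stated assumptions."""
    total = 0
    for k in range(1, n_images + 1):
        total += image_s
        if k == n_images:
            break  # assumption: no blank slide or break after the last image
        total += blank_s
        if k % long_every == 0:
            total += long_s   # assumption: long break replaces the short one
        elif k % short_every == 0:
            total += short_s
    return total
```

The number of slides is left as a parameter because the total depends on how many duplicated images are included in a given panel set.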

Subjective assessment

The temporal region, identified as the area of interest, was evaluated through subjective intuitive perception by a panel of clinicians and lay observers, employing a widely used binary (symmetry versus asymmetry) grading system (Padwa, Kaiser & Kaban, 1997; Rhodes et al., 2005; Chatrath et al., 2007; Carvalho et al., 2012; Jackson et al., 2013; Lewis, 2017; Chou et al., 2019; Wan, Tsai & Lo, 2021).

All clinicians and lay observers were instructed to assess each slide spontaneously regarding the symmetry of the temporal region, following standardized instructions. (1) The temporal anatomical subunit within the craniofacial region was defined and explained. Each observer was asked to identify this region on their own head, and no comprehension issues were observed. (2) Symmetry was described as both sides of the temporal region appearing similar, while asymmetry indicated a difference between the sides. Observers were asked to paraphrase this concept to confirm their understanding, with no issues identified. (3) A printed folder was provided to guide the use of the rating system, and observers were explicitly instructed: “Please rate the individual on the slide using the scale”.

The clinician panel comprised surgical and clinical specialists. Surgical and clinical observers included 16 external craniofacial plastic surgeons (six females) and 16 external dermatologists (13 females), respectively, each with 5 to 22 years of post-certification experience. The lay observer panel included 16 parents (eight females, aged 20–57 years) of individuals who had undergone surgery for cleft-craniofacial deformities, and 16 individuals (eight females, aged 18–52 years) without personal or familial histories of craniofacial surgery, cleft-craniofacial deformities, or plastic surgery-related experiences. These lay observers represented diverse socioeconomic and educational backgrounds (upper secondary, postsecondary, and tertiary education, classified according to the Brazilian stratification system) (Weller & Horta Neto, 2021). None had formal training in nursing, medicine, dentistry, or psychology.

All observers had normal or corrected-to-normal vision, no history of psychotropic drug use or mental impairments, no personal relationships with the patients, and were blinded to the study’s purpose except for the region of interest. They could not revisit slides, as initial impressions were prioritized, and received no financial compensation.

Reproducibility

Reproducibility was evaluated in terms of reliability and precision. All objective linear distance measurements were performed twice by the same examiner, with a 4-week interval, and their mean was used for analysis. Inter-examiner reliability was assessed by having a second examiner independently repeat measurements on 30% of randomly selected images. Intra- and inter-examiner reliability were quantified using intraclass correlation coefficients (ICC) (McGraw & Wong, 1996; Portney & Watkins, 2000; Koo & Li, 2016). Precision was evaluated through the relative error magnitude (REM) (Utermohle, Zegura & Heathcote, 1983; Ward & Jamison, 1991; Gordon & Bradtmiller, 1992).
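The exact formula for the relative error magnitude is not spelled out here. The sketch below assumes a definition commonly used in the anthropometric literature: the mean absolute difference between repeated measurements divided by the grand mean, expressed as a percentage. The function name `relative_error_magnitude` is hypothetical, and the study's own computation may differ.

```python
def relative_error_magnitude(first: list[float], second: list[float]) -> float:
    """Relative error magnitude (%) for two sessions of repeated measurements.

    Assumed definition: mean absolute difference (MAD) between paired
    measurements, divided by the grand mean of all measurements, times 100.
    """
    if len(first) != len(second) or not first:
        raise ValueError("both sessions need the same non-zero number of measurements")
    n = len(first)
    mad = sum(abs(a - b) for a, b in zip(first, second)) / n
    grand_mean = (sum(first) + sum(second)) / (2 * n)
    if grand_mean == 0:
        raise ValueError("grand mean is zero; relative error is undefined")
    return 100.0 * mad / grand_mean
```

Lower values indicate better precision; under this definition, identical repeated measurements give a value of 0%.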

To assess intra-observer reliability, 20% of the 3D images were randomly duplicated in the panel sets. Cohen’s Kappa (k) and Fleiss’s multirater kappa (k_Fleiss) were applied to evaluate intra-observer and inter-observer reliabilities, respectively, for nominal data (Fleiss, 1971; Kraemer, 2014; Asmundson, 2022).
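For readers unfamiliar with these agreement statistics, the following is a minimal pure-Python sketch of their standard formulas for nominal ratings. The function names and input layouts are illustrative assumptions; the study itself reports using SPSS, not this code.

```python
from collections import Counter


def cohen_kappa(first: list[int], second: list[int]) -> float:
    """Cohen's kappa for two sets of nominal ratings of the same items
    (e.g., one observer's first and repeated ratings of duplicated images)."""
    if len(first) != len(second) or not first:
        raise ValueError("rating lists must have the same non-zero length")
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    c1, c2 = Counter(first), Counter(second)
    # Chance agreement from the marginal category frequencies.
    expected = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one category is used")
    return (observed - expected) / (1.0 - expected)


def fleiss_kappa(counts: list[list[int]]) -> float:
    """Fleiss' kappa; counts[i][j] = number of raters assigning image i to category j."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every image needs the same number (>= 2) of raters")
    n_cats = len(counts[0])
    # Overall proportion of ratings falling in each category.
    p_cat = [sum(row[j] for row in counts) / (n_items * n_raters) for j in range(n_cats)]
    # Per-image agreement among rater pairs.
    p_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_item) / n_items
    p_exp = sum(p * p for p in p_cat)
    if p_exp == 1.0:
        raise ValueError("kappa is undefined when only one category is used")
    return (p_bar - p_exp) / (1.0 - p_exp)
```

With binary symmetry/asymmetry ratings, each row of `counts` would hold two entries: the number of observers answering "symmetry" and the number answering "asymmetry" for that image.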

Statistical analysis

The frequency of clinical detection of temporal asymmetry by each observer category and subcategory was calculated across varying levels of severity (I to IV), predefined cutoff values of asymmetry (0–10% and >10% asymmetry), and preoperative and postoperative image types. The clinical detection rate for each category and subcategory of observers was determined by dividing the number of asymmetry responses identified by observers in the respective category and subcategory by the total number of possible responses for that category and subcategory. A detection rate of 90% or higher was pre-established as the threshold for determining panel-based clinical recognition of obvious temporal asymmetry using the binary grading system (Padwa, Kaiser & Kaban, 1997; Rhodes et al., 2005; Chatrath et al., 2007; Carvalho et al., 2012; Jackson et al., 2013; Lewis, 2017; Chou et al., 2019; Wan, Tsai & Lo, 2021).
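The detection-rate calculation and the 90% recognition criterion can be sketched as follows. The function names and the boolean response layout are illustrative assumptions; `True` stands for an "asymmetry" response.

```python
def detection_rate(responses: list[list[bool]]) -> float:
    """Detection rate (%) for one observer category.

    responses[o][i] is True when observer o rated image i as asymmetrical.
    The denominator is the total number of possible responses
    (number of observers x number of images).
    """
    total_possible = sum(len(per_observer) for per_observer in responses)
    if total_possible == 0:
        raise ValueError("no responses supplied")
    asymmetry_responses = sum(sum(per_observer) for per_observer in responses)
    return 100.0 * asymmetry_responses / total_possible


def clinically_recognized(rate_percent: float, threshold: float = 90.0) -> bool:
    """Apply the pre-established criterion: detection rate of 90% or higher."""
    return rate_percent >= threshold
```

For example, 64 observers each rating 192 images give 64 × 192 = 12,288 possible responses, which is the denominator for the all-observer detection rate.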

The data distribution was assessed using the Kolmogorov–Smirnov test. Statistical comparisons were conducted using the Kruskal–Wallis test, Z-test, Chi-square test, Fisher’s Exact Test, Wilcoxon signed-rank test, and Mann–Whitney U test as appropriate for the data types and sample structures. When applicable, relationships between variables (e.g., sex and temporal thickness difference) were evaluated using Spearman’s rank correlation coefficient (r). A Bonferroni correction was applied to adjust for multiple comparisons. A P-value of <0.05 was considered statistically significant, and all tests (IBM SPSS software, version 23.0; Armonk, NY, USA) were two-sided.
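As an illustration of how two detection rates might be compared, the sketch below implements a pooled two-proportion Z-test with a two-sided P-value, followed by a Bonferroni-adjusted significance check. The text does not specify which Z-test variant was used, so the pooled form is an assumption, as are the function names; the analysis in the study was run in SPSS.

```python
import math


def two_proportion_z_test(x1: int, n1: int, x2: int, n2: int) -> tuple[float, float]:
    """Pooled two-proportion Z-test; returns (z, two-sided P-value).

    x1, x2: number of 'asymmetry' responses in each group.
    n1, n2: total number of possible responses in each group.
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    if se == 0.0:
        raise ValueError("test is undefined when the pooled proportion is 0 or 1")
    z = (p1 - p2) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


def bonferroni_significant(p_values: list[float], alpha: float = 0.05) -> list[bool]:
    """Flag each P-value as significant at alpha divided by the number of comparisons."""
    m = len(p_values)
    return [p < alpha / m for p in p_values]
```

With three comparisons, for instance, each P-value is tested against 0.05 / 3 ≈ 0.0167 rather than 0.05.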

Results

Objective analysis

For objective measurements, intra- and inter-examiner reliability ranged from moderate to excellent. Temporal thickness measurements also demonstrated good precision, as indicated by the relative error magnitude (REM) (Table S1).

Most MRI craniofacial images (61.5%; P < 0.001) exhibited temporal thickness differences of less than 10% (Table 1; Fig. 3). Across asymmetry severity levels, level I showed the lowest temporal thickness values, followed by levels II through IV (all P < 0.001). Asymmetry levels I and II had no temporal thickness differences exceeding 10%, while level III included both values above and below 10% (54.2% and 45.8% of images, respectively; P > 0.05). Level IV consisted exclusively of images with values above 10% (Table 1; Fig. 4).

Table 1:

Temporal thickness difference distribution across asymmetry levels.

Temporal asymmetry level	Temporal thickness difference			P-value
	Overall	≤10%	>10%
Level I–IV
n (%)	192 (100)	118 (61.46)	74 (38.54)	<0.001^*
Temporal thickness difference (%) m ± sd (range)	11.46 ± 9.77 [1.12–46.56]	4.87 ± 2.08 [1.12–9.90]	21.96 ± 7.80 [10.19–46.56]	<0.001^*
Level I
n (%)	48 (25)	48 (100)	0 (0)	–
Temporal thickness difference (%) m ± sd (range)	2.93 ± 0.99 [1.12–4.12]	2.93 ± 0.99 [1.12–4.12]	0 ± 0 [0–0]	–
Level II
n (%)	48 (25)	48 (100)	0 (0)	–
Temporal thickness difference (%) m ± sd (range)	5.35 ± 0.61 [4.13–6.52]	5.35 ± 0.61 [4.13–6.52]	0 ± 0 [0–0]	–
Level III
n (%)	48 (25)	22 (45.83)	26 (54.17)	0.541
Temporal thickness difference (%) m ± sd (range)	11.52 ± 3.54 [6.67–17.19]	8.08 ± 1.08 [6.67–9.90]	14.43 ± 1.83 [10.19–17.19]	<0.001^*
Level IV
n (%)	48 (25)	0 (0)	48 (100)	–
Temporal thickness difference (%) m ± sd (range)	26.04 ± 6.66 [18.39–46.56]	0 ± 0 [0–0]	26.04 ± 6.66 [18.39–46.56]	–
P-value	<0.001^*,**	–	–	–

DOI: 10.7717/peerj.20201/table-1

Notes:

–, no data for comparison purpose.

n, number of craniofacial images.

%, percentage.

m, mean.

sd, standard deviation.

Preop, preoperative.

Postop, postoperative.

*Significant after applying the Bonferroni correction.

**Level I < Level II < Level III < Level IV, with p < 0.001 for all pairwise comparisons.

Figure 3: Box plots illustrating the distribution of temporal thickness differences.
For further details, refer to Table 1.

DOI: 10.7717/peerj.20201/fig-3

Figure 4: Box plots illustrating the distribution of temporal thickness differences across asymmetry levels and preoperative/postoperative image types.
Asymmetry levels I and II showed no temporal thickness differences exceeding 10%, whereas level III included values both above and below this threshold. Level IV exclusively comprised three-dimensional craniofacial images with temporal thickness differences greater than 10%. Preoperative three-dimensional craniofacial images exhibited only differences below 10%, while postoperative images encompassed values both above and below this threshold. For further details, refer to Table 2.

DOI: 10.7717/peerj.20201/fig-4

Postoperative images exhibited a significantly (P < 0.001) greater temporal thickness difference than preoperative images (Table 2). No preoperative image had a temporal thickness difference greater than 10% (ranging from 1.12% to 9.23%), while postoperative images displayed a broader range of temporal thickness differences, from 1.99% to 46.56% (Table 2; Fig. 4). Both preoperative and postoperative images were distributed across asymmetry severity levels I to III, while only postoperative images were categorized as level IV (all P < 0.001) (Table 2). Subanalyses revealed no significant influence (all P > 0.05) of the tested parameters (age, sex, and surgical side) on temporal thickness differences, considering preoperative/postoperative image types, asymmetry cutoff, and severity levels.

Table 2:

Temporal thickness difference distribution across asymmetry levels and preoperative/postoperative image types.

Temporal asymmetry level	Temporal thickness difference
	Overall			≤10%			>10%
	Preop	Postop	P-value	Preop	Postop	P-value	Preop	Postop
Level I–IV
n (%)	96 (100)	96 (100)	–	96 (100)	22 (22.92)	<0.001^*	–	74 (77.08)
Temporal thickness difference (%) m ± sd (range)	4.60 ± 1.92 [1.12–9.23]	18.32 ± 9.64 [1.99–46.56]	<0.001^*	4.60 ± 1.92 [1.12–9.23]	6.09 ± 2.33 [1.99–9.90]	0.013	–	21.96 ± 7.80 [10.19–46.56]
Level I
n (%)	42 (87.5)	6 (12.5)	<0.001^*	42 (87.5)	6 (12.5)	<0.001^*	–	–
Temporal thickness difference (%) m ± sd (range)	2.85 ± 1.00 [1.12–4.12]	3.51 ± 0.70 [1.99–4.03]	0.139	2.85 ± 1.00 [1.12–4.12]	3.51 ± 0.77 [1.99–4.03]	0.139	–	–
Level II
n (%)	40 (83.3)	8 (16.7)	<0.001^*	40 (83.3)	8 (16.7)	<0.001^*	–	–
Temporal thickness difference (%) m ± sd (range)	5.35 ± 0.61 [4.13–6.52]	5.35 ± 0.59 [4.46–6.24]	0.989	5.35 ± 0.61 [4.13–6.52]	5.35 ± 0.63 [4.46–6.24]	0.989	–	–
Level III
n (%)	14 (29.2)	34 (70.8)	<0.001^*	14 (29.2)	8 (16.7)	0.131	–	26 (54.1)
Temporal thickness difference (%) m ± sd (range)	7.69 ± 0.94 [6.67–9.23]	13.09 ± 2.95 [7.22–17.19]	<0.001^*	7.69 ± 0.94 [6.67–9.23]	8.76 ± 1.02 [7.22–9.90]	0.020	–	14.43 ± 1.83 [10.19–17.19]
Level IV
n (%)	–	48 (100)	–	–	–	–	–	48 (100)
Temporal thickness difference (%) m ± sd (range)	–	26.04 ± 6.59 [18.39–46.56]	–	–	–	–	–	26.04 ± 6.66 [18.39–46.56]
P-value	–	<0.001^*,**	–	–	–	–	–	–

DOI: 10.7717/peerj.20201/table-2

Notes:

–, no data for comparison purpose.

n, number of craniofacial images.

%, percentage.

m, mean.

sd, standard deviation.

Preop, preoperative.

Postop, postoperative.

*Significant after applying the Bonferroni correction.

**Level I < Level II < Level III < Level IV, with p < 0.001 for all pairwise comparisons.

Subjective assessment

The intra-observer reliability testing demonstrated a strong level of agreement (k = 0.80–0.90) for untrained observers and an almost perfect level of agreement (k > 0.90) for trained observers. The inter-observer reliability testing showed a strong level of agreement (k_Fleiss = 0.80–0.90) across both subcategories and categories of observers (Table S2).

For both observer category (Tables 3 to 5) and single-observer (Table 6) analyses, clinicians detected temporal asymmetry with significantly greater frequency (all P < 0.001) than lay observers, considering overall temporal thickness difference, the 10% cutoff, preoperative/postoperative image types, and severity levels (except for level IV). In both the observer subcategory (Tables 3 to 5) and single-observer (Table 6) analyses, laypeople had the lowest detection rate (all P < 0.001), followed by family members, while surgical and clinical observers demonstrated higher detection rates for overall asymmetry, the 10% cutoff, preoperative/postoperative image types, and severity levels (except for level IV).

Table 3:

Detection rates of temporal asymmetry across observer categories/subcategories and asymmetry cutoffs.

Parameters	Detection rate of temporal asymmetry (%)
	Temporal thickness difference			P-value
	Overall (i = 192)	≤10% (i = 118)	>10% (i = 74)
All observers (n = 64)	38.40%	3.32%	94.32%	<0.001^*
Total number of possible responses (t)	12,288	7,552	4,736	–
Untrained observers (n = 32)	37.00%	2.41%	92.15%	<0.001^*
Total number of possible responses (t)	6,144	3,776	2,368	–
Family members (n = 16)	37.57%	2.49%	93.50%	<0.001^*
Lay people (n = 16)	36.43%	2.33%	90.79%	<0.001^*
P-value (intra-subcategory)	0.263	0.797	<0.001^*	–
Total number of possible responses (t)	3,072	1,888	1,184	–
Trained clinicians (n = 32)	39.79%	4.24%	96.49%	<0.001^*
Total number of possible responses (t)	6,144	3,776	2,368	–
Clinical observers (n = 16)	39.26%	3.60%	96.11%	<0.001^*
Surgical observers (n = 16)	40.33%	4.87%	96.88%	<0.001^*
P-value (intra-subcategory)	0.3897	0.0525	0.3146	–
Total number of possible responses (t)	3,072	1,888	1,184	–
P-value	–	–	–	–
Untrained versus trained observers	<0.001^*	<0.001^*	<0.001^*	–
Inter-subcategories	0.154	<0.001^*,***	<0.001^*,**	–

DOI: 10.7717/peerj.20201/table-3

Notes:

–, no data for comparison purpose.

n, number of observers.

i, number of craniofacial images.

t, total number of possible responses.

%, percentage.

*Significant after applying the Bonferroni correction.

**Lay people < family members < clinical observers = surgical observers, with p < 0.001 for all pairwise comparisons, except between clinical observers and surgical observers (p > 0.05).

***Lay people = family members < clinical observers = surgical observers, with p < 0.001 for all pairwise comparisons, except between lay people and family members, and clinical observers and surgical observers (p > 0.05).
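The totals in Table 3 follow from simple arithmetic: every observer rates every image, so t = n × i, and the overall detection rate is the image-weighted average of the rates at or below and above the 10% cutoff. A minimal Python sketch of that bookkeeping is given here; the function names are illustrative assumptions and are not part of the study's analysis pipeline.

```python
def total_responses(n_observers: int, n_images: int) -> int:
    """Total possible responses t when every observer rates every image."""
    return n_observers * n_images


def pooled_rate(rate_low: float, n_low: int, rate_high: float, n_high: int) -> float:
    """Image-weighted average of two subgroup detection rates (in percent)."""
    return (rate_low * n_low + rate_high * n_high) / (n_low + n_high)


# Values copied from the "All observers" row of Table 3.
t_all = total_responses(64, 192)                  # 64 observers x 192 images
overall_all = pooled_rate(3.32, 118, 94.32, 74)   # <=10% and >10% subgroups

print(t_all, round(overall_all, 2))
```

Because the published subgroup rates are already rounded to two decimals, the recomputed overall rate agrees with the reported 38.40% only to within rounding.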

Table 4:

Detection rates of temporal asymmetry across observer categories/subcategories, asymmetry cutoffs, and preoperative/postoperative image types.

Parameters	Detection rate of temporal asymmetry (%)
	Overall temporal thickness difference (i = 192)			Temporal thickness difference ≤10% (i = 118)			Temporal thickness difference >10% (i = 74)		P-value^#
	Preop (i = 96)	Postop (i = 96)	P-value	Preop (i = 96)	Postop (i = 22)	P-value	Preop (i = 0)	Postop (i = 74)
All observers (n = 64)	3.24%	73.55%	<0.001^*	3.24%	3.69%	0.458	–	94.32%	<0.001^*
Total number of possible responses (t)	6,144	6,144	–	6,144	1,408	–	–	4,736	–
Untrained observers (n = 32)	2.41%	71.58%	<0.001^*	2.41%	2.41%	1.00	–	92.15%	<0.001^*
Total number of possible responses (t)	3,072	3,072	–	3,072	704	–	–	2,368	–
Family members (n = 16)	2.54%	72.59%	<0.001^*	2.54%	2.27%	0.772	–	93.50%	<0.001^*
Lay people (n = 16)	2.28%	70.57%	<0.001^*	2.28%	2.56%	0.758	–	90.79%	<0.001^*
P-value (intra-subcategory)	0.677	0.197	–	0.677	0.805	–	–	0.0021	–
Total number of possible responses (t)	1,536	1,536	–	1,536	352	–	–	1,184	–
Trained clinicians (n = 32)	4.07%	75.52%	<0.001^*	4.07%	4.97%	0.28	–	96.49%	<0.001^*
Total number of possible responses (t)	3,072	3,072	–	3,072	704	–	–	2,368	–
Clinical observers (n = 16)	3.71%	74.80%	<0.001^*	3.71%	3.13%	0.595	–	96.11%	<0.001^*
Surgical observers (n = 16)	4.43%	76.24%	<0.001^*	4.43%	6.82%	0.060	–	96.88%	<0.001^*
P-value (intra-subcategory)	0.315	0.356	–	0.315	0.024	–	–	0.315	–
Total number of possible responses (t)	1,536	1,536	–	1,536	352	–	–	1,184	–
P-value	–	–	–	–	–	–	–	–	–
Untrained versus trained observers	<0.001^*	<0.001^*	–	<0.001^*	<0.001^*	–	–	<0.001^*	–
Inter-subcategories of observers	<0.001^*,**	<0.001^*,**	–	<0.001^*,**	<0.001^*,***	–	–	<0.001^*,****	–

DOI: 10.7717/peerj.20201/table-4

Notes:

–, no data for comparison purpose.

%, percentage.

n, number of observers.

i, number of craniofacial images.

t, total number of possible responses.

Preop, preoperative.

Postop, postoperative.

#Postoperative (≤10% of asymmetry) versus postoperative (>10% of asymmetry).

*Significant after applying the Bonferroni correction.

**Surgical observers scored significantly higher than lay people (p < 0.001), with no significant differences observed in the remaining pairwise comparisons (p > 0.05).

***Lay people = family members = clinical observers < surgical observers, with p < 0.001 for all pairwise comparisons, except between lay people and family members, lay people and clinical observers, and family members and clinical observers (p > 0.05).

****Lay people < family members < clinical observers = surgical observers, with p < 0.001 for all pairwise comparisons, except between clinical observers and surgical observers (p > 0.05).

Table 5:

Detection rates of temporal asymmetry across observer categories/subcategories, severity levels, and asymmetry cutoffs.

Parameters	Detection rate of temporal asymmetry (%)
	Overall temporal thickness difference					Temporal thickness difference ≤10%			Temporal thickness difference >10%
	Level I	Level II	Level III	Level IV	P-value	Level I	Level II	Level III	Level III	Level IV
All observers (n = 64)	0.72%	4.07%	48.80%	100.00%	<0.001^*,**	0.72%	4.07%	7.39%	83.83%	100.00%
Untrained observers (n = 32)	0.59%	2.73%	44.66%	100.00%	<0.001^*,**	0.59%	2.73%	5.68%	77.64%	100.00%
Family members (n = 16)	0.65%	2.60%	47.01%	100.00%	<0.001^*,**	0.65%	2.60%	6.25%	81.49%	100.00%
Lay people (n = 16)	0.52%	2.86%	42.32%	100.00%	<0.001^*,**	0.52%	2.86%	5.11%	73.80%	100.00%
P-value (intra-subcategory)	<0.001^*	<0.001^*	<0.001^*	–	–	<0.001^*	<0.001^*	<0.001^*	<0.001^*	–
Trained clinicians (n = 32)	0.85%	5.40%	52.93%	100.00%	<0.001^*,**	0.85%	5.40%	9.09%	90.02%	100.00%
Clinical observers (n = 16)	0.78%	3.26%	52.99%	100.00%	<0.001^*,**	0.78%	3.26%	10.51%	88.94%	100.00%
Surgical observers (n = 16)	0.91%	7.55%	52.86%	100.00%	<0.001^*,**	0.91%	7.55%	7.67%	91.11%	100.00%
P-value (intra-subcategory)	<0.001^*	<0.001^*	<0.001^*	–	–	<0.001^*	<0.001^*	<0.001^*	<0.001^*	–
P-value	–	–	–	–	–	–	–	–	–	–
Untrained versus trained observers	<0.001^*	<0.001^*	<0.001^*	–	–	<0.001^*	<0.001^*	<0.001^*	<0.001^*	–
Inter-subcategories of observers	<0.001^*,***	<0.001^*,***	<0.001^*,****	–	–	<0.001^*,***	<0.001^*,***	<0.001^*,****	<0.001^*,***	–

DOI: 10.7717/peerj.20201/table-5

Notes:

–, no data for comparison purpose.

%, percentage.

n, number of observers.

*Significant after applying the Bonferroni correction.

**Level I < Level II < Level III < Level IV, with p < 0.001 for all pairwise comparisons.

***Lay people < family members < clinical observers < surgical observers, with p < 0.001 for all pairwise comparisons.

****Lay people < family members < surgical observers < clinical observers, with p < 0.001 for all pairwise comparisons.
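The table notes flag P-values as significant after Bonferroni correction. As a hedged illustration of how such a correction works for the pairwise comparisons among the four severity levels, the sketch below divides a family-wise alpha by the number of pairs. The alpha of 0.05 and the choice of six comparisons are assumptions made for this example only; this section does not state how many comparisons the authors corrected for.

```python
from itertools import combinations


def bonferroni_alpha(family_alpha: float, n_comparisons: int) -> float:
    """Per-comparison significance level under the Bonferroni correction."""
    return family_alpha / n_comparisons


def is_significant(p_value: float, alpha: float) -> bool:
    """True when a P-value falls below the (corrected) significance level."""
    return p_value < alpha


# Assumption: all pairwise comparisons among four severity levels.
levels = ["I", "II", "III", "IV"]
pairs = list(combinations(levels, 2))          # 6 unordered pairs
adjusted = bonferroni_alpha(0.05, len(pairs))  # assumed family-wise alpha 0.05

print(len(pairs), round(adjusted, 4))
```

Under these assumptions a P-value below 0.001 remains significant after correction, whereas a nominal P-value such as 0.024 would not.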

Table 6:

Detection rates of temporal asymmetry by individual observers.

Single-observer analysis	Detection rate of temporal asymmetry (%)
	Temporal thickness difference			P-value
	Overall (i = 192)	≤10% (i = 118)	>10% (i = 74)
All observers (n = 64) m ± sd (range)	38.40 ± 1.94 [35.42–43.23]	3.32 ± 2.03 [0.85–9.32]	94.32 ± 2.69 [90.54–98.65]	<0.001^*
Untrained observers (n = 32)	37.00 ± 0.87 [35.42–39.06]	2.41 ± 0.81 [0.85–4.24]	92.15 ± 1.66 [90.54–95.95]	<0.001^*
Family members (n = 16) m ± sd (range)	37.57 ± 0.68 [36.46–39.06]	2.49 ± 0.72 [1.69–4.24]	93.50 ± 1.23 [91.89–95.95]	<0.001^*
Lay people (n = 16) m ± sd (range)	36.43 ± 0.64 [35.42–38.02]	2.33 ± 0.90 [0.85–4.24]	90.79 ± 0.54 [90.54–91.89]	<0.001^*
P-value (intra-subcategory)	<0.001^*	0.657	<0.001^*	–
Trained clinicians (n = 32)	39.79 ± 1.69 [37.50–43.23]	4.24 ± 2.45 [1.69–9.32]	96.49 ± 1.49 [93.24–98.65]	<0.001^*
Clinical observers (n = 16) m ± sd (range)	39.26 ± 1.45 [37.50–41.67]	3.60 ± 2.11 [1.69–6.78]	96.11 ± 1.47 [93.24–97.30]	<0.001^*
Surgical observers (n = 16) m ± sd (range)	40.33 ± 1.78 [37.50–43.23]	4.87 ± 2.67 [1.69–9.32]	96.88 ± 1.46 [94.59–98.65]	<0.001^*
P-value (intra-subcategory)	0.081	0.167	0.171	–
P-value	–	–	–	–
Untrained versus trained observers	<0.001^*	<0.001^*	<0.001^*	–
Inter-subcategories of observers	<0.001^*,**	<0.001^*,***	<0.001^*,***	–

DOI: 10.7717/peerj.20201/table-6

Notes:

–, no data for comparison purpose.

n, number of observers.

i, number of craniofacial images.

%, percentage.

m, mean.

sd, standard deviation.

*Significant after applying the Bonferroni correction.

**Lay people < family members < clinical observers = surgical observers, with p < 0.001 for all pairwise comparisons, except between clinical observers and surgical observers (p > 0.05).

***Surgical observers scored significantly higher than lay people (p < 0.001), with no significant differences observed in the remaining pairwise comparisons (p > 0.05).
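Table 6 summarizes each group as mean ± sd [range] of the per-observer detection rates. A minimal sketch of that summary is shown here, assuming the sample standard deviation is intended. The per-observer counts are hypothetical values chosen only so that the summary resembles the lay people row for images with a thickness difference above 10%; they are not the study's raw data.

```python
from statistics import mean, stdev

N_IMAGES_OVER_10 = 74  # images with >10% thickness difference (Table 6 header)


def detection_rate(n_detected: int, n_images: int) -> float:
    """Percentage of images in which one observer reported asymmetry."""
    return 100.0 * n_detected / n_images


def summarize(rates):
    """Mean, sample standard deviation, and range of per-observer rates."""
    return {"mean": mean(rates), "sd": stdev(rates),
            "min": min(rates), "max": max(rates)}


# Hypothetical counts for 16 observers (illustration only, not raw data).
hypothetical_counts = [67] * 13 + [68] * 3
rates = [detection_rate(c, N_IMAGES_OVER_10) for c in hypothetical_counts]
summary = summarize(rates)

print("{mean:.2f} ± {sd:.2f} [{min:.2f}–{max:.2f}]".format(**summary))
```

With 74 images, each observer's rate can only take values in steps of 1/74 (about 1.35 percentage points), which is why the reported range limits such as 90.54% and 91.89% correspond to 67 and 68 detections.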

For images with a temporal thickness difference of 10% or less, no significant difference (all P > 0.05) was observed between preoperative and postoperative image types across observer categories and subcategories (Table 4). As the asymmetry severity level increased, the detection rate of asymmetry increased significantly for all observer categories and subcategories (all P < 0.001), considering both the overall temporal thickness difference and the 10% cutoff (Table 5). For level III severity, clinical observers had the highest detection rates in the overall temporal thickness difference parameter and when appraising images with temporal thickness differences of 10% or less (all P < 0.001) (Table 5). Surgical observers had the highest detection rates for levels I and II (regardless of temporal thickness difference) and for level III when appraising images with temporal thickness differences greater than 10% (all P < 0.001) (Table 5). For level IV asymmetry, all observer categories and subcategories detected asymmetry in all 48 images, regardless of the overall temporal thickness difference or the 10% cutoff (all P > 0.05) (Table 5). Subanalyses revealed no significant influence (all P > 0.05) of the tested parameters (age, sex, and surgical side) on detection rates, considering temporal thickness difference, observer categories/subcategories, preoperative/postoperative image types, asymmetry cutoff, and severity levels.

Threshold-based detection of asymmetry

The detection rate of temporal asymmetry in 3D craniofacial images with a temporal thickness difference greater than 10% (range: 10.186% to 46.555%) was significantly higher than in images with a temporal thickness difference of 10% or less (range: 1.118% to 9.901%), across both observer categories and subcategories (all P < 0.001) (Table 3; Fig. S1). Temporal thickness differences greater than 10% were clinically detected with a frequency exceeding 90% by both observer categories (trained clinicians and untrained lay observers) and all subcategories (family members, lay people, and clinical and surgical observers) (Tables 3, 4 and 6; Fig. S2). The clinical detection rate for 3D craniofacial images with temporal thickness differences of 10% or less (range: 1.118% to 9.901%) did not exceed 90% for any observer category or subcategory; group-level rates ranged from 2.33% to 4.87% (Table 3). No significant correlation (all P > 0.05) was found between age, sex, or surgical side and the asymmetry cutoff parameter, considering temporal thickness difference, observer categories/subcategories, preoperative/postoperative image types, or severity levels.
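The threshold analysis described above amounts to splitting responses at the 10% cutoff and computing a detection rate on each side. The sketch below illustrates that split under stated assumptions: the `Rating` record and function name are hypothetical, the thickness differences at the range limits are taken from the text, and the detection outcomes are invented for illustration.

```python
from dataclasses import dataclass

CUTOFF_PCT = 10.0  # temporal thickness difference cutoff used in the study


@dataclass
class Rating:
    thickness_diff_pct: float  # objective left-right thickness difference (%)
    detected: bool             # observer reported visible asymmetry


def detection_rates_by_cutoff(ratings, cutoff=CUTOFF_PCT):
    """Detection rate (%) for responses at/below and above the cutoff."""
    groups = {"at_or_below": [], "above": []}
    for r in ratings:
        key = "above" if r.thickness_diff_pct > cutoff else "at_or_below"
        groups[key].append(r.detected)
    # An empty group has no defined rate, so it is reported as None.
    return {k: (100.0 * sum(v) / len(v) if v else None)
            for k, v in groups.items()}


# Hypothetical responses; outcomes are illustrative, not study data.
example = [
    Rating(1.118, False), Rating(5.0, False),
    Rating(9.901, True), Rating(10.0, False),
    Rating(10.186, True), Rating(12.5, False),
    Rating(20.0, True), Rating(46.555, True),
]
print(detection_rates_by_cutoff(example))
```

A difference of exactly 10% falls in the lower group here, matching the "10% or less" versus "greater than 10%" wording of the results.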

Discussion

Currently, there is no gold standard for identifying temporal region asymmetry (Wang et al., 2017; Laloze et al., 2019; Shay et al., 2022; Nasim et al., 2024). To address this, instead of computer-generated asymmetrical models (e.g., chimeras or blends) (Naini et al., 2012a), we used unaltered 3D craniofacial models from surgically treated patients, preserving the natural clinical context and reflecting real-world epilepsy care in a Brazilian population of mixed ethnic backgrounds (Giacomini et al., 2020; Secolin et al., 2021). By applying appropriate statistical tests and established cut-off values (Padwa, Kaiser & Kaban, 1997; Rhodes et al., 2005; Chatrath et al., 2007; Kottner et al., 2011; Carvalho et al., 2012; Jackson et al., 2013; Lewis, 2017; Chou et al., 2019; Wan, Tsai & Lo, 2021), both the subjective panel assessments and the objective measurements of temporal soft tissue thickness differences met criteria for reliability and precision, supporting the robustness of the data collected for the temporal region.

Our analyses showed that 3D craniofacial images with prior temporal surgery exhibited greater asymmetry than those without surgical intervention, and perfect symmetry was absent in all images. These results support the presence of craniofacial asymmetry across the population (Chou et al., 2019; Crins-de Koning et al., 2025) and indicate that surgery-induced asymmetry is more clinically noticeable (Cheong & Lo, 2011). Our data reflect natural variability in the temporal region under real-world conditions, providing a useful reference for future studies on both natural and surgical asymmetry. To broaden these findings, future MRI-based research could include diverse ethnic groups and patients undergoing other temporal surgeries (e.g., for trauma, aneurysms, tumors) and assess additional asymmetry indices. Studies might also examine the influence of underlying brain and cranial structures on craniofacial asymmetry (Marečková et al., 2013; Kong et al., 2018).

In our study, subjective judgments of temporal asymmetry followed a consistent pattern: larger temporal thickness differences were associated with higher detection rates. Both trained and untrained observers performed similarly for severe asymmetry (level IV), but detection rates varied significantly for milder asymmetries (levels I–III), with laypeople showing the lowest rates, followed by family members, while clinical and surgical observers demonstrated higher accuracy. Overall, lay observers detected asymmetry less frequently than clinicians.

Consistent with previous studies (Kokich, Kokich & Kiyak, 2006; An et al., 2014), specialized clinicians demonstrated superior recognition of asymmetry, likely due to their ability to apply clinical skills to subtle asymmetries. The clinicians in our study—craniofacial plastic surgeons and dermatologists experienced in treating appearance-altering craniofacial conditions—possess extensive expertise in managing temporal region asymmetries, including the use of fillers and grafts for reconstructive and aesthetic purposes. Unlike untrained observers, clinicians develop enhanced visualization, judgment, and sensitivity through rigorous training and clinical practice, which may explain their higher detection rates. We also hypothesize that clinicians’ superior performance may reflect conscious or unconscious motivational bias (Montibeller & von Winterfeldt, 2015), as they may invest additional effort in identifying asymmetry in regions closely related to their professional expertise.

Interestingly, studies show that laypeople with exposure to a specific condition—for example, individuals who have undergone orthodontic treatment—are better at perceiving asymmetry, such as incisal plane canting or midline shifts, than those without such experience (Naini et al., 2012a; Naini et al., 2012b; An et al., 2014). Similarly, managing cleft-craniofacial conditions poses significant challenges for patients and families, and family members involved in long-term care may develop greater awareness and critical perception of craniofacial symmetry (Denadai & Lo, 2022). This may explain why, in our study, family members demonstrated higher recognition rates of asymmetry compared to laypeople without prior exposure to appearance-altering craniofacial conditions. Our findings suggest that future research should consider subdividing lay observers rather than treating them as a single group. To further explore how technical and experiential backgrounds influence temporal asymmetry assessment, future studies could examine whether other trained healthcare providers (e.g., neurologists, neurosurgeons, psychologists managing epilepsy) and different untrained observers (e.g., family members of patients with or without epilepsy surgery) perceive asymmetry differently. Additionally, research should investigate how various skills and strategies—such as perceptual fluency, cultural influences, artistic experience, focus on specific facial regions, and sensitivity to shape, contour, or light and shadow (Pinheiro et al., 2023)—affect the perception of symmetry versus asymmetry in 3D craniofacial images across different observer groups. However, it should be acknowledged that visually assessing temporal asymmetry, particularly for untrained observers, remains inherently challenging and represents a limitation of such studies.

Similar to many medical and nonmedical contexts (Bathiany, Hidding & Scheffer, 2020; Nakajima, Okuda & Komatsu, 2021), the perception of craniofacial asymmetry appears to follow a threshold model, becoming clinically noticeable only beyond a certain degree of deviation (Parrini et al., 2016; Wang et al., 2017). Studies have examined various craniofacial subunits to define objective cutoff values (linear, angular, or percentage measures) that correspond to subjective detection by clinicians and laypeople (Padwa, Kaiser & Kaban, 1997; Parrini et al., 2016; Wang et al., 2017; Lee, Dumrongwongsiri & Lo, 2019). Each craniofacial subunit has a unique perceptual threshold that distinguishes normal variation from clinically recognizable deviations (Padwa, Kaiser & Kaban, 1997; Parrini et al., 2016; Wang et al., 2017).

To date, no study has quantified the minimal degree of objectively measured temporal asymmetry detectable by both trained and untrained observers. Although a 10% threshold has been proposed for defining visible unilateral temporal deformity (Kim et al., 2018), this value remains unvalidated through formal observer detection analyses. Craniofacial soft tissue thickness—a linear measurement widely used in clinical and research settings for purposes such as evaluating surgery-induced changes, assessing sarcopenia, serving as a prognostic marker in cancer, and aiding facial approximation in forensic medicine (Stephan & Devine, 2009; Hona & Stephan, 2024)—was therefore employed as the objective measure in the current study.

Our cumulative data demonstrated that both clinicians and lay observers were able to identify temporal asymmetry at a threshold of 10% difference in temporal thickness. All observer categories and subcategories met or exceeded the minimum clinical recognition level, with over 90% accuracy in detecting asymmetry in 3D craniofacial images exhibiting temporal thickness differences greater than 10%. This threshold effectively distinguished between 3D craniofacial images predicted to be unaffected (preoperative images) and those hypothesized to exhibit visible temporal asymmetry (postoperative images). Based on our findings, and in alignment with definitions supported by previous research (Kim et al., 2018), a 10% difference in temporal thickness can be proposed as a discriminative threshold for the clinical detection of temporal asymmetry by both trained and untrained observers. Importantly, differences in sensitivity for the detection of temporal asymmetry between clinicians and lay observers could have practical implications. In real-world settings, lower sensitivity among lay observers might mean that minor temporal asymmetries, although noticeable to clinicians, remain imperceptible to the public, potentially reducing patients’ psychosocial distress or perceived need for intervention. Therefore, recognizing this potential clinical scenario could help avoid unnecessary treatment or overinflated diagnoses driven by clinicians’ higher sensitivity. Understanding these dynamics is crucial for guiding patient counseling, setting realistic expectations, and informing decisions about therapeutic interventions.

While our study does not answer all questions about craniofacial asymmetry, it offers valuable insights into how clinicians and lay observers recognize temporal asymmetry. We have also established a key threshold for clinical detection, which may benefit interdisciplinary epilepsy care. These data provide an initial framework for counseling adult patients and setting expectations for potential postsurgical asymmetry. Moreover, they could aid perioperative assessments in temporal reconstruction (Vaca et al., 2017; Gonçalves et al., 2021), where residual asymmetry may exist but remain imperceptible in interpersonal interactions.

Potential caveats of this study should be acknowledged. This was not an epidemiological investigation, and we did not assess the prevalence of temporal deformity, explore multiple levels of nonindependence, or investigate potential predictors of its occurrence following epilepsy surgery. The study was also not designed to examine hierarchical data structures or to model clustered or nested sources of variability. These aspects merit future research using alternative methodologies, including mixed-effects model analyses. Future investigation could explore specific structural components of the temporal region, such as fat, muscle, and bone, to provide a more detailed understanding of asymmetry. Studies could also assess the severity of asymmetry and treatment needs using continuous or ordinal rating scales, as well as additional parameters such as surface area and volume, allowing for more nuanced differentiation beyond binary classification. Additionally, leveraging artificial intelligence or machine learning techniques for automated detection holds promise for matching or even exceeding human performance in identifying temporal asymmetry.

Conclusions

This study identified a 10% difference in temporal thickness as the threshold for the clinical recognition of temporal asymmetry by both trained clinicians (surgeons and clinical specialists) and untrained observers (family members and laypeople). Moreover, significant differences were observed between these groups, with trained clinicians detecting asymmetry more frequently than untrained observers across most conditions—including overall temporal thickness differences, the 10% cutoff, preoperative/postoperative image types, and varying severity levels—except at the highest severity level, where all observers achieved complete detection.

Supplemental Information

STROBE Checklist

DOI: 10.7717/peerj.20201/supp-1

Raw data

All independent and dependent variables were adopted for the reported analysis.

DOI: 10.7717/peerj.20201/supp-2

Box plots

Box plots illustrating the distribution of detection rates of temporal asymmetry by individual observers based on three-dimensional craniofacial images with temporal thickness differences below 10%. For further details, refer to Table 6.

DOI: 10.7717/peerj.20201/supp-3

Box plots

DOI: 10.7717/peerj.20201/supp-4

Reproducibility statistics for temporal thickness measurement

DOI: 10.7717/peerj.20201/supp-5

Reliability statistics for panel assessment

DOI: 10.7717/peerj.20201/supp-6

[1] Alvim MK, Coan AC, Campos BM, Yasuda CL, Oliveira MC, Morita ME, Cendes F. 2016. Progression of gray matter atrophy in seizure-free patients with temporal lobe epilepsy. Epilepsia 57(4):621-629

[2] An SM, Choi SY, Chung YW, Jang TH, Kang KH. 2014. Comparing esthetic smile perceptions among laypersons with and without orthodontic treatment experience and dentists. Korean Journal of Orthodontics 44(6):294-303

[3] Ashburner J. 2007. A fast diffeomorphic image registration algorithm. NeuroImage 38(1):95-113

[4] Ashburner J, Friston KJ. 1997. Multimodal image coregistration and partitioning—a unified framework. NeuroImage 6(3):209-217

[5] Ashburner J, Friston KJ. 2005. Unified segmentation. NeuroImage 26(3):839-851

[6] Asmundson GJG. 2022. Comprehensive clinical psychology (2nd edition). Amsterdam, The Netherlands: Elsevier.

[7] Avants BB, Nick T, Gang S. 2009. Advanced normalization tools (ANTS). The Insight Journal 365:1-35

[8] Bathiany S, Hidding J, Scheffer M. 2020. Edge detection reveals abrupt and extreme climate events. The Journal of Climate 33(15):6399-6421

[9] Biagioli N, Parfyonov M, Meletti S, Pavesi G, Archer J, Bernhardt BC, Caciagli L, Cendes F, Chinvarun Y, Concha L, Federico P, Gaillard WD, Kobayashi E, Ogbole G, Rampp S, Wang S, Winston GP, Wang I, Vaudano AE. 2025. ILAE neuroimaging task force highlight: the utility of multimodal neuroimaging in diagnostic and presurgical workup of drug-resistant focal epilepsy. Epileptic Disorders 27(3):439-450

[10] Bond JS, Duncan JAL, Sattar A, Boanas A, Mason T, O’Kane S, Ferguson MWJ. 2008. Maturation of the human scar: an observational study. Plastic and Reconstructive Surgery 121(5):1650-1658

[11] Bu XB, Zhang J, Wang XM, Sun LG, Yang Y. 2010. Change of zygomatic and temporal soft tissue after coronal incision. Zhonghua Zheng Xing Wai Ke Za Zhi 26(5):345-348

[12] Campos BM, Coan AC, Beltramini GC, Liu M, Yassuda CL, Ghizoni E, Beaulieu C, Gross DW, Cendes F. 2015. White matter abnormalities associate with type and localization of focal epileptogenic lesions. Epilepsia 56(1):125-132

[13] Carvalho B, Ballin AC, Becker RV, Berger CA, Hurtado JG, Mocellin M. 2012. Rhinoplasty and facial asymmetry: analysis of subjective and anthropometric factors in the Caucasian nose. International Archives of Otorhinolaryngology 16(4):445-451

[14] Chakravarty MM, Aleong R, Leonard G, Perron M, Pike GB, Richer L, Veillette S, Pausova Z, Paus T. 2011. Automated analysis of craniofacial morphology using magnetic resonance images. PLOS ONE 6(5):e20241

[15] Chatrath P, De Cordova J, Nouraei SA, Ahmed J, Saleh HA. 2007. Objective assessment of facial asymmetry in rhinoplasty patients. Archives of Facial Plastic Surgery 9(3):184-187

[16] Cheong YW, Lo LJ. 2011. Facial asymmetry: etiology, evaluation, and management. Chang Gung Medical Journal 34(4):341-351

[17] Choi J, Choi H, Shin D, Kim J, Lee M, Kim S, Jo D, Kim C. 2018. Autogenous fat grafting for mild-to-moderate postoperative temporal hollowing after decompressive craniectomy: one-year follow-up. Archives of Plastic Surgery 45(1):69-73

[18] Chou PY, Denadai R, Chen SH, Tseng HJ, Hsu CK, Wang SW, Hallac R, Chen CH, Kane AA, Lo LJ. 2019. Identifying three-dimensional facial fluctuating asymmetry in normal pediatric individuals: a panel assessment outcome study of clinicians and observers. Journal of Clinical Medicine 8(5):648

[19] Coan AC, Kubota B, Bergo FP, Campos BM, Cendes F. 2014. 3T MRI quantification of hippocampal volume and signal in mesial temporal lobe epilepsy improves detection of hippocampal sclerosis. AJNR. American Journal of Neuroradiology 35(1):77-83

[20] Crins-de Koning M, Bruggink R, Nienhuijs M, Wagner T, Bronkhorst EM, Ongkosuwito EM. 2025. Three-dimensional analysis of facial morphology in nine-year-old children with different unilateral orofacial clefts compared to normative data. PeerJ 13:e18739

[21] De Campos BM, Coan AC, Lin Yasuda C, Casseb RF, Cendes F. 2016. Large-scale brain networks are distinctly affected in right and left mesial temporal lobe epilepsy. Human Brain Mapping 37(9):3137-3152

[22] De Souza JPSAS, Ayub G, Nogueira M, Zanao T, Lopes TM, Pimentel-Silva LR, Domene V, Marquez G, Yasuda CL, Ribeiro LF, Campos BM, Vasconcellos J, Rogerio F, Joaquim AF, Cendes F, Tedeschi H, Ghizoni E. 2020. Temporopolar amygdalohippocampectomy: seizure control and postoperative outcomes. Journal of Neurosurgery 134(3):1044-1053

[23] Deall CE, Kornmann NS, Bella H, Wallis KL, Hardwicke JT, Su TL, Richard BM. 2016. Facial aesthetic outcomes of cleft surgery: assessment of discrete lip and nose images compared with digital symmetry analysis. Plastic and Reconstructive Surgery 138(4):855-862

[24] Denadai R, Lo LJ, eds. 2022. Current concept in cleft surgery: moving toward excellence of outcome and reducing the burden of care. Singapore: Springer.

[25] Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, Bauer C, Jennings D, Fennessy F, Sonka M, Buatti J, Aylward S, Miller JV, Pieper S, Kikinis R. 2012. 3D slicer as an image computing platform for the quantitative imaging network. Magnetic Resonance Imaging 30(9):1323-1341

[26] Fleiss JL. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76:378-382

[27] Fonseca V de C, Yasuda CL, Tedeschi GG, Betting LE, Cendes F. 2012. White matter abnormalities in patients with focal cortical dysplasia revealed by diffusion tensor imaging analysis in a voxelwise approach. Frontiers in Neurology 3:121

[28] Friston KJ, Ashburner J, Frith CD, Poline JB, Heather JD, Frackowiak RS. 1995. Spatial registration and normalization of images. Human Brain Mapping 2(3):165-189

[29] Giacomini L, De Souza JPSA, Formentin C, De Campos BM, Todeschini AB, De Oliveira E, Tedeschi H, Joaquim AF, Cendes F, Ghizoni E. 2020. Temporal lobe structural evaluation after transsylvian selective amygdalohippocampectomy. Neurosurgical Focus 48(4):e14

[30] Gonçalves DB, Dos Santos MIA, De Cristo Rojas Cabral L, Oliveira LM, Da Silva Coutinho GC, Dutra BG, Martins RV, Reis F, Paiva WS, De Amorim RLO. 2021. Esthetics outcomes in patients submitted to pterional craniotomy and its variants: a scoping review. Surgical Neurology International 12:461

[31] Gonzalez-Martinez JA. 2025. Epilepsy surgery in the last 10 years: advancements and controversies. Epilepsy Currents

[32] Gordon CC, Bradtmiller B. 1992. Interobserver error in a large scale anthropometric survey. American Journal of Human Biology 4(2):253-263

[33] Harripershad M, Liebenberg L, Ridel AF, Theye CEG. 2025. Exploring craniofacial fluctuating asymmetry in a South African sample. Journal of Anatomy Epub ahead of print

[34] Hona TWPT, Stephan CN. 2024. Global facial soft tissue thicknesses for craniofacial identification (2023): a review of 140 years of data since Welcker’s first study. International Journal of Legal Medicine 138(2):519-535

[35] Hsiao CY, Kuo CC, Liou YA, Wang MJ. 2023. Determining work-rest schedules for visual tasks that use optical head-mounted displays based on visual fatigue and visually induced motion sickness recovery. International Journal of Environmental Research and Public Health 20(3):1880

[36] Hsu PJ, Denadai R, Pai BCJ, Lin HH, Lo LJ. 2020. Outcome of facial contour asymmetry after conventional two-dimensional versus computer-assisted three-dimensional planning in cleft orthognathic surgery. Scientific Reports 10:2346

[37] Huang RL, Xie Y, Wang W, Tan P, Li Q. 2018. Long-term outcomes of temporal hollowing augmentation by targeted volume restoration of fat compartments in Chinese adults. JAMA Facial Plastic Surgery 20(5):387-393

[38] Jackson TH, Mitroff SR, Clark K, Proffit WR, Lee JY, Nguyen TT. 2013. Face symmetry assessment abilities: clinical implications for diagnosing asymmetry. American Journal of Orthodontics and Dentofacial Orthopedics 144(5):663-671

[39] Johnson HJ, Harris G, Williams K. 2007. BRAINSFit: mutual information registrations of whole-brain 3D images, using the insight toolkit. The Insight Journal 57:11

[40] Kikinis R, Pieper SD, Vosburgh KG. 2014. 3D slicer: a platform for subject-specific image analysis, visualization, and clinical support. In: Intraoperative imaging and image-guided therapy. New York: Springer.

[41] Kim JH, Lee R, Shin CH, Kim HK, Han YS. 2018. Temporal augmentation with calvarial onlay graft during pterional craniotomy for prevention of temporal hollowing. Archives of Craniofacial Surgery 19(2):94-101

[42] Klassen AF, Rae C, Riff W, Denadai R, Murray DJ, Bracken S, Courtemanche DJ, Bulstrode N, O’Hara J, Butler D, Goldstein J, Tassi A, Hol ML, Johnson D, Ganske IM, Kölby L, Benitez S, Breuning EE, Malic CC, Allen GC, Pusic AL, Cano S. 2021. FACE-Q craniofacial module: part 2 Psychometric properties of newly developed scales for children and young adults with facial conditions. Journal of Plastic, Reconstructive & Aesthetic Surgery 74(9):2330-2340

[43] Kokich VO, Kokich VG, Kiyak HA. 2006. Perceptions of dental professionals and laypersons to altered dental esthetics: asymmetric and symmetric situations. American Journal of Orthodontics and Dentofacial Orthopedics 130(2):141-151

[44] Kong XZ, Mathias SR, Guadalupe T, ENIGMA Laterality Working Group, Glahn DC, Franke B, Crivello F, Tzourio-Mazoyer N, Fisher SE, Thompson PM, Francks C. 2018. Mapping cortical brain asymmetry in 17 141 healthy individuals worldwide via the ENIGMA Consortium. Proceedings of the National Academy of Sciences of the United States of America 115(22):E5154-E5163

[45] Koo TK, Li MY. 2016. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine 15(2):155-163

[46] Kottner J, Audigé L, Brorson S, Donner A, Gajewski BJ, Hróbjartsson A, Roberts C, Shoukri M, Streiner DL. 2011. Guidelines for reporting reliability and agreement studies (GRRAS) were proposed. Journal of Clinical Epidemiology 64(1):96-106

[47] Kraemer HC. 2014. Kappa coefficient. In: Wiley StatsRef: statistics reference online:2014. New Jersey, United States: John Wiley & Sons, Ltd.

[48] Kurniawan MSIC, Tio PAE, Abdel Alim T, Roshchupkin G, Dirven CMF, Pleumeekers MM, Mathijssen IMJ, Van Veelen MC. 2024. 3D analysis of the cranial and facial shape in craniosynostosis patients: a systematic review. Journal of Craniofacial Surgery 35(3):813-821

[49] Laloze J, Brie J, Chaput B, Usseglio J. 2019. Depression after temporal muscle flap: a systematic review of the literature. Journal of Cranio-Maxillo-Facial Surgery 47(7):1104-1109

[50] Larese Filon F, Drusian A, Ronchese F, Negro C. 2019. Video display operator complaints: a 10-year follow-up of visual fatigue and refractive disorders. International Journal of Environmental Research and Public Health 16(14):2501

[51] Lee SF, Dumrongwongsiri S, Lo LJ. 2019. Perception of lip cant as a sign of facial deformity: assessment by laypersons and professionals on composite face photographs. Annals of Plastic Surgery 82(1S Suppl 1):S140-S143

[52] Lewis MB. 2017. Factors affecting the perception of 3D facial symmetry from 2D projections. Symmetry 9(10):243

[53] Lin CL, Chen YF, Chen YA, Yao CF, Xi T, Liao YF, Chen YR. 2025. Associations with lip cant and facial midline correction following bimaxillary surgery in class III asymmetry: a CBCT-based analysis. Biomedical Journal

[54] Lo LJ, Lin HH. 2023. Applications of three-dimensional imaging techniques in craniomaxillofacial surgery: a literature review. Biomedical Journal 46(4):100615

[55] Madan CR. 2015. Creating 3D visualizations of MRI data: a brief guide. F1000Research 4:466

[56] Marečková K, Chakravarty MM, Huang M, Lawrence C, Leonard G, Perron M, Pike BG, Richer L, Veillette S, Pausova Z, Paus T. 2013. Does skull shape mediate the relationship between objective features and subjective impressions about the face? NeuroImage 79:234-240

[57] McGraw KO, Wong SP. 1996. Forming inferences about some intraclass correlation coefficients. Psychological Methods 1:30-46

[58] Montibeller G, Von Winterfeldt D. 2015. Cognitive and motivational biases in decision and risk analysis. Risk Analysis 35(7):1230-1251

[59] Morandi EM, Pinggera D, Kerschbaumer J, Rauchenwald T, Winkelmann S, Thomé C, Pierer G, Wolfram D. 2022. Correction of temporal hollowing after pterional craniotomy by autologous fat grafting. A single-center experience. World Neurosurgery 164:e784-e791

[60] Naini FB, Donaldson AN, Cobourne MT, McDonald F. 2012b. Assessing the influence of mandibular prominence on perceived attractiveness in the orthognathic patient, clinician, and layperson. European Journal of Orthodontics 34(6):738-746

[61] Naini FB, Donaldson AN, McDonald F, Cobourne MT. 2012a. Assessing the influence of asymmetry affecting the mandible and chin point on perceived attractiveness in the orthognathic patient, clinician, and layperson. Journal of Oral and Maxillofacial Surgery 70(1):192-206

[62] Nakajima K, Okuda K, Komatsu J. 2021. What does diagnostic threshold mean? Deterministic and probabilistic considerations. The Journal of Nuclear Cardiology 28(4):1702-1706

[63] Nasim S, Nasim H, Kauke M, Safi AF. 2024. Autologous fat grafting for cosmetic temporal augmentation: a systematic review. Frontiers in Surgery 11:1410162

[64] Nguyen TT, Cohen SM, Shaw KS, Dedeoglu F, Vleugels RA, Ganske IM. 2024. Comparison of 3D facial photographs and clinical documentation in patients with craniofacial morphea. Skin Health and Disease 4(6):e438

[65] Nishimura M, Tachiki C, Morikawa T, Ariizumi D, Matsunaga S, Sugahara K, Aihara Y, Watanabe A, Katakura A, Kawamata T, Nishii Y. 2025. Cranial vault deformation and its association with mandibular deviation in patients with facial asymmetry: a CT-based study. Diagnostics 15(13):1702

[66] Padwa BL, Kaiser MO, Kaban LB. 1997. Occlusal cant in the frontal plane as a reflection of facial asymmetry. Journal of Oral and Maxillofacial Surgery 55(8):811-816

[67] Pang Y, Yang Y, Lin Y, Zhu J, Liu P, Tian Y, Wang F, Mei Z, Kang D, Cao M, Lin Y. 2025. Interictal neural fragility predicts seizure onset zone and surgical outcomes in drug-resistant epilepsy. PeerJ 13:e19548

[68] Parrini S, Rossini G, Castroflorio T, Fortini A, Deregibus A, Debernardi C. 2016. Laypeople’s perceptions of frontal smile esthetics: a systematic review. American Journal of Orthodontics and Dentofacial Orthopedics 150(5):740-750

[69] Pinheiro LP, Monteiro LCP, Henriques LD, Souza GS, Miranda AC, Costa MF, Henriques AL. 2023. Association between facial metrics and mate rejection for long-term relationship by heterosexual men. Symmetry 15(1):133

[70] Portney LG, Watkins MP. 2000. Foundations of clinical research: applications to practice. Upper Saddle River, New Jersey: Prentice Hall Health.

[71] Rhodes G, Peters M, Lee K, Morrone MC, Burr D. 2005. Higher-level mechanisms detect facial symmetry. Proceedings of the Royal Society B: Biological Sciences 272(1570):1379-1384

[72] Santiago GF, Terner J, Wolff A, Teixeira J, Brem H, Huang J, Gordon CR. 2018. Post-neurosurgical temporal deformities: various techniques for correction and associated complications. Journal of Craniofacial Surgery 29(7):1723-1729

[73] Schwirtz RMF, Mulder FJ, Mosmuller DGM, Tan RA, Maal TJ, Prahl C, De Vet HCW, Don Griot JPW. 2018. Rating nasolabial aesthetics in unilateral cleft lip and palate patients: cropped versus full-face images. Cleft Palate-Craniofacial Journal 55(5):747-752

[74] Secolin R, De Araujo TK, Gonsales MC, Rocha CS, Naslavsky M, Marco L, Bicalho MAC, Vazquez VL, Zatz M, Silva WA, Lopes-Cendes I. 2021. Genetic variability in COVID-19-related genes in the Brazilian population. Human Genome Variation 8:15

[75] Shay T, Shachar T, Olshinka A, Ad-El DD, Ibelli T, Shilo Yaacobi D. 2022. Temporal hollowing causes, classifications, and treatment options: a systematic review. Journal of Cosmetic Dermatology 21(10):4215-4224

[76] Stephan CN, Devine M. 2009. The superficial temporal fat pad and its ramifications for temporalis muscle construction in facial approximation. Forensic Science International 191(1–3):70-79

[77] Tan RA, Mulder FJ, Schwirtz RMF, Mosmuller DGM, De Vet HCW, Griot JPWD. 2021. Atypical outcomes of nasal and lip appearance after unilateral cleft lip repair: judgment by professionals, patients, and laypeople. Cleft Palate-Craniofacial Journal 58(10):1226-1235

[78] Taylor-Phillips S, Stinton C. 2019. Fatigue in radiology: a fertile area for future research. British Journal of Radiology 92(1099):20190043

[79] Thiensri T, Limpoka A, Burusapat C. 2020. Analysis of factors associated with temporal hollowing after pterional craniotomy. Indian Journal of Plastic Surgery 53(1):71-82

[80] Tustison NJ, Avants BB, Cook PA, Zheng Y, Egan A, Yushkevich PA, Gee JC. 2010. N4ITK: improved N3 bias correction. IEEE Transactions on Medical Imaging 29(6):1310-1320

[81] Utermohle CJ, Zegura SL, Heathcote GM. 1983. Multiple observers, and humidity, and choice of precision statistics: factors influencing craniometric data quality. American Journal of Physical Anthropology 61(1):85-95

[82] Vaca EE, Purnell CA, Gosain AK, Alghoul MS. 2017. Postoperative temporal hollowing: is there a surgical approach that prevents this complication? A systematic review and anatomic illustration. Journal of Plastic, Reconstructive & Aesthetic Surgery 70(3):401-415

[83] Vander Pluym J, Shan WW, Taher Z, Beaulieu C, Plewes C, Peterson AE, Beattie OB, Bamforth JS. 2007. Use of magnetic resonance imaging to measure facial soft tissue depth. Cleft Palate-Craniofacial Journal 44(1):52-57

[84] Villavisanis DF, Khandelwal P, Zapatero ZD, Wagner CS, Blum JD, Cho DY, Swanson JW, Taylor JA, Yushkevich PA, Bartlett SP. 2024. Craniofacial soft-tissue anthropomorphic database with magnetic resonance imaging and unbiased diffeomorphic registration. Plastic and Reconstructive Surgery 153(3):667-677

[85] Wan SY, Tsai PY, Lo LJ. 2021. Quantifying perceived facial asymmetry to enhance physician–patient communications. Applied Sciences 11(18):8398

[86] Wang TT, Wessels L, Hussain G, Merten S. 2017. Discriminative thresholds in facial asymmetry: a review of the literature. Aesthetic Surgery Journal 37(4):375-385

[87] Ward RE, Jamison PL. 1991. Measurement precision and reliability in craniofacial anthropometry: implications and suggestions for clinical applications. Journal of Craniofacial Genetics and Developmental Biology 11(3):156-164

[88] Weller W, Horta Neto JL. 2021. The education system of Brazil: an overview of history and politics. In: The education systems of the Americas. Global education systems. Cham: Springer.

[89] Welling LC, Figueiredo EG, Wen HT, Gomes MQ, Bor-Seng-Shu E, Casarolli C, Guirado VM, Teixeira MJ. 2015. Prospective randomized study comparing clinical, functional, and aesthetic results of minipterional and classic pterional craniotomies. Journal of Neurosurgery 122(5):1012-1019

[90] Wenger TL, Gallagher ER, Bhoj EJ. 2019. An algorithm for the assessment of facial asymmetry in children with focus on etiology and treatment. Cleft Palate-Craniofacial Journal 56(3):419-424

[91] Wood DD, Fisher DL, Andres RO. 1997. Minimizing fatigue during repetitive jobs: optimal work-rest schedules. Human Factors 39(1):83-101

[92] Wu TY, Denadai R, Lin HH, Ho CT, Lo LJ. 2019. The outcome of skeletofacial reconstruction with mandibular rotation for management of asymmetric skeletal class III deformity: a three-dimensional computer-assisted investigation. Scientific Reports 9:13337

[93] Wysong A, Joseph T, Kim D, Tang JY, Gladstone HB. 2013. Quantifying soft tissue loss in facial aging: a study in women using magnetic resonance imaging. Dermatologic Surgery 39(12):1895-1902

[94] Wysong A, Kim D, Joseph T, MacFarlane DF, Tang JY, Gladstone HB. 2014. Quantifying soft tissue loss in the aging male face using magnetic resonance imaging. Dermatologic Surgery 40(7):786-793

[95] Yang LZ, Guo Y, Wang ZQ, Zhang CQ. 2024. A population-based analysis of the global burden of epilepsy across all age groups (1990–2021): utilizing the Global Burden of Disease 2021 data. Frontiers in Neurology 15:1448596