Application of machine learning algorithms in building health diagnostics: predictive analytics evaluating indoor air quality and sick building syndrome in educational settings
Abstract
Our research aimed to develop and validate a predictive analytics model for diagnosing sick building syndrome (SBS) in learners. We achieved this by gathering and analyzing epidemiological and exposure assessment data via a cross-sectional study approach. The current assessment involved the use of the modified MM040NA SBS questionnaire and checklist from Indoor Air Quality (IAQ) Industry Code of Practice (IAQ-ICOP) from the Department of Occupational Safety and Health (DOSH), Malaysia, with participants scoring their answers, and recording and scoring of their simultaneous self-reported and physician-ascertained health complaints. At the same time, IAQ assessments were collected at the location of each participant with the use of occupational hygiene techniques. Several predictive analytics algorithms, namely Neural Network, Logistic Regression, Classification Tree, Random Forest, and Support Vector Machine, were used to train and test the collected dataset. The Neural Network model rendered the most effective classification accuracy, reaching 82.5%. Validation also showed that multiple IAQ parameters were strongly associated with health complaints, especially in mechanically ventilated environments. Taken together, the results confirm the effectiveness of neural network-based predictive analytics in correctly diagnosing sick building syndrome (SBS) and related health complaints on limited IAQ data and thereby improving the ability to assess during the early stages.