Machine Learning-Enabled NIR Spectroscopy. Part 3: Hyperparameter by Design (HyD) Based ANN-MLP Optimization, Model Generalizability, and Model Transferability

Show simple item record

dc.contributor.author Ali, Hussain
dc.contributor.author Muthudoss, Prakash
dc.contributor.author Chauhan, Chirag
dc.contributor.author Kaliappan, Ilango
dc.contributor.author Kumar, Dinesh
dc.contributor.author Paudel, Amrit
dc.contributor.author Ramasamy, Gobi
dc.date.accessioned 2024-04-10T06:09:38Z
dc.date.available 2024-04-10T06:09:38Z
dc.date.issued 2023-12-07
dc.identifier.issn 15309932
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/3123
dc.description This paper published with affiliation IIT (BHU), Varanasi in open access mode. en_US
dc.description.abstract Data variations, library changes, and poorly tuned hyperparameters can cause failures in data-driven modelling. In such scenarios, model drift, a gradual shift in model performance, can lead to inaccurate predictions. Monitoring and mitigating drift are vital to maintain model effectiveness. USFDA and ICH regulate pharmaceutical variation with scientific risk-based approaches. In this study, the hyperparameter optimization for the Artificial Neural Network Multilayer Perceptron (ANN-MLP) was investigated using open-source data. The design of experiments (DoE) approach in combination with target drift prediction and statistical process control (SPC) was employed to achieve this objective. First, pre-screening and optimization DoEs were conducted on lab-scale data, serving as internal validation data, to identify the design space and control space. The regression performance metrics were carefully monitored to ensure the right set of hyperparameters was selected, optimizing the modelling time and storage requirements. Before extending the analysis to external validation data, a drift analysis on the target variable was performed. This aimed to determine if the external data fell within the studied range or required retraining of the model. Although a drift was observed, the external data remained well within the range of the internal validation data. Subsequently, trend analysis and process monitoring for the mean absolute error of the active content were conducted. The combined use of DoE, drift analysis, and SPC enabled trend analysis, ensuring that both current and external validation data met acceptance criteria. Out-of-specification and process control limits were determined, providing valuable insights into the model’s performance and overall reliability. This comprehensive approach allowed for robust hyperparameter optimization and effective management of model lifecycle, crucial in achieving accurate and dependable predictions in various real-world applications. en_US
dc.description.sponsorship Machine Learning Company en_US
dc.language.iso en en_US
dc.publisher Springer Science and Business Media Deutschland GmbH en_US
dc.relation.ispartofseries AAPS PharmSciTech;24
dc.subject Artificial Neural Network Multilayer Perceptron (ANN-MLP); en_US
dc.subject data-driven modelling; en_US
dc.subject design of experiments (DoE); en_US
dc.subject hyperparameter optimization; en_US
dc.subject model generalizability; en_US
dc.subject model lifecycle management; en_US
dc.subject model transferability; en_US
dc.subject near infrared (NIR); en_US
dc.subject process monitoring; en_US
dc.subject statistical process control (SPC); en_US
dc.subject target drift detection en_US
dc.subject Algorithms; en_US
dc.subject Machine Learning; en_US
dc.subject Neural Networks, en_US
dc.title Machine Learning-Enabled NIR Spectroscopy. Part 3: Hyperparameter by Design (HyD) Based ANN-MLP Optimization, Model Generalizability, and Model Transferability en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search in IDR


Advanced Search

Browse

My Account