Research Paper Volume 16, Issue 13 pp 11018—11026
Identification of Escherichia coli strains using MALDI-TOF MS combined with long short-term memory neural networks
- 1 Department of General Surgery, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
- 2 Department of Medicine and Pharmacy, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
- 3 School of Medicine, Ningbo University, Ningbo 315211, Zhejiang, China
- 4 Department of Gastroenterology, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
Received: March 18, 2024 Accepted: June 3, 2024 Published: June 29, 2024
https://doi.org/10.18632/aging.205995How to Cite
Copyright: © 2024 Mao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Abstract
The current study aims to develop a new technique for the precise identification of Escherichia coli strains, utilizing matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) combined with a long short-term memory (LSTM) neural network. A total of 48 Escherichia coli strains were isolated and cultured on tryptic soy agar medium for 24 hours for the generation of MALDI-TOF MS spectra. Eight hundred MALDI-TOF MS spectra were obtained per strain, resulting in a database of 38,400 spectra. Fifty percent of the data was utilized for LSTM neural network training, with fine-tuned parameters for strain-level identification. The other half served as the test set to assess model performance. Traditional PCA dimension reduction of MALDI-TOF MS spectra indicated 47 out of 48 strains to be unclassifiable. In contrast, the LSTM neural network demonstrated remarkable efficacy. After 20 training epochs, the model achieved a loss value of 0.0524, an accuracy of 0.999, a precision of 0.985, and a recall of 0.982. When tested on the unseen data, the model attained an overall accuracy of 92.24%. The integration of MALDI-TOF MS and LSTM neural network markedly enhances the identification of Escherichia coli strains. This innovative approach offers an effective and accurate tool for MALDI-TOF MS-based strain-level identification, thus expanding the analytical capabilities of microbial diagnostics.