Aging
Navigate
Research Paper|Volume 16, Issue 13|pp 11018—11026

Identification of Escherichia coli strains using MALDI-TOF MS combined with long short-term memory neural networks

Qiqi Mao1, Xie Zhang2, Zeping Xu2, Ya Xiao3, Yufei Song4, Feng Xu4
  • 1Department of General Surgery, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
  • 2Department of Medicine and Pharmacy, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
  • 3School of Medicine, Ningbo University, Ningbo 315211, Zhejiang, China
  • 4Department of Gastroenterology, Li Huili Hospital Affiliated to Ningbo University, Ningbo 315040, China
* Equal contribution
Received: March 18, 2024Accepted: June 3, 2024Published: June 29, 2024

Copyright: © 2024 Mao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

The current study aims to develop a new technique for the precise identification of Escherichia coli strains, utilizing matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) combined with a long short-term memory (LSTM) neural network. A total of 48 Escherichia coli strains were isolated and cultured on tryptic soy agar medium for 24 hours for the generation of MALDI-TOF MS spectra. Eight hundred MALDI-TOF MS spectra were obtained per strain, resulting in a database of 38,400 spectra. Fifty percent of the data was utilized for LSTM neural network training, with fine-tuned parameters for strain-level identification. The other half served as the test set to assess model performance. Traditional PCA dimension reduction of MALDI-TOF MS spectra indicated 47 out of 48 strains to be unclassifiable. In contrast, the LSTM neural network demonstrated remarkable efficacy. After 20 training epochs, the model achieved a loss value of 0.0524, an accuracy of 0.999, a precision of 0.985, and a recall of 0.982. When tested on the unseen data, the model attained an overall accuracy of 92.24%. The integration of MALDI-TOF MS and LSTM neural network markedly enhances the identification of Escherichia coli strains. This innovative approach offers an effective and accurate tool for MALDI-TOF MS-based strain-level identification, thus expanding the analytical capabilities of microbial diagnostics.