%0 Journal Article
%T 基于双边滤波与受限玻尔兹曼机的冷冻电镜单颗粒图像识别
Identification of Cryo-EM Single Particle Images Using Bilateral Filter and Restricted Boltzmann Machine
%A 王桉迪
%A 姚睿捷
%A 黄强
%J Biophysics
%P 34-42
%@ 2330-1694
%D 2021
%I Hans Publishing
%R 10.12677/BIPHY.2021.91005
%X 冷冻电镜技术(Cryo-EM)起源于20世纪70年代,是结构生物学中蛋白质与核酸分子结构研究的重要技术手段。21世纪以来,计算机性能的提升与直接电子检测相机的极大发展,使得人们在小样本低剂量样本条件下仍可获得接近原子分辨率级的三维结构模型。由于三维结构模型是利用多角度投影,通过大量二维冷冻电镜单颗粒图像重构所得,因此,二维单颗粒图像的识别与分类直接影响最终模型的分辨率。目前,通过冷冻电镜获得的图像大部分噪声较多,因此对二维单颗粒图像的筛选,往往需要耗费有经验的科学工作者耗费大量时间。针对此问题,本文运用计算机图形学与机器学习相结合的方法,在预处理阶段以双边滤波器(Bilateral Filter)对信噪比较低的图像进行边缘优化,并通过直方图均衡化实现图像信息增强,最后以少量高置信度图像为训练样本,通过受限玻尔兹曼机(Restricted Boltzmann Machine,RBM)进行监督式学习并实现图像的分类与筛选,以提高二维单颗粒图像识别的效率与准确率。在方法检验阶段,首先,我们利用蛋白质数据库(Protein Data Bank, PDB)中已知的生物大分子结构,投影生成不同信噪比的模拟单颗粒模拟数据,验证了在低信噪比条件下应用本方法进行单颗粒图像识别分类的准确性。随后我们以瞬态受体电位离子通道蛋白子类V成员1 (Transient Receptor Potential cation channel subfamily V member 1,TRPV1)的真实二维单颗粒图像数据集进行识别分类与三维模型重构,通过cryoSPARC平台,以约53%的原始数据量重构出了与原分辨率3.6?相近的模型。因此,本研究不仅提高了传统人工筛选的效率,也为冷冻电镜单颗粒二维图像识别提供了新思路。
Cryo-EM is a crucial technological means to study protein and nucleic acid of structural biology which is originated in 1970s. The significant evolution of computing performance and direct electronic detection (DDD) camera make the atomic resolution of 3D structure of micromolecular under the condition of small dose possible since 21st century. The reconstruction of 3D model is based on identification and classification of 2D Cryo-EM single particle projection images which becomes an immediate cause of how good the resolution of final 3D model could share. Currently, the 2D single particle images selection was is such a time-consuming job even for the experienced scientific researchers as the signal noise ratio (SNR) is usually quite low. A new approach with the combination of computer graphics and machine learning is raised to this problem by using bilateral filter to optimize the detail of edge and histogram equalization to enhance graphic information in the pre-processing stage, moreover, small amount of high-confidence images was chosen as training sample under the restricted Boltzmann machine (RBM) network in supervised learning pattern to achieve the image selection and classification. In the verification stage, the effectiveness of this approach is proved to work well with simulated low SNR projection photos generated from the known micromolecular data from protein data bank (PDB). Subsequently, actual experimental 2D singlet particle data of transient receptor potential cation channel subfamily V member 1 (TRPV1) is applied to be identified and classified, and finally, a 3.6? 3D structural model is reconstructed through cryoSPARC platform by using only approximate 53% of the original data. Consequently, this research is not only improving the manual efficiency, but also providing a broader
%K 冷冻电镜,双边滤波,受限玻尔兹曼机
Cryo-EM
%K Bilateral Filter
%K Restricted Boltzmann Machine
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=40750