%0 Journal Article
%T 基于N-gram剪枝技术的隐患文本自动评估模型
An Automatic Assessment Model Based on N-gram Pruning Technique for Hidden Danger Text
%A 叶洪胜
%A 刘洪
%A 周宝山
%A 兰莉
%A 邹巧兰
%A 周啟梦
%A 王海宇
%J Mine Engineering
%P 388-394
%@ 2329-731X
%D 2024
%I Hans Publishing
%R 10.12677/me.2024.123047
%X 为了自动分析海上钻井平台隐患文本中蕴含的隐患响应程度信息,量化隐患严重程度,提出一种基于N-gram词袋向量的隐患响应等级量化评估模型。首先针对1565条钻井平台的现场隐患记录进行分词与过滤处理;其次再以N-gram作为特征单元重塑词袋维度;然后提出使用逆TF-IDF值来强化特征值;最后,使用朴素贝叶斯构建隐患量化模型。结果表明:使用该方法的隐患量化评估模型具有较高的精确率、召回率及F1值。
To automatically analyze the response level information of hidden dangers contained in hidden danger texts and quantify the severity, a quantitative evaluation model based on N-gram word bag vectors is proposed for the response level of hidden dangers. Firstly, segment and filter the on-site hazard records of 1565 drilling platforms; Secondly, using N-gram as feature units to reshape the bag of words dimension; Then, it is proposed to use the inverse TF-IDF value to enhance the feature values; Finally, use naive Bayes to construct a hazard quantification model. The results show that the hazard quantification evaluation model using this method has high accuracy, recall, and F1 value.
%K 语义分析,钻井平台,N-gram,词袋向量,隐患量化
Semantic Analysis
%K Drilling Platforms
%K N-gram
%K Word Bag Vector
%K Hazard Quantification
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=91161