Laser-printed document classification using random forest and gray prediction models

Abstract

This paper presents a classification method for laser-printed documents that integrates the random forest algorithm with the gray prediction model to improve the accuracy and reliability of forensic document examination. The study uses 14 laser printers from five brands as experimental subjects and extracts 14 key feature parameters, including gray mean, contrast, and distribution symmetry, with the ImageXpert analysis system. The random forest algorithm performs the classification, and the gray prediction model is applied to further improve classification accuracy. Experimental results show that the proposed method classifies characters and punctuation marks with high accuracy, reaching 96.00% for Chinese characters with fewer strokes and 92.86% for periods. Compared with traditional classification methods, the approach is more stable and more accurate. The findings highlight its advantages of non-destructive analysis, efficient classification, and robustness, and underscore its potential as a technological tool for forensic document examination in legal contexts.
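
The abstract does not say which gray prediction model is used or how it is coupled with the random forest classifier. As a rough illustration only, the sketch below implements GM(1,1), the most commonly used gray prediction model, in plain Python. The function names `gm11_fit` and `gm11_predict` and the sample series are hypothetical and are not taken from the paper.

```python
import math


def gm11_fit(series):
    """Estimate the GM(1,1) parameters (a, b) for a positive series.

    Assumption: GM(1,1) stands in for the paper's unspecified gray model.
    """
    n = len(series)
    if n < 4:
        # GM(1,1) is usually applied to at least four observations.
        raise ValueError("need at least 4 observations")

    # Accumulated generating operation (1-AGO): running sums of the series.
    x1 = []
    total = 0.0
    for value in series:
        total += value
        x1.append(total)

    # Background values: means of consecutive accumulated values.
    z = [0.5 * (x1[k] + x1[k - 1]) for k in range(1, n)]
    y = list(series[1:])

    # Ordinary least squares for y = slope * z + b, where slope = -a.
    m = n - 1
    sum_z = sum(z)
    sum_y = sum(y)
    sum_zz = sum(v * v for v in z)
    sum_zy = sum(zv * yv for zv, yv in zip(z, y))
    slope = (m * sum_zy - sum_z * sum_y) / (m * sum_zz - sum_z ** 2)
    b = (sum_y - slope * sum_z) / m
    return -slope, b


def gm11_predict(series, steps):
    """Forecast the next `steps` values of `series` with GM(1,1)."""
    a, b = gm11_fit(series)
    if abs(a) < 1e-12:
        raise ValueError("development coefficient is too close to zero")

    def x1_hat(k):
        # Time-response function of the accumulated series.
        return (series[0] - b / a) * math.exp(-a * k) + b / a

    n = len(series)
    # Inverse accumulation: difference consecutive accumulated estimates.
    return [x1_hat(k) - x1_hat(k - 1) for k in range(n, n + steps)]
```

Calling `gm11_predict([100.0, 110.0, 121.0, 133.1, 146.41], 2)` extrapolates two further values of a roughly exponential series. How such forecasts would feed into or correct the random forest output is not stated in the abstract, so any coupling between the two would be a further assumption.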
