水力发电学报
          Home  |  About Journal  |  Editorial Board  |  Instruction  |  Download  |  Contact Us  |  Ethics policy  |  News  |  中文

Journal of Hydroelectric Engineering ›› 2024, Vol. 43 ›› Issue (9): 124-136.doi: 10.11660/slfdxb.20240911

Previous Articles    

Intelligent interpretation method for non-editable texts of hydraulic concrete materials

  

  • Online:2024-09-25 Published:2024-09-25

Abstract: During the construction of a hydropower project, a large number of non-editable documents for hydraulic concrete materials are generated. Using manual interpretation methods to obtain texts is time-consuming, laborious and accuracy-uncontrollable, making it difficult to meet the demand for information management of material data. This paper develops an intelligent interpretation method for non-editable texts of hydraulic concrete materials. First, we construct a text detection model, HC-PSENet, based on pixel level segmentation, which integrates the backbone network of PP-HGNet to achieve accurate detection of text lines. Then, a professional corpus is created based on the domain knowledge to realize accurate character mapping. We construct a text recognition model HC-CRNN for hydraulic concrete materials, using detection text boxes and the professional corpus as its inputs, and adopt the backbone network of ResNet and the improved loss function C-CTC Loss to improve the accuracy of character classification. Finally, a transfer learning strategy is adopted to train the model with the self-designed dataset as an example; the effectiveness and superiority of our new method is verified through ablation and comparative experiments. The results show that it has a harmonic mean of 0.985 for detecting text regions and its accuracy of text recognition reaches 90.62%. It has an overall performance superior to classical methods and would provide new technical means for the automated reuse of non-editable text resources in concrete materials.

Key words: hydraulic concrete materials, text detection, text recognition, deep learning, domain knowledge

Copyright © Editorial Board of Journal of Hydroelectric Engineering
Supported by:Beijing Magtech