First study of the RuPatient health information system with optical character recognition of medical records based on machine learning
https://doi.org/10.15829/1728-8800-2021-3080
Abstract
RuPatient health information system (HIS) is a computer program consisting of a doctor-patient web user interface, which includes algorithms for recognizing medical record text and entering it into the corresponding fields of the system.
Aim. To evaluate the effectiveness of RuPatient HIS in actual clinical practice.
Material and methods. The study involved 10 cardiologists and intensivists of the department of cardiology and сardiovascular intensive care unit of the L. A. Vorokhobov City Clinical Hospital 67 We analyzed images (scanned copies, photos) of discharge reports from patients admitted to the relevant departments in 2021. The following fields of medical documentation was recognized: Name, Complaints, Anamnesis of life and illness, Examination, Recommendations. The correctness and accuracy of recognition of entered information were analyzed. We compared the recognition quality of RuPatient HIS and a popular optical character recognition application (FineReader for Mac).
Results. The study included 77 pages of discharge reports of patients from various hospitals in Russia from 50 patients (men, 52%). The mean age of patients was 57,7±7,9 years. The number of reports with correctly recognized fields in various categories using the program algorithms was distributed as follows: Name — 14 (28%), Diagnosis — 13 (26%), Complaints — 40 (80%), Anamnesis — 14 (28%), Examination — 24 (48%), Recommendations — 46 (92%). Data that was not included in the category was also recognized and entered in the comments field. The number of recognized words was 549±174,9 vs 522,4±215,6 (p=0,5), critical errors in words — 2,1±1,6 vs 4,4±2,8 (p<0,001), non-critical errors — 10,3±4,3 vs 5,6±3,3 (p<0,001) for RuPatient HIS and optical character recognition application for a personal computer, respectively.
Conclusion. The developed RuPatient HIS, which includes a module for recognizing medical records and entering data into the corresponding fields, significantly increases the document management efficiency with high quality of optical character recognition based on neural network technologies and the automation of filling process.
Keywords
About the Authors
A. A. KomkovNational Medical Research Center for Therapy and Preventive Medicine; L.A. Vorokhobov City Clinical Hospital № 67
Russian Federation
Moscow
V. P. Mazaev
Russian Federation
Moscow
S. V. Ryazanova
Russian Federation
Moscow
D. N. Samochatov
Russian Federation
Moscow
E. V. Koshkina
Russian Federation
Moscow
E. V. Bushueva
Russian Federation
Moscow
O. M. Drapkina
Russian Federation
Moscow
References
1. Halamka J. The Digital Reconstruction of Health Care. NEJM Catal. 2020;1(6):1-12. doi:10.1056/cat.20.0082.
2. Liu Y, Chen PHC, Krause J, et al. How to Read Articles That Use Machine Learning: Users’ Guides to the Medical Literature. JAMA. 2019;322(18):1806-16. doi:10.1001/jama.2019.16489.
3. Burnham JP, Lu C, Yaeger LH, et al. Using wearable technology to predict health outcomes: A literature review. J Am Med Inform Assoc. 2018;25(9):1221-7 doi:10.1093/jamia/ocy082.
4. Komkov AA, Mazaev VP, Ryazanova SV, et al. The main directions of the development of artificial intelligence in medicine. Scientific review. Medical sciences. 2020;5:33-40. (In Russ.) doi:10.17513/srms.1141.
5. Hosny A, Parmar C, Quackenbush J, et al. Artificial intelligence in radiology. Nat Rev Cancer. 2018;18(8):500-10. doi:10.1038/s41568-018-0016-5.
6. AI-Based Document Digitization in Healthcare — What’s Possible [Electronic resource]. Emerj. 2019. URL: https://emerj.com/ai-sector-overviews/document-digitization-healthcare. (01 Dec 2021).
7. 7 Dudchenko PV. Recognition of required data in the text of medical statements. Breakthrough scientific research problems, patterns, prospects. Collection of articles of the XI International Scientific and practical Conference. Penza, October 27, 2018. 2018:49-51. (In Russ.) ISBN: 978-5-907135-19-2.
8. Moskalev IV, Krotova OS, Khvorova LA. Automation of the process of extracting structured data from unstructured medical statements using text mining technologies. High-performance computing systems and technologies. 2020;4(1):163-7 (In Russ.)
9. Rudin RS, Friedberg MW, Shekelle P, et al. Getting Value From Electronic Health Records: Research Needed to Improve Practice. Ann Intern Med. 2020; 172(11):S130-6. doi:10.7326/M19-0878.
10. Evans RS. Electronic Health Records: Then, Now, and in the Future. Yearb Med Inform. 2016):S48-61. doi: 10.15265/IYS-2016-s006.
Supplementary files
Review
For citations:
Komkov A.A., Mazaev V.P., Ryazanova S.V., Samochatov D.N., Koshkina E.V., Bushueva E.V., Drapkina O.M. First study of the RuPatient health information system with optical character recognition of medical records based on machine learning. Cardiovascular Therapy and Prevention. 2021;20(8):3080. (In Russ.) https://doi.org/10.15829/1728-8800-2021-3080