Machine Learning in Medical Diagnosis: From Image Analysis to EHRs

Main Article Content

Xin Wang

Keywords

machine learning, medical diagnosis, electronic health records (EHR), deep learning models

Abstract

This article researched the widespread application of machine learning in medical diagnosis in depth, hoping to solve many of the current challenges in the medical field by improving the accuracy and efficiency of diagnosis. This paper expounds the important role of machine learning in disease prediction and patient risk stratification from three aspects: medical image analysis, electronic health record (EHR) processing and time series data analysis. Through algorithms such as supervised learning, unsupervised learning, and reinforcement learning, machine learning can extract valuable information from massive medical data to help doctors make more accurate diagnostic decisions. In the paper, the application of convolutional neural network (CNN), bidirectional encoder representation (BERT) and long short-term memory network (LSTM) in medical diagnosis is studied. Specifically, CNN performed well in medical image analysis and was able to quickly identify diseased areas; BERT extracts key information from unstructured text through natural language processing to support disease prediction and drug recommendations. LSTM specializes in processing time series data and can predict a patient’s disease risk. In addition, the successful cases of these models in data analysis such as pneumonia detection and heart disease prediction are also introduced. This shows their great potential in the field of medicine. Finally, the paper discusses the challenges of data privacy, model interpretability and algorithm optimization, and looks forward to future research directions such as federated learning, multimodal data fusion and real-time diagnosis, providing theoretical support and practical guidance for the intelligent transformation of healthcare industry.

Abstract 14 | PDF Downloads 7

References

  • Alsentzer, E., Murphy, J. R., Boag, W., Weng, W.-H., Jin, D., Naumann, T. and McDermott, M. B. A., (2019). Published. Publicly available clinical BERT embeddings. roceedings of the 2nd Clinical Natural Language Processing Workshop, Minneapolis, Minnesota. Association for Computational Linguistic, pp. 72-78.
  • Ardila, D., Kiraly, A. P., Bharadwaj, S., Choi, B., Reicher, J. J., Peng, L., Tse, D., Etemadi, M., Ye, W. and Corrado, G., (2019). End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nature medicine, vol. 25, no. 6, pp. 954-961.
  • Ching, T., Himmelstein, D. S., Beaulieu-Jones, B. K., Kalinin, A. A., Do, B. T., Way, G. P., Ferrero, E., Agapow, P.-M., Zietz, M. and Hoffman, M. M., (2018). Opportunities and obstacles for deep learning in biology and medicine. Journal of the royal society interface, vol. 15, no. 141, p. 20170387.
  • Dwork, C. and Roth, A., (2014). The algorithmic foundations of differential privacy. Foundations and trends® in theoretical computer science, vol. 9, no. 3–4, pp. 211-407.
  • Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M. and Thrun, S., (2017). Dermatologist-level classification of skin cancer with deep neural networks. nature, vol. 542, no. 7639, pp. 115-118.
  • Huang, K., Altosaar, J. and Ranganath, R., (2020). Published. ClinicalBERT: Modeling clinical notes and predicting hospital readmission. Proceedings of the ACM Conference on Health, Inference, and Learning, Toronto, ON, Canada. ACM, pp. 1-9.
  • Jiang, F., Jiang, Y., Zhi, H., Dong, Y., Li, H., Ma, S., Wang, Y., Dong, Q., Shen, H. and Wang, Y., (2017). Artificial intelligence in healthcare: past, present and future. Stroke and vascular neurology, vol. 2, no. 4, pp. 230-243.
  • Khera, A. V., Chaffin, M., Aragam, K. G., Haas, M. E., Roselli, C., Choi, S. H., Natarajan, P., Lander, E. S., Lubitz, S. A. and Ellinor, P. T., (2018). Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nature genetics, vol. 50, no. 9, pp. 1219-1224.
  • Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., Van Der Laak, J. A., Van Ginneken, B. and Sánchez, C. I., (2017). A survey on deep learning in medical image analysis. Medical image analysis, vol. 42, pp. 60-88.
  • McKinney, S. M., Sieniek, M., Godbole, V., Godwin, J., Antropova, N., Ashrafian, H., Back, T., Chesus, M., Corrado, G. S. and Darzi, A., (2020). International evaluation of an AI system for breast cancer screening. Nature, vol. 577, no. 7788, pp. 89-94.
  • Miotto, R., Wang, F., Wang, S., Jiang, X. and Dudley, J. T., (2018). Deep learning for healthcare: review, opportunities and challenges. Briefings in bioinformatics, vol. 19, no. 6, pp. 1236-1246.
  • Obermeyer, Z. and Emanuel, E. J., (2016). Predicting the future—big data, machine learning, and clinical medicine. The New England journal of medicine, vol. 375, no. 13, pp. 1216-1219.
  • Organization, W. H., (2016). Global strategy on human resources for health: workforce 2030, Geneva, Switzerland: World Health Organization.
  • Rajpurkar, P., Hannun, A. Y., Haghpanahi, M., Bourn, C. and Ng, A. Y., (2017). Cardiologist-level arrhythmia detection with convolutional neural networks. arXiv preprint arXiv:1707.01836.
  • Rieke, N., Hancox, J., Li, W., Milletari, F., Roth, H. R., Albarqouni, S., Bakas, S., Galtier, M. N., Landman, B. A. and Maier-Hein, K., (2020). The future of digital health with federated learning. NPJ Digital Medicine, vol. 3, no. 1, p. 119.
  • Topol, E. J., (2019). High-performance medicine: the convergence of human and artificial intelligence. Nature medicine, vol. 25, no. 1, pp. 44-56.
  • Zhang, Y., Chen, Q., Yang, Z., Lin, H. and Lu, Z., (2019). BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific data, vol. 6, no. 1, p. 52.

Similar Articles

1-10 of 19

You may also start an advanced similarity search for this article.