PhD
Assistant Professor
Research Areas
Artificial Intelligence to improve health care delivery and clinical research.
Steve "Aokun" Chen

Dr. Chen's training and expertise span machine learning and deep neural networks, with a particular emphasis on natural language processing (NLP) and sequence modeling for clinical data. His work centers on developing and deploying clinical language models to extract actionable information from unstructured health records, including medication histories and social determinants of health, across a range of conditions such as cancer, delirium, and autism spectrum disorder.

EDUCATION

University of Florida, Gainesville, Florida: Postdoctoral Fellow, Biomedical Informatics
University of Florida, Gainesville, Florida: Doctor of Philosophy, Computer Engineering
University of Florida, Gainesville, Florida: Master of Science, Computer Engineering

HONORS AND AWARDS

2018: Certificate of Outstanding Achievement, University of Florida

KEY PUBLICATIONS

Chang TY, Gou Q, Zhao L, Zhou T, Chen H, Yang D, Ju H, Smith KE, Sun C, Pan J, Huang Y, He X, Zhang X, Xu D, Xu J, Bian J, Chen A. From image to report: automating lung cancer screening interpretation and reporting with vision-language models. J Biomed Inform. 2025 Nov;171:104931. doi: 10.1016/j.jbi.2025.104931. Epub 2025 Oct 11. PMID: 41083099; PMCID: PMC12579329.

Xie Q, Chen Q, Chen A, Peng C, Hu Y, Lin F, Peng X, Huang J, Zhang J, Keloth V, Zhou X, He H, Ohno-Machado L, Wu Y, Xu H, Bian J. Me-LLaMA: Foundation Large Language Models for Medical Applications. Res Sq. 2024 May 22; PubMed Central PMCID: PMC11142305.

Peng C, Yang X, Chen A, Yu Z, Smith KE, Costa AB, Flores MG, Bian J, Wu Y. Generative large language models are all-purpose text analytics engines: text-to-text learning is all your need. J Am Med Inform Assoc. 2024 Sep 1;31(9):1892-1903. PubMed Central PMCID: PMC11339507.

Yang X, Chen A, PourNejatian N, Shin HC, Smith KE, Parisien C, Compas C, Martin C, Costa AB, Flores MG, Zhang Y, Magoc T, Harle CA, Lipori G, Mitchell DA, Hogan WR, Shenkman EA, Bian J, Wu Y. A large language model for electronic health records. NPJ Digit Med. 2022 Dec 26;5(1):194. PubMed Central PMCID: PMC9792464.

Chen A, Yu Z, Yang X, Guo Y, Bian J, Wu Y. Contextualized medication information extraction using Transformer-based deep learning architectures. J Biomed Inform. 2023 Jun;142:104370. PubMed Central PMCID: PMC10980542.

Yang S, Yang X, Lyu T, Huang JL, Chen A, He X, Braithwaite D, Mehta HJ, Wu Y, Guo Y, Bian J. Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports of Lung Cancer Screening Patients Using Transformer Models. J Healthc Inform Res. 2024 Sep;8(3):463-477. PubMed Central PMCID: PMC11310180.