<span>AI’s blind spots, one PhD student’s clear vision </span> <span><span>Ryley McGinnis</span></span> <span><time datetime="2025-08-11T14:36:42-04:00" title="Monday, August 11, 2025 - 14:36">Mon, 08/11/2025 - 14:36</time> </span> <div class="layout layout--gmu layout--twocol-section layout--twocol-section--70-30"> <div class="layout__region region-first"> <div data-block-plugin-id="field_block:node:news_release:body" class="block block-layout-builder block-field-blocknodenews-releasebody"> <div class="field field--name-body field--type-text-with-summary field--label-visually_hidden"> <div class="field__label visually-hidden">Body</div> <div class="field__item"><p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun CommentStart CommentHighlightPipeClicked CommentHighlightClicked intro-text" lang="EN-US">When</span><span class="TextRun SCXW153625060 BCX0 NormalTextRun CommentHighlightPipeClicked intro-text" lang="EN-US"> artificial intelligence makes a mistake, the consequences can be annoying, or they can be life-altering. For one George Mason University </span><span class="TextRun SCXW153625060 BCX0 NormalTextRun intro-text" lang="EN-US">computer science PhD student, the stakes are clear: biased algorithms in health care aren’t just a technical flaw; they’re a human risk. 
And with internship experience teaching him crucial skills beyond the classroom, Fardin Ahsan Sakib is ready to make a difference in health care.</span><span class="EOP SCXW153625060 BCX0 intro-text">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">Sakib’s research in natural language processing (NLP) addresses concerns that many people have about emerging large language models (LLMs), concerns that carry deeper risks when the models are applied to health decisions. Tools like ChatGPT pull from vast amounts of information, but they are known to sometimes “hallucinate,” or make up information.&nbsp;</span><span class="EOP SCXW153625060 BCX0"> &nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">“Sometimes these systems are taking a shortcut to get you an answer,” said Sakib. “But regardless of where it comes from, it can provide you incorrect information.” In health care, these LLMs could aid doctors, but not if they end up driving improper care.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">In electronic health records (EHRs), patients’ diagnoses, medical history, and demographic data are stored and organized. 
These records also include social determinants of health, such as employment status, family situation, and housing conditions, which are often hidden within the records.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <figure role="group" class="align-right"> <div> <div class="field field--name-image field--type-image field--label-hidden field__item"> <img src="/sites/g/files/yyqcgq291/files/2025-08/fardin-inline-400x600x.jpg" width="400" height="600" loading="lazy"> </div> </div> <figcaption>Fardin Ahsan Sakib is seeking to bring accurate and reliable large language model capabilities to health care.&nbsp;</figcaption> </figure> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">“These factors can affect up to 80% of health outcomes,” Sakib said. “So it's very important that we can extract them correctly from clinical notes and that the model is not introducing any bias.”</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">Sakib poses a scenario: A doctor is using an NLP tool to quickly assess a new patient’s health. The LLM is pulling from the patient’s records, and the physician asks the system, “Is this patient a smoker?” The system quickly sifts through the data and says that yes, this patient is a smoker. The physician then proceeds to make care recommendations based on this information. 
Maybe they order a lung cancer screening, or they speak with the patient about smoking cessation resources.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">But what if that answer was incorrect? The LLM took a shortcut: based on the information it has, most people with demographic data similar to the patient’s are indeed smokers, so it answered yes. But this patient isn’t a smoker.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">“There are two things at play: bias and hallucination,” said Sakib. “First, algorithms can only use the information they have, and if that information is missing an entire racial or ethnic group, it can be biased. Second, these models can use this biased information to take shortcuts, leading to them delivering false information.”&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">Sakib isn't simply identifying these problems; he's developing solutions. 
His recent research on detecting and mitigating shortcut learning in health record processing was accepted for presentation at the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), one of the premier NLP conferences.</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="EOP SCXW153625060 BCX0"> </span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">That desire to build reliable and trustworthy NLP tools has driven both Sakib’s academic work and his professional pursuits.</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="EOP SCXW153625060 BCX0"> </span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">At Brillient Corporation, where he interned last summer, Sakib worked on creating a retrieval-augmented generation system that connected an LLM to the Food and Drug Administration’s (FDA) knowledge base, making information retrieval faster and keeping answers grounded in facts. He and his colleagues submitted a patent for this effort.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">This summer, he's deep into another high-impact internship at Amazon. 
“Amazon wants to automate support so that when you ask for help, a large language model can try to solve the problem before handing it off to a human,” he said.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">Despite the different settings, he sees clear continuity between his internships and academic work. “The industry experience helps me in my research, and my research experience helps in industry. It goes both ways,” he said. “In academia, collaboration is usually focused within a lab or research group. In industry, I’ve worked with engineers, product managers, and domain experts all at once, spanning from health regulators to AWS cloud architects. That diversity of perspectives changes how you solve problems, and I’ve brought that mindset back into my research collaborations.”</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">It’s that dual perspective, academic precision paired with industry scale, that he plans to take with him into industry after graduation.</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> <p class="Paragraph SCXW153625060 BCX0"><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US"></span><span class="TextRun SCXW153625060 BCX0 NormalTextRun" lang="EN-US">“George Mason has prepared me for a lot. 
From day one, all of the professors have helped me grow as a researcher, a person, and as a team member,” said Sakib.&nbsp;</span><span class="EOP SCXW153625060 BCX0">&nbsp;</span></p> </div> </div> </div> </div> <div class="layout__region region-second"> <div data-block-plugin-id="inline_block:call_to_action" data-inline-block-uuid="c7dbce4f-4584-4b4c-bf4a-99dd6ac72e1b"> <div class="cta"> <a class="cta__link" href="http://www.gmu.edu/AI"> <p class="cta__title">More on AI Research at George Mason <i class="fas fa-arrow-circle-right"></i> </p> <span class="cta__icon"></span> </a> </div> </div> <div data-block-plugin-id="inline_block:text" data-inline-block-uuid="d2a6f63a-6348-4ee8-ac31-0f37683e2c7f" class="block block-layout-builder block-inline-blocktext"> <div class="field field--name-body field--type-text-with-summary field--label-hidden field__item"><hr> <p>&nbsp;</p> </div> </div> <div data-block-plugin-id="inline_block:news_list" data-inline-block-uuid="ea2ce9c0-be44-4f93-bfb9-b274fce7f55f" class="block block-layout-builder block-inline-blocknews-list"> <h2>Related News</h2> <div class="views-element-container"><div class="view view-news view-id-news view-display-id-block_1 js-view-dom-id-565a80128321ab1e64aa62f883264dd545ad3e62bf6ade4b75c26d8497cf7ce0"> <div class="view-content"> <div class="news-list-wrapper"> <ul class="news-list"> <li class="news-item"><div class="views-field views-field-title"><span class="field-content"><a href="/news/2025-09/george-masons-chief-information-security-officer-protects-keys-kingdom" hreflang="en">George Mason’s Chief Information Security Officer protects the keys to the kingdom</a></span></div><div class="views-field views-field-field-publish-date"><div class="field-content">September 4, 2025</div></div></li> <li class="news-item"><div class="views-field views-field-title"><span class="field-content"><a href="/news/2025-09/mason-korea-hosts-young-innovators-summer-camp-local-youth" hreflang="en">Mason Korea hosts Young Innovators 
Summer Camp for local youth</a></span></div><div class="views-field views-field-field-publish-date"><div class="field-content">August 29, 2025</div></div></li> <li class="news-item"><div class="views-field views-field-title"><span class="field-content"><a href="/news/2025-08/cehd-and-university-libraries-are-building-tool-improve-navigating-education-research" hreflang="en">CEHD and University Libraries are building a tool to improve navigating education research</a></span></div><div class="views-field views-field-field-publish-date"><div class="field-content">August 28, 2025</div></div></li> <li class="news-item"><div class="views-field views-field-title"><span class="field-content"><a href="/news/2025-08/harnessing-vr-prevent-substance-use-relapse" hreflang="en">Harnessing VR to prevent substance use relapse </a></span></div><div class="views-field views-field-field-publish-date"><div class="field-content">August 15, 2025</div></div></li> <li class="news-item"><div class="views-field views-field-title"><span class="field-content"><a href="/news/2025-08/ais-blind-spots-one-phd-students-clear-vision" hreflang="en">AI’s blind spots, one PhD student’s clear vision </a></span></div><div class="views-field views-field-field-publish-date"><div class="field-content">August 12, 2025</div></div></li> </ul> </div> </div> </div> </div> </div> <div data-block-plugin-id="field_block:node:news_release:field_content_topics" class="block block-layout-builder block-field-blocknodenews-releasefield-content-topics"> <h2>Topics</h2> <div class="field field--name-field-content-topics field--type-entity-reference field--label-visually_hidden"> <div class="field__label visually-hidden">Topics</div> <div class="field__items"> <div class="field__item"><a href="/taxonomy/term/4656" hreflang="en">Artificial Intelligence</a></div> <div class="field__item"><a href="/taxonomy/term/21196" hreflang="en">large language models</a></div> <div class="field__item"><a href="/taxonomy/term/2186" 
hreflang="en">computer science</a></div> <div class="field__item"><a href="/taxonomy/term/21126" hreflang="en">Long Nguyen and Kimmy Duong School of Computing</a></div> <div class="field__item"><a href="/taxonomy/term/336" hreflang="en">Students</a></div> <div class="field__item"><a href="/taxonomy/term/721" hreflang="en">internships</a></div> </div> </div> </div> </div> </div>