When the COVID-19 pandemic began, scientists of all disciplines rushed to apply their skills and knowledge to the global crisis. That included computer scientists at the University of Chicago and Argonne National Laboratory, who joined an international collaboration to adapt AI language models for discovering variants of the SARS-CoV-2 virus and the potential for new more dangerous or transmissible forms.

At this year’s International Conference for High Performance Computing, Networking, Storage, and Analysis (SC22), that collaboration was given the ACM Gordon Bell Special Prize for HPC-Based COVID-19 Research. The paper, “GenSLMs: Genome-Scale Language Models Reveal SARS-CoV-2 Evolutionary Dynamics,” was honored for outstanding research achievement toward the understanding of the COVID-19 pandemic through the use of high-performance computing.

Members of the GenSLMs team gather in front of the U.S. Department of Energy’s booth at the SC22 conference.

UChicago authors on the paper included Professors of Computer Science Ian Foster and Rick Stevens, PhD students Alexander Brace, Austin Clyde, and J. Gregory Pauloski, undergraduate student Diangen (Dana) Lin, and postdoctoral researcher Valerie Hayot-Sasson. The collaboration also included several scientists from our partner Argonne National Laboratory, who wrote a feature piece on the award-winning research:

“When the pandemic began, we had several of these really harmful variants of the virus, like the Delta variant,” said Argonne computational biologist Arvind Ramanathan. ​“It resulted in a large death toll. But Delta evolved as a consequence of certain mutations that were happening when the virus was facing the human hosts. It’s a process of evolution of the virus inside of the human cell.”

Trained on a year’s worth of SARS-CoV-2 genome data, the model can infer the distinction between various viral strains. Each dot on the left corresponds to a sequenced SARS-CoV-2 viral strain, color coded by variant. The figure on the right zooms into one strain of the virus, which captures evolutionary couplings across the viral proteins specific to this strain. (Image by Argonne National Laboratory/Bharat Kale, Max Zvyagin and Michael E. Papka)

Their work resulted in the first genome-scale language model (GenSLM), which is a model that can analyze genes and rapidly identify VOCs. The model discussed in the paper was trained on data from the COVID-19 pandemic, and the hope is that models like this could potentially give health officials the tools they need to quickly respond to rising variants. GenSLM is the first whole genome-scale foundation model that can be altered and applied to other prediction tasks similar to VOC identification.

While these evolutionary variants may seem to crop up randomly to the human eye, tracking them is of the utmost concern. As such, the work of Ramanathan and his colleagues could seriously alter how we stay on top of viral outbreaks.

Read more about the project and how it utilized Argonne Leadership Computing Facility resources such as the Polaris supercomputer and the ALCF AI Testbed.


Related News

More UChicago CS stories from this research area.
UChicago CS News

Computer Science Displays Catch Attention at MSI’s Annual Robot Block Party

Apr 07, 2023
UChicago CS News

UChicago, Stanford Researchers Explore How Robots and Computers Can Help Strangers Have Meaningful In-Person Conversations

Mar 29, 2023
UChicago CS News

Postdoc Alum John Paparrizos Named ICDE Rising Star

Mar 15, 2023
UChicago CS News

New EAGER Grant to Asst. Prof. Eric Jonas Will Explore ML for Quantum Spectrometry

Mar 03, 2023
UChicago CS News

Assistant Professor Chenhao Tan Receives Sloan Research Fellowship

Feb 15, 2023
UChicago CS News

UChicago Scientists Develop New Tool to Protect Artists from AI Mimicry

Feb 13, 2023
In the News

Chicago Magazine on Aurora, “The Computer That Will Change Everything”

Feb 01, 2023
In the News

Professors Rebecca Willett and Ben Zhao Discuss the Future of AI on Public Radio

Jan 26, 2023
UChicago CS News

UChicago Launches Transform Accelerator for Data Science & Emerging AI Startups

Jan 19, 2023
Two students looking at a wearable device
UChicago CS News

High School Students Find Their Place in Computing Through Wearables Workshop

Jan 13, 2023

Ian Foster – Better Information Faster: Programming the Continuum

Jan 06, 2023
UChicago CS News

Q&A: Ian Foster on Receiving the 2023 IEEE Internet Award

Jan 06, 2023
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube