Tandy Warnow (UIUC) – Theoretical and Empirical Advances in Large-Scale Species Tree Estimation

Date & Time:

May 24, 2019 12:00 pm – 1:00 pm

Location:

Crerar 390, 5730 S. Ellis Ave., Chicago, IL,

05/24/2019 12:00 PM 05/24/2019 01:00 PM America/Chicago Tandy Warnow (UIUC) – Theoretical and Empirical Advances in Large-Scale Species Tree Estimation Center for Data and Computing Crerar 390, 5730 S. Ellis Ave., Chicago, IL,

Theoretical and Empirical Advances in Large-Scale Species Tree Estimation

The estimation of the “Tree of Life” — a phylogeny encompassing all life on earth–is one of the big Scientific Grand Challenges. Maximum likelihood (ML) is a standard approach for phylogeny estimation, but estimating ML trees for large heterogeneous datasets is challenging for two reasons: (1) ML tree estimation is NP-hard (and the best current heuristics can use hundreds of CPU years on relatively small datasets, just to find local optima), and (2) the statistical models used in ML tree estimation methods are much too simple, failing to acknowledge heterogeneity across genomes or across the Tree of Life. These two “big data” issues — dataset size and heterogeneity — impact the accuracy of phylogenetic methods and have consequences for downstream analyses.

In this talk, I will describe a new “divide-and-conquer” approach to phylogeny estimation that addresses both types of heterogeneity. Our protocol operates as follows: (1) we divide the set of species into disjoint subsets, (2) we construct trees on the subsets (using appropriate statistical methods), and (3) we combine the trees together using auxiliary information, such as a matrix of pairwise distances. I will present three such strategies (all published in the last year) that operate in this fashion, and that improve the theoretical and empirical performance of phylogeny estimation methods. One of the main applications of this work is species tree estimation from multi-locus data sets when gene trees can differ from the species tree due to incomplete lineage sorting. This talk is largely based on joint work with my PhD student, Erin Molloy (Illinois).

Tandy Warnow

Founder Professor of Computer Science, University of Illinois at Urbana-Champaign

Tandy Warnow is the Founder Professor of Computer Science at the University of Illinois at Urbana-Champaign, where she is also an affiliate in Mathematics, Statistics, Bioengineering, Electrical and Computer Engineering, Animal Biology, Entomology, and Plant Biology. Tandy received her PhD in Mathematics at UC Berkeley under the direction of Gene Lawler, and did postdoctoral training with Simon Tavaré and Michael Waterman at USC. Her research combines computer science, statistics, and discrete mathematics, focusing on developing improved models and algorithms for reconstructing complex and large-scale evolutionary histories in biology and historical linguistics. She has published more than 160 papers and one textbook, graduated 11 PhD students, and has 5 current PhD students. She has been a visiting faculty member at many universities, including Princeton University, the University of Maryland, Yale University, Ecole Polytechnique Fédérale de Lausanne (EPFL), and Harvard University. Her awards include the NSF Young Investigator Award (1994), the David and Lucile Packard Foundation Award (1996), a Radcliffe Institute Fellowship (2006), and the John Simon Guggenheim Foundation Fellowship (2011). She was elected a Fellow of the Association for Computing Machinery (ACM) in 2015 and of the International Society for Computational Biology (ISCB) in 2017. Her national service includes being the lead NSF program officer for BigData (2012-2013), chairing the BioData Management and Analysis (BDMA) study section at NIH (2010-2012). Tandy was also a member of the Big Data Senior Steering Group of NITRD subcommittee of the National Technology Council (coordinating federal agencies), 2012-2013.

Resources

Community

University of Chicago PhD Graduates Secure Tenure-Track Faculty Positions Amid a Competitive Job Market

Democratizing Digital Graphics: An Undergrad’s Unlikely Path To Putting Agency of 3D-Generation in Users’ Hands

Faculty Spotlight: Get to Know Kexin Pei

The Future of AI Panel: Alumni Weekend

Can we authenticate human creativity?

AI and the Future of Work Panel: Featuring Nick Feamster

Theoretical and Empirical Advances in Large-Scale Species Tree Estimation

Tandy Warnow

“Machine Learning Foundations Accelerate Innovation and Promote Trustworthiness” by Rebecca Willett

Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao

In The News: U.N. Officials Urge Regulation of Artificial Intelligence

UChicago Computer Scientists Bring in Generative Neural Networks to Stop Real-Time Video From Lagging

Computer Science Displays Catch Attention at MSI’s Annual Robot Block Party

UChicago, Stanford Researchers Explore How Robots and Computers Can Help Strangers Have Meaningful In-Person Conversations

Postdoc Alum John Paparrizos Named ICDE Rising Star

New EAGER Grant to Asst. Prof. Eric Jonas Will Explore ML for Quantum Spectrometry

Assistant Professor Chenhao Tan Receives Sloan Research Fellowship

UChicago Scientists Develop New Tool to Protect Artists from AI Mimicry

Professors Rebecca Willett and Ben Zhao Discuss the Future of AI on Public Radio

UChicago Launches Transform Accelerator for Data Science & Emerging AI Startups