Michael Maire, An Architect of Deep Learning, Joins CS from TTIC

In 2012, the ImageNet computer vision competition was the breakthrough moment for a new top contender for artificial intelligence applications: deep learning. The surprise victory of this approach over traditional machine learning methods revived interest in deep convolutional neural networks (CNNs), a decades-old concept rejuvenated by big data and more powerful computing. Since then, researchers have further explored these models for visual tasks such as image classification and face recognition, and expanded their use into robotics, natural language processing, computational biology, and other areas.

At the time of the ImageNet competition, new UChicago CS assistant professor Michael Maire was a postdoctoral researcher in the Caltech Computer Vision Lab. His work since, including 4 years at the Toyota Technological Institute at Chicago, has focused on the architectures of these CNNs, studying their underlying structure and the modifications that will help apply deep learning to new, more complicated applications.

“The goal is not just to focus on efficiency improvements,” Maire said, “but to get some understanding of what design details matter in the neural network architecture itself and to put engineering and design effort into that architecture, so that we can get new capabilities out of the neural network and train it to accomplish more complex tasks.”

Deep neural networks were originally inspired by the anatomy of the human brain, where information is processed by intricately connected systems of neurons. In a computational neural network, multiple connected layers take in raw input data, such as the pixels of an image, and gradually transform the information, identifying features and eventually assigning a label, such as determining whether the image contains a human or a dog.

Computer scientists like Maire have explored different sizes and structures for these networks, with the “deep” in deep learning typically referring to networks with tens or even hundreds of layers. Maire’s work also looks at how these networks can be trained, including on data with little or no human labeling, and higher-order visual capabilities, such as understanding the detailed composition of scenes containing many objects. These functions will be particularly useful as engineers further develop autonomous vehicles, robotics, and other technologies that rely upon advanced computer vision.

“The goal is human-level understanding and perception of the visual environment,” Maire said. “We’re moving towards models that learn something about the environment when presented with new objects, and that are capable of making decisions on the fly.”

In addition to studying the architecture of CNNs, Maire also contributes to the data they are tested with through his work on the COCO (Common Objects in Context) dataset. Comprised of over 330,000 images of complex everyday scenes, COCO provides a target for scientists to test new methods in object detection, captioning, and segmentation. A workshop and challenges take place each year, alternating between the ICCV and ECCV conferences.

As a nearby neighbor at TTIC, Maire already worked with UChicago graduate students. Last year, he published a paper with UChicago CS PhD student Gustav Larsson on building a network to automatically colorize images. As data for this task can be collected automatically, without human labeling effort, the network can learn in a self-supervised manner. In addition, colorization serves as a proxy task for larger goals of scene understanding; naming a plausible color for an object is linked to understanding its identity. This fall, Maire will teach a computer vision course that will alternate between the university and TTIC.

For more on Maire’s research, visit his webpage at http://ttic.uchicago.edu/~mmaire/

Related News

More UChicago CS stories from this research area.
UChicago CS News

New 2022-23 CS Faculty Add Expertise in Linguistics, Visualization, Economics, and Data Science Education

Aug 11, 2022
In the News

UChicago Co-Leads $10 Million NSF Institute on Foundations of Data Science

Aug 09, 2022
UChicago CS News

Head’s Up: UChicago CS Grad Student Designs Device That Directs User’s Head

Jul 26, 2022
UChicago CS News

UChicago CS Faculty Receive Industry Grants From J.P. Morgan, Google

Jul 19, 2022
UChicago CS News

UChicago London Colloquium Features Data Science, Quantum Research

Jul 01, 2022

Is it Ethical to Use Facial Imaging in Decision-Making?

Jun 28, 2022
UChicago CS News

Single Sign-On Migration for Chameleon Project Receives PEARC Best Paper Award

Jun 27, 2022
UChicago CS News

EPiQC Post-Doc Pens Op-Ed on Potential of Quantum Computing for Chemistry

Jun 24, 2022
UChicago CS News

Faculty Bill Fefferman and Chenhao Tan Receive Google Research Scholar Awards

Jun 21, 2022
UChicago CS News

Two Incoming UChicago CS PhD Students Receive Department of Energy Fellowship

Jun 16, 2022
UChicago CS News

Prof. Yanjing Li Receives Under-40 Innovators Award from DAC

Jun 15, 2022

Data Science Institute Summit

Jun 15, 2022
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube