Lihong Li, a senior principal scientist in Amazon Ads *(top left)*, has won the 2023 Seoul Test of Time Award for the 2010 paper “A Contextual-Bandit Approach to Personalized News Article Recommendation.” His coauthors are Wei Chu *(top right)*, senior director of engineering for Ant Group; John Langford *(bottom left)*, a partner research manager at Microsoft; and Robert E. Schapire *(bottom right)*, a partner researcher at Microsoft.

Lihong Li wins 2023 Seoul Test of Time Award

The Amazon senior principal scientist coauthored a 2010 paper that introduced a new way to develop algorithms that make personalized recommendations for website users.

By Staff writer

May 17, 2023


Lihong Li, a senior principal scientist in Amazon Ads, has won the 2023 Seoul Test of Time Award for the 2010 paper “A Contextual-Bandit Approach to Personalized News Article Recommendation.” The paper, coauthored by Wei Chu, John Langford, and Robert E. Schapire, introduced a contextual-bandit approach to building personalized recommendation engines.

The Seoul Test of Time Award “is awarded annually to the author or authors of a paper presented at a previous World Wide Web conference that has, as the name suggests, stood the test of time.”


Contextual bandits

The paper proposed a contextual-bandit approach to driving personalized recommendations in news content “in which a learning algorithm sequentially selects articles to serve users based on contextual information about the users and articles, while simultaneously adapting its article-selection strategy based on user-click feedback to maximize total user clicks.”

“News content changes every hour within the day,” said Li. “That’s why we need a solution to quickly adapt to changing content, and recommend the best content to users.” In doing so, the solution has to balance two competing goals: maximizing user satisfaction now (exploitation) and gathering information about the “goodness of match” between user interest and content (exploration). Contextual bandits are a special class of reinforcement learning problems that are well suited to this scenario.
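To make the two competing goals concrete, here is a minimal sketch of a contextual bandit in Python. It is not the algorithm from the paper: it uses a simple epsilon-greedy rule as a stand-in, treats a coarse user segment as the only context, and tracks click-through rates per segment and article. The class name, segment labels, article names, and click rates are all hypothetical, chosen only for illustration.

```python
import random
from collections import defaultdict


class EpsilonGreedyContextualBandit:
    """Illustrative only: estimates click-through rate per (segment, article)."""

    def __init__(self, articles, epsilon=0.1, seed=0):
        self.articles = list(articles)
        self.epsilon = epsilon            # fraction of traffic spent exploring
        self.rng = random.Random(seed)
        self.shows = defaultdict(int)     # times each (segment, article) was served
        self.clicks = defaultdict(int)    # clicks observed for each pair

    def estimate(self, segment, article):
        """Observed click-through rate, or 0.0 if the pair was never served."""
        shows = self.shows[(segment, article)]
        return self.clicks[(segment, article)] / shows if shows else 0.0

    def select(self, segment):
        """Pick an article to serve to a user in the given segment."""
        if self.rng.random() < self.epsilon:
            # Explore: serve a random article to learn more about it.
            return self.rng.choice(self.articles)
        # Exploit: serve the article with the best estimate for this segment.
        return max(self.articles, key=lambda a: self.estimate(segment, a))

    def update(self, segment, article, clicked):
        """Adapt the estimates using the user's click feedback."""
        self.shows[(segment, article)] += 1
        self.clicks[(segment, article)] += int(clicked)


def simulate(rounds=5000, seed=0):
    """Run the bandit against made-up click probabilities (assumed values)."""
    true_ctr = {
        ("sports_fan", "sports"): 0.30, ("sports_fan", "politics"): 0.05,
        ("news_junkie", "sports"): 0.05, ("news_junkie", "politics"): 0.30,
    }
    rng = random.Random(seed)
    bandit = EpsilonGreedyContextualBandit(["sports", "politics"], seed=seed)
    for _ in range(rounds):
        segment = rng.choice(["sports_fan", "news_junkie"])
        article = bandit.select(segment)
        clicked = rng.random() < true_ctr[(segment, article)]
        bandit.update(segment, article, clicked)
    return bandit


if __name__ == "__main__":
    bandit = simulate()
    for segment in ("sports_fan", "news_junkie"):
        for article in bandit.articles:
            print(segment, article, round(bandit.estimate(segment, article), 3))
```

The `epsilon` parameter controls the trade-off: a higher value gathers information faster at the cost of serving more articles that are not the current best guess. A production system would typically use richer context features and a more sample-efficient exploration strategy than this sketch.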

The path to the prize

Li received a bachelor of engineering in computer science and technology from Tsinghua University in Beijing, then earned a master of science in computing science at the University of Alberta. He earned his PhD in computer science from Rutgers University, working in the area of reinforcement learning.

During his time at Rutgers, Li met two mentors who would later become coauthors on the award-winning paper. Schapire was a Princeton professor on Li’s thesis defense committee, and Langford was Li’s internship mentor at Yahoo! in 2007. In October 2020, Li joined Amazon as a senior principal scientist.