Eugene Yan
Eugene Yan is an applied scientist at Amazon, but he’s also known for his personal site where he covers topics like machine learning systems, data science methodology, dealing with imposter syndrome, and building data science teams.
Courtesy of Eugene Yan

Eugene Yan and the art of writing about science

Why the Amazon applied scientist takes the time to break down his work for readers.  

Eugene Yan’s career path has taken some unusual turns, but his motivation has always been the same: understanding people so he can help them. A policy analyst turned data scientist, Yan is now an applied scientist at Amazon using customer-behavior data to help recommend the best products. In the world of machine learning, however, he’s best known for the way he writes it all down. On his personal site, Yan covers a range of professional and technical topics like machine learning systems, data science methodology, dealing with imposter syndrome, and building data science teams.

Eugene Yan started eugeneyan.com in 2020, focusing on general machine learning and career content. Initially it was for personal development, but then people started reading, and now writing posts takes up the majority of his leisure time.

He started the site for personal development, but then people started reading, his network started expanding, and now writing posts takes up the majority of his leisure time. “It snowballed,” he said. “Writing helps me learn better. And when I share it online, it attracts like-minded readers and helps me make new friends. ”

Born in Singapore, Yan studied psychology at Singapore Management University. “I was curious about people, how they perceive and how they behave,” he said. His college research focused on how competition affects people differently — motivating some and intimidating others. After college, he joined the Singapore government as a policy analyst sifting through legal cases and trade agreements. But it wasn’t long before he began to miss crunching numbers and following the data on human behavior. “I began to envy my colleagues in commodities who relied on numbers and worked with spreadsheets,” Yan said.

He decided to try and make the switch to a data science position based on some familiarity with the subject from his undergraduate research days. He landed a position at IBM in 2013, and from there he moved to data science roles at Lazada, a Southeast Asian e-commerce site, and then UCARE.AI, a healthcare startup.

A desire to help

“In every change in my career, what drives me is helping people,” Yan noted.

At IBM, it was helping people find new roles. At Lazada, it involved helping people find products they need. At UCARE.AI, it entailed predicting chronic diseases and preventing high insurance payouts. “This brings me way more satisfaction than dollars and cents,” he explained.

While at Lazada, Yan decided he needed more training in the fundamentals and pursued a master’s in computer science from the Georgia Institute of Technology. He graduated in 2019, and then he and his wife began considering a move overseas. He applied for a position at Amazon, drawn to the company’s leadership principles and the ability to help customers read more. He relocated to Seattle to join Amazon in 2020.

While Amazon has several ways to help readers find books, from Amazon Book Review to Amazon Charts, Yan is part of a team developing the recommendation systems that power the widgets behind the Amazon Store’s personalized book suggestions. “Customers tell us what they like based on what they do,” he explained. “They browse for a specific book, a genre or a topic.” His team uses those signals to help surface additional books a reader might like. Ultimately, Yan and his team want to make reading easier.

Writing it all down

Early in his transition to data science, Yan started interviewing mentors for advice, some of whom were “rock star data scientists.” He asked what skills he should cultivate to be successful. The one skill a majority of mentors suggested was communication. The people he spoke with emphasized how communication becomes more and more important as you rise in the ranks. “I was like, ‘Are you kidding me?’ But more and more mentors said the same thing,” he recalls. “I thought, ‘This can’t be right, but I'm just going to try it.’”

Yan started practicing his writing, first publishing to a WordPress site. He wrote dozens of posts unnoticed, but then in 2020 created eugeneyan.com and started writing more general machine learning and career content. His writing began to gain an audience. Posts like “Unpopular opinion — Data scientists should be more end-to-end” received more than 500 likes on Twitter. One post on note-taking received 35,000 unique views in a single day. Feedback and praise began to pour in, and his “practice” website swelled into something much bigger.

For a brief period, Yan tried to sustain this level of social engagement. He wrote to please a mass audience and get clicks. “That quickly became unfulfilling,” he said. Now he focuses his writing on topics he wants to learn and aims for an audience of people he’d hope to be friends or colleagues with. “I might have fewer readers now since I’m choosing more technical topics, but these readers comment, disagree, and email me. Each comment and real relationship are worth more than 10,000 likes,” he said.

The many benefits of good writing

Yan's decision to become a better communicator and writer is especially valuable at Amazon. “The writing culture is rigorous at Amazon,” he said.

Amazon’s working backward method starts with an individual or team imagining the product or service is ready to launch. The individual or team’s first step is to draft a press release announcing the product’s availability, and explaining its significance. Moreover, meetings often start with participants reading a six-page document about the meeting’s topic before discussion begins.

Finding your voice and niche doesn’t happen overnight — you have to write and share your work. So just start somewhere, anywhere, and keep writing.
Eugene Yan

“I write as many documents as I code,” Yan said. Recently he received feedback that one of his design documents was easy to understand and clearly laid out everything the reader needed to know. In this way, his writing skills complement his design and machine learning skills. He also started a new site, Applying ML, which includes interviews with machine learning practitioners.

Yan is often asked by aspiring writers for advice on how they can improve their skills. The number one piece of direction he offers is to write for yourself — what do you want to learn and clarify your thinking on? — rather than social engagement. The second piece of advice: “just write.” The best way to figure out your niche and your audience is to simply put fingers to keyboard and start practicing, he said. Maybe after a dozen — or a few dozen — pieces you find your voice, what you want to write about, or what resonates best with the people reading along.

“If you never start writing, how will you know? Just like Blue Origin’s motto ‘Gradatim Ferociter’, which means ‘Step by step, ferociously’. Finding your voice and niche doesn’t happen overnight — you have to write and share your work,” he said. “So just start somewhere, anywhere, and keep writing.”

Research areas

Related content

US, WA, Bellevue
The Artificial General Intelligent team (AGI) seeks an Applied Scientist with a strong background in machine learning and production level software engineering to spearhead the advancement and deployment of cutting-edge ML systems. As part of this team, you will collaborate with talented peers to create scalable solutions for an innovative conversational assistant, aiming to revolutionize user experiences for millions of Alexa customers. The ideal candidate possesses a solid understanding of machine learning fundamentals and has experience writing high quality software in production setting. The candidate is self-motivated, thrives in ambiguous and fast-paced environments, possess the drive to tackle complex challenges, and excel at swiftly delivering impactful solutions while iterating based on user feedback. Join us in our mission to redefine industry standards and provide unparalleled experiences for our customers. Key job responsibilities You will be expected to: · Analyze, understand, and model customer behavior and the customer experience based on large scale data · Build and measure novel online & offline metrics for personal digital assistants and customer scenarios, on diverse devices and endpoints · Create, innovate and deliver deep learning, policy-based learning, and/or machine learning based algorithms to deliver customer-impacting results · Build and deploy automated model training and evaluation pipelines · Perform model/data analysis and monitor metrics through online A/B testing · Research and implement novel machine learning and deep learning algorithms and models. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA | Boston, MA, USA
CA, ON, Toronto
Looking for your next challenge? North America Sort Centers (NASC) are experiencing growth and looking for a skilled, highly motivated Data Scientist to join the NASC Engineering Data, Product and Simulation Team. The Sort Center network is the critical Middle-Mile solution in the Amazon Transportation Services (ATS) group, linking Fulfillment Centers to the Last Mile. The experience of our customers is dependent on our ability to efficiently execute volume flow through the middle-mile network. Key job responsibilities The Data Scientist will design and implement solutions to address complex business questions using simulation. In this role, you will apply advanced analysis techniques and statistical concepts to draw insights from massive datasets, and create intuitive simulations and data visualizations. You can contribute to each layer of a data solution – you work closely with process design engineers, business intelligence engineers and technical product managers to obtain relevant datasets and create simulation models, and review key results with business leaders and stakeholders. Your work exhibits a balance between scientific validity and business practicality. On this team, you will have a large impact on the entire NASC organization, with lots of opportunity to learn and grow within the NASC Engineering team. This role will be the first dedicated simulation expert, so you will have an exceptional opportunity to define and drive vision for simulation best practices on our team. To be successful in this role, you must be able to turn ambiguous business questions into clearly defined problems, develop quantifiable metrics and deliver results that meet high standards of data quality, security, and privacy. About the team NASC Engineering’s Product and Analytics Team’s sole objective is to develop tools for under the roof simulation and optimization, supporting the needs of our internal and external stakeholders (i.e Process Design Engineering, NASC Engineering, ACES, Finance, Safety and Operations). We develop data science tools to evaluate what-if design and operations scenarios for new and existing sort centers to understand their robustness, stability, scalability, and cost-effectiveness. We conceptualize new data science solutions, using optimization and machine learning platforms, to analyze new and existing process, identify and reduce non-value added steps, and increase overall performance and rate. We work by interfacing with various functional teams to test and pilot new hardware/software solutions. We are open to hiring candidates to work out of one of the following locations: Toronto, ON, CAN
US, WA, Seattle
Join us at the cutting edge of Amazon's sustainability initiatives to work on environmental and social advancements to support Amazon's long term worldwide sustainability strategy. At Amazon, we're working to be the most customer-centric company on earth. To get there, we need exceptionally talented, bright, and driven people. The Worldwide Sustainability (WWS) organization capitalizes on Amazon’s scale & speed to build a more resilient and sustainable company. We manage our social and environmental impacts globally, driving solutions that enable our customers, businesses, and the world around us to become more sustainable. Sustainability Science and Innovation (SSI) is a multi-disciplinary team within the WW Sustainability organization that combines science, analytics, economics, statistics, machine learning, product development, and engineering expertise. We use this expertise and skills to identify, develop and evaluate the science and innovations necessary for Amazon, customers and partners to meet their long-term sustainability goals and commitments. We are seeking a Principal Applied Scientist who is not just adept in the theoretical aspects of Machine Learning (ML), Artificial Intelligence (AI), and Large Language Models (LLMs) but also possesses a pragmatic, hands-on approach to navigating the complexities of innovation. You will take the lead in conceptualization, building, and launching innovative models and solutions that significantly drive material impacts for our long-term sustainability and climate goals. You'll be guided by problems and customer needs. You'll use strong technical judgment to determine appropriate approaches - custom pre-training models, fine-tuning trusted base models, leveraging retrieval-augmented generation (RAGs), or combining approaches. You'll collaborate with business leaders, scientists, and engineers to incorporate sustainability domain nuances when creating data foundations, developing AI models/applications, and applying techniques like data indexing, validation metrics, model distillation, and customized loss functions. You'll work across teams to embed AI/ML solutions and capabilities into existing sustainability data and systems. You'll define key AI sustainability research directions, adopt/invent new ML techniques, conduct rigorous experiments, publish results, and ensure research translates into practice. You'll develop long-term strategies, persuade teams, propose goals and deliver. If you see yourself as a hands-on technical leader and innovator at the intersection of AI, technology, and sustainability, we'd like to connect. You don't need to be an expert in sustainability and climate domains. Key job responsibilities - Creating web-scale sustainability-specific data foundations that align with our impact areas and sustainability goals; - Models to measure environmental and economic impacts at scale; - Automated solutions simplifying complex, labor-intensive ESG tasks; reasoning mechanisms for multi-view decarbonization plans and multi-objective optimization models; - Models to create, monitor, and quality assure high-integrity forest carbon credits. About the team Diverse Experiences: World Wide Sustainability (WWS) values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Inclusive Team Culture: It’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth: We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance: We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve. We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA | New York City, NY, USA | San Francisco, CA, USA | Seattle, WA, USA
US, CA, San Francisco
If you are interested in this position, please apply on Twitch's Career site https://www.twitch.tv/jobs/en/ About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate and grow their personal interests and passions. We're always live at Twitch. About the Position We are looking for applied scientists to solve challenging and open-ended problems in the domain of user and content safety. As an applied scientist on Twitch's Community team, you will use machine learning to develop data products tackling problems such as harassment, spam, and illegal content. You will use a wide toolbox of ML tools to handle multiple types of data, including user behavior, metadata, and user generated content such as text and video. You will collaborate with a team of passionate scientists and engineers to develop these models and put them into production, where they can help Twitch's creators and viewers succeed and build communities. You will report to an Applied Science Manager. This position will be located in San Francisco. You Will - Build machine learning products to protect Twitch and its users from abusive behavior such as harassment, spam, and violent or illegal content. - Work backwards from customer problems to develop the right solution for the job, whether a classical ML model or a state-of-the-art one. - Collaborate with Community Health's engineering and product management team to productionize your models into flexible data pipelines and ML-based services. - Continue to learn and experiment with new techniques in ML, software engineering, or safety so that we can better help communities on Twitch grow and stay safe. Perks - Medical, Dental, Vision & Disability Insurance - 401(k) - Maternity & Parental Leave - Flexible PTO - Amazon Employee Discount We are open to hiring candidates to work out of one of the following locations: San Francisco, CA, USA
US, CA, San Diego
The Private Brands team is looking for an Applied Scientist to join the team in building science solutions at scale. Our team applies Optimization, Machine Learning, Statistics, Causal Inference, and Econometrics/Economics to derive actionable insights. We are an interdisciplinary team of Scientists, Engineers, and Economists and primary focus on building optimization and machine learning solutions in supply chain domain with specific focus on Amazon private brand products. Key job responsibilities You will work with business leaders, scientists, and economists to translate business and functional requirements into concrete deliverables, including the design, development, testing, and deployment of highly scalable optimization solutions and ML models. This is a unique, high visibility opportunity for someone who wants to have business impact, dive deep into large-scale problems, enable measurable actions on the consumer economy, and work closely with scientists and economists. As a scientist, you bring business and industry context to science and technology decisions. You set the standard for scientific excellence and make decisions that affect the way we build and integrate algorithms. Your solutions are exemplary in terms of algorithm design, clarity, model structure, efficiency, and extensibility. You tackle intrinsically hard problems, acquiring expertise as needed. You decompose complex problems into straightforward solutions. We are particularly interested in candidates with experience in predictive and machine learning models and working with distributed systems. Academic and/or practical background in Machine Learning are particularly relevant for this position. Familiarity and experience in applying Operations Research techniques to supply chain problems is a plus. We are open to hiring candidates to work out of one of the following locations: San Diego, CA, USA | Seattle, WA, USA
US, CA, Sunnyvale
The Artificial General Intelligence (AGI) team is looking for a highly-skilled Applied Scientist, to support the development and implementation of cutting-edge algorithms and push the boundaries of efficient inference for Generative Artificial Intelligence (GenAI) models. As an Applied Scientist, you will play a critical role in driving the development of GenAI technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities - Design and execute experiments to evaluate the performance of different decoding algorithms and models, and iterate quickly to improve results - Develop deep learning models for compression, system optimization, and inference - Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems in GenAI We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA | Boston, MA, USA | New York, NY, USA | Sunnyvale, CA, USA
US, CA, Pasadena
The Amazon Web Services (AWS) Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. We are looking to hire a Research Scientist with fabrication and data analysis experience working on Josephson Junction elements of a superconducting circuit. The position is on-site at our lab, located on the in Pasadena, CA. The ideal candidate will have had prior experience deep diving into fabrication details and electrical test data. We are looking for candidates with strong engineering principles, resourcefulness and data science experience. Organization and communication skills are essential. Key job responsibilities * Deep dive into the physics and related data associated with Josephson Junctions or metal-insulator-metal fabrication processes. * Develop and maintain data pipeline pertinent to superconducting device fabrication, in particular Josephson Junctions or general transmon elements. * Develop analytical tools to uncover new information about established and new junction processes. * Generate both custom and standardized reports summarizing inline and end of line electrical and process data from product material runs. * Devise experiments and provide recommendations for improvement of fabrication processes. * Communicate findings with colleagues by way of crisp documentation and presentations. A day in the life The role will be vital to the fabrication team and quantum computing device integration mechanism. The candidate will provide the most current information to project leads and fabrication area owners to drive data driven decision of production runs. Once the fabrication run starts the candidate will stay close to the details of fabrication providing data analysis and quick feedback to key stakeholders. At the end of fabrication runs custom and standardized reports will be generated by the candidate to provide insights into data generated from the run. This position may require occasional weekend work. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Hybrid Work We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices. About the team Our team is comprised of scientists and engineers who are building hardware that enables quantum computing technologies. Doing that requires the fabrication of quantum devices, which necessitates staying close to the details and analyzing data while building tools to better understand the data. We are open to hiring candidates to work out of one of the following locations: Pasadena, CA, USA
US, CA, Sunnyvale
The Amazon Devices team designs and engineers high-profile consumer electronics, including the best-selling Kindle family of products. We have also produced groundbreaking devices like Fire tablets, Fire TV, Amazon Dash, and Amazon Echo. What will you help us create? Work hard. Have fun. Make history. If you are an innovative Applied Scientist, have a track record of delivering to timelines with high quality and are deeply technical, we want to talk to you. You will be closely integrated with the research and development team, both developing and optimizing features. You will work with other world-leading scientists to build and deliver the world's most scalable robotics systems, working together from ideation-to-production using tools such as Computer Vision Deep Learning instance segmentation, pose estimation, activity understanding), CV geometry, active learning and reinforcement learning. A successful candidate will have excellent technical ability, scientific vision, project management skills, great communication skills, and a motivation to achieve results in a collaborative team environment. We are open to hiring candidates to work out of one of the following locations: Sunnyvale, CA, USA
GB, London
Amazon Advertising is looking for a Senior Applied Scientist to join its brand new initiative that powers Amazon’s contextual advertising product. Advertising at Amazon is a fast-growing multi-billion dollar business that spans across desktop, mobile and connected devices; encompasses ads on Amazon and a vast network of hundreds of thousands of third party publishers; and extends across US, EU and an increasing number of international geographies. We are looking for a dynamic, innovative and accomplished Senior Applied Scientist to work on machine learning and data science initiatives for contextual data processing and classification that power our contextual advertising solutions. Are you excited by the prospect of analyzing terabytes of data and leveraging state-of-the-art data science and machine learning techniques to solve real world problems? Do you like to own business problems/metrics of high ambiguity where yo get to define the path forward for success of a new initiative? As an applied scientist, you will invent ML and Artificial General Intelligence based solutions to power our contextual classification technology. As this is a new initiative, you will get an opportunity to act as a thought leader, work backwards from the customer needs, dive deep into data to understand the issues, conceptualize and build algorithms and collaborate with multiple cross-functional teams. Key job responsibilities * Design, prototype and test many possible hypotheses in a high-ambiguity environment, making use of both analysis and business judgment. * Collaborate with software engineering teams to integrate successful experiments into large-scale, highly complex Amazon production systems. * Promote the culture of experimentation and applied science at Amazon. * Demonstrated ability to meet deadlines while managing multiple projects. * Excellent communication and presentation skills working with multiple peer groups and different levels of management * Influence and continuously improve a sustainable team culture that exemplifies Amazon’s leadership principles. About the team The Supply Quality organization has the charter to solve optimization problems for ad-programs in Amazon and ensure high-quality ad-impressions. We develop advanced algorithms and infrastructure systems to optimize performance for our advertisers and publishers. We are focused on solving a wide variety of problems in computational advertising like Contextual data processing and classification, traffic quality prediction (robot and fraud detection), Security forensics and research, Viewability prediction, Brand Safety and experimentation. Our team includes experts in the areas of distributed computing, machine learning, statistics, optimization, text mining, information theory and big data systems. We are open to hiring candidates to work out of one of the following locations: London, GBR
ES, M, Madrid
At Amazon, we are committed to being the Earth’s most customer-centric company. The International Technology group (InTech) owns the enhancement and delivery of Amazon’s cutting-edge engineering to all the varied customers and cultures of the world. We do this through a combination of partnerships with other Amazon technical teams and our own innovative new projects. You will be joining the Tools and Machine learning (Tamale) team. As part of InTech, Tamale strives to solve complex catalog quality problems using challenging machine learning and data analysis solutions. You will be exposed to cutting edge big data and machine learning technologies, along to all Amazon catalog technology stack, and you'll be part of a key effort to improve our customers experience by tackling and preventing defects in items in Amazon's catalog. We are looking for a passionate, talented, and inventive Scientist with a strong machine learning background to help build industry-leading machine learning solutions. We strongly value your hard work and obsession to solve complex problems on behalf of Amazon customers. Key job responsibilities We look for applied scientists who possess a wide variety of skills. As the successful applicant for this role, you will with work closely with your business partners to identify opportunities for innovation. You will apply machine learning solutions to automate manual processes, to scale existing systems and to improve catalog data quality, to name just a few. You will work with business leaders, scientists, and product managers to translate business and functional requirements into concrete deliverables, including the design, development, testing, and deployment of highly scalable distributed services. You will be part of team of 5 scientists and 13 engineers working on solving data quality issues at scale. You will be able to influence the scientific roadmap of the team, setting the standards for scientific excellence. You will be working with state-of-the-art models, including image to text, LLMs and GenAI. Your work will improve the experience of millions of daily customers using Amazon in Europe and in other regions. You will have the chance to have great customer impact and continue growing in one of the most innovative companies in the world. You will learn a huge amount - and have a lot of fun - in the process! This position will be based in Madrid, Spain We are open to hiring candidates to work out of one of the following locations: Madrid, M, ESP