Theodore Vaslioudis, a former intern and full-time Amazon scientist
Theodore Vaslioudis, a former intern and full-time Amazon scientist since February 2020, uses his experiences to help customers gain the greatest value from AWS resources, and his colleagues make the most of working remotely.

From intern to applied scientist: How Theodore Vasiloudis made the transition

The applied scientist offers advice on how he utilized his internship to land a full-time job — and talks about how he and his colleagues won an award along the way.

In the early days of purchase data analysis, a study determined that people often bought diapers and beer together. When Theodore Vasiloudis, then a computer science undergrad, heard that from a professor at the Aristotle University of Thessaloniki, he was intrigued by the correlation: “I found it fascinating that, by aggregating the data of multiple users, you could extract weird and unexpected things like this.”

That course inspired Vasiloudis, today an applied scientist with Amazon Web Services (AWS), to direct his education toward machine learning. He left Greece in 2012 to study at the KTH Royal Institute of Technology in Stockholm, Sweden, which at the time had one of the few master’s programs in Europe dedicated to machine learning. After finishing his thesis on context-aware recommendations, he pursued an industrial PhD while employed at the Swedish Institute of Computer Science (industrial PhD students develop their research projects while working at a company to gain industrial experience).

In the final years of his PhD, Vasiloudis completed two summer internships at Amazon. One of those resulted in the publication of an award-winning research paper, Block-distributed Gradient Boosted Trees. In that paper, Vasiloudis and his colleagues Hyunsu Cho and Henrik Boström described the development of a new algorithm that was able to drastically reduce the communication cost to train massive, sparse datasets.

A full-time Amazon scientist since February 2020, Vasiloudis now uses his experiences to help customers make the best of AWS resources and his colleagues make the most of working remotely. He has even introduced to his team the custom of fika, the Swedish habit of pausing for a cup of coffee in the middle of the day. Each Friday, he and his teammates congregate over a remote coffee break at 3 p.m., which has helped sustain the team’s spirit during the pandemic. We asked Vasiloudis about his internship, what it was like to make the transition to full-time employee, and more.

Q. What made you interested in working at Amazon, and how was your experience as an intern?

With Amazon, you have the opportunity to reach hundreds of millions of people with your work. You can make changes that affect the everyday lives of such a large population. Also, because of the number of Amazon users, you are forced to design algorithms that can actually analyze massive amounts of data. So that's a very interesting challenge for me, to be able to create scalable algorithms that work regardless of the size of the data set.

For my first internship, I worked with Alexa Shopping and we looked into ways to generate realistic data sets to improve the customer’s experience. The second internship was with AWS, where my manager was Vineet Khare, then an applied science manager. There, I worked on how to get gradient-boosted trees to work with massive data sets that contain millions and billions of records, but also millions and billions of features. From that work, in close collaboration with my mentor Hyunsu Cho, we wrote the paper that won the best short paper award at SIGIR 2019.

These were both good experiences, because I got to work on interesting problems. And most importantly, I got to work with great colleagues. We had multiple interns within the team, and that meant that you could share the experience of being a science intern with other PhD students, and support each other through the internship. My full-time colleagues were also very helpful and fun to hang out with outside work as well. So I had a good time, and that's the main reason why I chose to return to Amazon for the full-time role.

One of the things that I definitely learned during my internships was the importance of writing high-quality code.  A common problem when you're writing research code is that you kind of go along without ensuring that everything works in a formal way. Whereas when writing code for a company, you need to prove and ensure that your code will always work regardless of the circumstances. And this is one of the Amazon leadership principles: That we have to insist on the highest standards.

Theodore Vasiloudis poses with the publication that won the best short paper award at SIGIR 2019.
Theodore Vasiloudis poses with the publication that won the best short paper award at SIGIR 2019.

Q. What set apart the paper that won at SIGIR 2019?

Gradient-boosted trees are designed to deal with very large data sets and are one of the most popular machine learning algorithms, widely used in both academia and industry. However, whenever we deal with very large data sets, often we have to use multiple computers.

Imagine you're trying to classify, for example, text. Let's say that this text is somebody’s loan application. If every possible word in this text is a feature, that means there can be millions of features because the vocabulary is practically limitless. So, when you try to share the model training among multiple computers — which can be a hundred, a thousand, or even more — you will very often run into problems because they are all competing for a tiny amount of bandwidth compared to the data set.

Previous systems were not efficient at communicating because they were wasting a lot of bandwidth with redundant information. Many real-world data sets are very sparse. In sparse data sets, most of the features are actually zeroes. Previous systems were still sending those over the network, and they were consuming a lot of unnecessary bandwidth. Whereas if you only send the non-zeroes over the network, then you're actually saving communication costs and bandwidth. That’s the main idea.

Q. How did you go about trying to find a solution for those sparse data sets?

We had two issues to solve: One regarding prediction and another regarding training. You can imagine a data set as a matrix. It has a bunch of rows, which are the records — for example, the loan application documents. And then each of those will have a number of features, which are the words in the document. So, you can have millions of documents, and millions of features as well. In previous systems, they would only partition the data set along the record dimension. They would take a few documents and put them in one computer, a few in another and then do the training and sync.

But if you want to really speed up the process, you can actually take part of a document and store it in one computer and another part in another computer. This is called block distribution. Instead of taking multiple rows from the same matrix, and storing them in the same computer, now we start taking a block — a few rows and a few columns — from that matrix and put it in one computer. That means that we have some additional communication to do to make predictions.

We used an existing algorithm for that called Quickscorer, which was designed for a completely different purpose, to speed up the prediction process locally. But that exact same approach can allow you to perform a very quick distributed prediction, and we modified that algorithm to adapt our use case. So that's how we solved that prediction issue. And then for the training, we did something similar, where we would only send for a given block the number of records that are necessary with a number of features, and then we would use an aggregation step in order to complete the training.

I think this work provides a good direction for future production systems. The communication pattern for very large data sets should be more flexible than the one that is currently used.

Q. What are you currently working on?

I'm working on SageMaker JumpStart. We create AWS solutions that allow customers to get started with SageMaker faster, and take their ideas to production more quickly and painlessly.  One part of my team’s responsibilities is to work directly with customers when they have a specific problem. But we also do a lot of innovating on the behalf of our customers.

Q. You started your full-time role right at the beginning of the pandemic. Did that affect your work in any way?

We stopped going to the office and started using a digital form of communication. In trying to keep the team spirit alive, one of the things that I try to have in our team is something that we used to have in Sweden, which is called the fika. It’s like a coffee break where you stop working for half an hour and chat with your colleagues about anything you want. It’s just some social time where everybody can relax and interact with colleagues.

If you have the opportunity to work at a company like Amazon, you should definitely take it, because you can gain a lot of experience that is impossible to gain during your PhD.
Theodore Vasiloudis

I saw that, with COVID-19, the interaction with colleagues goes down significantly, so it's good to have some time allocated in your calendar when you don’t have to work, just have some coffee and chat. An informal conversation is when a lot of important ideas come up, and it’s good to have that opportunity.

Q. What advice would you give to people considering following your footsteps?

If you have the opportunity to work at a company like Amazon, you should definitely take it, because you can gain a lot of experience that is impossible to gain during your PhD. The way that the industry works is very different from the way academia works. If you have done a couple of these internships, you're much more prepared to join the workforce.

For interns at Amazon hoping to migrate into a full-time job, I would say that the regular check-ins with your hiring manager are very important, because you need to be constantly aware whether you're on track for your full-time offer. Every second week you get to sit with your hiring manager, and you can check with them if you should be doing something more, if you're hitting your targets in terms of the progress of the work itself, and in terms of representing the leadership principles of Amazon in your work. And that gives you a better sense of accomplishment. You need to make sure that you set a few milestones in the meantime and make sure that you hit them as you progress through your internship.

Q. Any final tips on how to make the best out of your internship at Amazon?

How to become an intern at Amazon

If you’re a student with interest in an Amazon internship, you can find additional information here, and submit your details for review. Students can also learn more about internship opportunities at Amazon Student Programs.

Amazon values being independent and self-driven. And it's very good if you have a goal to publish a paper by the end of your internship and chase that publication. For example, we completed the writing of our paper after I had finished my internship, so if I hadn't pushed for that, I wouldn't have published this paper, and my co-authors and I wouldn't have gotten this award.

It's important to be motivated to work with your manager to make sure that you get all the necessary approvals before you finish your internship toward publishing the paper, because it's an important step for a career as a scientist, as well as for a PhD student, to publish high-quality papers. And it's a unique opportunity to do that when you have access to the infrastructure and data sets of Amazon.

Research areas

Related content

US, CA, San Francisco
Are you interested in a unique opportunity to advance the accuracy and efficiency of Artificial General Intelligence (AGI) systems? If so, you're at the right place! We are the AGI Autonomy organization, and we are looking for a driven and talented Member of Technical Staff to join us to build state-of-the art agents. As an MTS on our team, you will design, build, and maintain a Spark-based infrastructure to process and manage large datasets critical for machine learning research. You’ll work closely with our researchers to develop data workflows and tools that streamline the preparation and analysis of massive multimodal datasets, ensuring efficiency and scalability. We operate at Amazon's large scale with the energy of a nimble start-up. If you have a learner's mindset, enjoy solving challenging problems and value an inclusive and collaborative team culture, you will thrive in this role, and we hope to hear from you. Key job responsibilities * Develop and maintain reliable infrastructure to enable large-scale data extraction and transformation. * Work closely with researchers to create tooling for emerging data-related needs. * Manage project prioritization, deliverables, timelines, and stakeholder communication. * Illuminate trade-offs, educate the team on best practices, and influence technical strategy. * Operate in a dynamic environment to deliver high quality software.
IN, KA, Bangalore
Have you ever ordered a product on Amazon and when that box with the smile arrived you wondered how it got to you so fast? Have you wondered where it came from and how much it cost Amazon to deliver it to you? If so, the WW Amazon Logistics, Business Analytics team is for you. We manage the delivery of tens of millions of products every week to Amazon’s customers, achieving on-time delivery in a cost-effective manner. We are looking for an enthusiastic, customer obsessed, Applied Scientist with good analytical skills to help manage projects and operations, implement scheduling solutions, improve metrics, and develop scalable processes and tools. The primary role of an Operations Research Scientist within Amazon is to address business challenges through building a compelling case, and using data to influence change across the organization. This individual will be given responsibility on their first day to own those business challenges and the autonomy to think strategically and make data driven decisions. Decisions and tools made in this role will have significant impact to the customer experience, as it will have a major impact on how the final phase of delivery is done at Amazon. Candidates will be a high potential, strategic and analytic graduate with a PhD in (Operations Research, Statistics, Engineering, and Supply Chain) ready for challenging opportunities in the core of our world class operations space. Great candidates have a history of operations research, and the ability to use data and research to make changes. This role requires robust program management skills and research science skills in order to act on research outcomes. This individual will need to be able to work with a team, but also be comfortable making decisions independently, in what is often times an ambiguous environment. Responsibilities may include: - Develop input and assumptions based preexisting models to estimate the costs and savings opportunities associated with varying levels of network growth and operations - Creating metrics to measure business performance, identify root causes and trends, and prescribe action plans - Managing multiple projects simultaneously - Working with technology teams and product managers to develop new tools and systems to support the growth of the business - Communicating with and supporting various internal stakeholders and external audiences
US, NY, New York
Amazon is investing heavily in building a world class advertising business and we are responsible for defining and delivering a collection of self-service performance advertising products that drive discovery and sales. Our products are strategically important to our Retail and Marketplace businesses driving long term growth. We deliver billions of ad impressions and millions of clicks daily and are breaking fresh ground to create world-class products. We are highly motivated, collaborative and fun-loving with an entrepreneurial spirit and bias for action. With a broad mandate to experiment and innovate, we are growing at an unprecedented rate with a seemingly endless range of new opportunities. The Ad Response Prediction team in the Sponsored Products organization builds GenAI-based shopper understanding and audience targeting systems, along with advanced deep-learning models for Click-through Rate (CTR) and Conversion Rate (CVR) predictions. We develop large-scale machine-learning (ML) pipelines and real-time serving infrastructure to match shoppers' intent with relevant ads across all devices, contexts, and marketplaces. Through precise estimation of shoppers' interactions with ads and their long-term value, we aim to drive optimal ad allocation and pricing, helping to deliver a relevant, engaging, and delightful advertising experience to Amazon shoppers. As our business grows and we undertake increasingly complex initiatives, we are looking for entrepreneurial, and self-driven science leaders to join our team. Key job responsibilities As a Principal Applied Scientist in the team, you will: * Seek to understand in depth the Sponsored Products offering at Amazon and identify areas of opportunities to grow our business via principled ML solutions. * Mentor and guide the applied scientists in our organization and hold us to a high standard of technical rigor and excellence in ML. * Design and lead organization wide ML roadmaps to help our Amazon shoppers have a delightful shopping experience while creating long term value for our sellers. * Work with our engineering partners and draw upon your experience to meet latency and other system constraints. * Identify untapped, high-risk technical and scientific directions, and simulate new research directions that you will drive to completion and deliver. * Be responsible for communicating our ML innovations to the broader internal & external scientific community.
CA, BC, Vancouver
Do you want a role with deep meaning and the ability to make a major impact? As part of Intelligent Talent Acquisition (ITA), you'll have the opportunity to reinvent the hiring process and deliver unprecedented scale, sophistication, and accuracy for Amazon Talent Acquisition operations. ITA is an industry-leading people science and technology organization made up of scientists, engineers, analysts, product professionals and more, all with the shared goal of connecting the right people to the right jobs in a way that is fair and precise. Last year we delivered over 6 million online candidate assessments, and helped Amazon deliver billions of packages around the world by making it possible to hire hundreds of thousands of workers in the right quantity, at the right location and at exactly the right time. You’ll work on state-of-the-art research, advanced software tools, new AI systems, and machine learning algorithms, leveraging Amazon's in-house tech stack to bring innovative solutions to life. Join ITA in using technologies to transform the hiring landscape and make a meaningful difference in people's lives. Together, we can solve the world's toughest hiring problems. Global Hiring Science owns and develops products and services using Artificial Intelligence and Machine Learning (ML) that enhance recruitment. We collaborate with scientists to build and maintain machine learning solutions for hiring, offering opportunities to both apply and develop ML engineering skills in a production environment. Key job responsibilities • Design and implement advanced AI models using the latest LLM and GenAI technologies to develop fair and accurate machine learning models for hiring. • Clearly and cogently present your work and ideas, and respond effectively to feedback. • Collaborate with cross-functional teams with Research Scientists and Software Engineers to integrate AI-driven products into Amazon’s hiring process. • Stay at the advance of AI research, continuously exploring and implementing new techniques in NLP, LLMs, and GenAI to drive innovation in hiring. • Implement advanced natural language processing models to extract insights from diverse data sources. • Ensure effective teamwork, communication, collaboration, and commitment across multiple teams with competing priorities. • Contribute to the scientific community through publications, presentations, and collaborations with academic institutions. About the team The mission of Global Hiring Science (GHS) is to improve both the efficiency and effectiveness of hiring across Amazon with assessments and interview improvements. We are a team of experts in machine learning, industrial-organizational psychology, data science, and measuring the knowledge, skills, and abilities that it takes to be successful at Amazon.
US, CA, San Francisco
Amazon has launched a new research lab in San Francisco to develop foundational capabilities for useful AI agents. We’re enabling practical AI to make our customers more productive, empowered, and fulfilled. In particular, our work combines large language models (LLMs) with reinforcement learning (RL) to solve reasoning, planning, and world modeling in both virtual and physical environments. Our research builds on that of Amazon’s broader AGI organization, which recently introduced Amazon Nova, a new generation of state-of-the-art foundation models (FMs). Our lab is a small, talent-dense team with the resources and scale of Amazon. Each team in the lab has the autonomy to move fast and the long-term commitment to pursue high-risk, high-payoff research. We’re entering an exciting new era where agents can redefine what AI makes possible. We’d love for you to join our lab and build it from the ground up! Key job responsibilities You will contribute directly to AI agent development in an applied research role, including model training, dataset design, and pre- and post-training optimization. You will be hired as a Member of Technical Staff.
US, WA, Seattle
PXTCS is looking for an economist who can apply economic methods to address business problems. The ideal candidate will work with engineers and computer scientists to estimate models and algorithms on large scale data, design pilots and measure impact, and transform successful prototypes into improved policies and programs at scale. PXTCS is looking for creative thinkers who can combine a strong technical economic toolbox with a desire to learn from other disciplines, and who know how to execute and deliver on big ideas as part of an interdisciplinary technical team. Ideal candidates will work in a team setting with individuals from diverse disciplines and backgrounds. They will work with teammates to develop scientific models and conduct the data analysis, modeling, and experimentation that is necessary for estimating and validating models. They will work closely with engineering teams to develop scalable data resources to support rapid insights, and take successful models and findings into production as new products and services. They will be customer-centric and will communicate scientific approaches and findings to business leaders, listening to and incorporate their feedback, and delivering successful scientific solutions. A day in the life The Economist will work with teammates to apply economic methods to business problems. This might include identifying the appropriate research questions, writing code to implement a DID analysis or estimate a structural model, or writing and presenting a document with findings to business leaders. Our economists also collaborate with partner teams throughout the process, from understanding their challenges, to developing a research agenda that will address those challenges, to help them implement solutions. About the team The People eXperience and Technology Central Science (PXTCS) team uses economics, behavioral science, statistics, and machine learning to proactively identify mechanisms and process improvements which simultaneously improve Amazon and the lives, wellbeing, and the value of work to Amazonians. PXTCS is an interdisciplinary team that combines the talents of science and engineering to develop and deliver solutions that measurably achieve this goal.
US, CA, San Francisco
The Amazon General Intelligence “AGI” organization is looking for an Executive Assistant to support leaders of our Autonomy Team in our growing AI Lab space located in San Francisco. This role is ideal for exceptionally talented, dependable, customer-obsessed, and self-motivated individuals eager to work in a fast paced, exciting and growing team. This role serves as a strategic business partner, managing complex executive operations across the AGI organization. The position requires superior attention to detail, ability to meet tight deadlines, excellent organizational skills, and juggling multiple critical requests while proactively anticipating needs and driving improvements. High integrity, discretion with confidential information, and professionalism are essential. The successful candidate will complete complex tasks and projects quickly with minimal guidance, react with appropriate urgency, and take effective action while navigating ambiguity. Flexibility to change direction at a moment's notice is critical for success in this role. Key job responsibilities - Serve as strategic partner to senior leadership, identifying opportunities to improve organizational effectiveness and drive operational excellence - Manage complex calendars and scheduling for multiple executives - Drive continuous improvement through process optimization and new mechanisms - Coordinate team activities including staff meetings, offsites, and events - Schedule and manage cost-effective travel - Attend key meetings, track deliverables, and ensure timely follow-up - Create expense reports and manage budget tracking - Serve as liaison between executives and internal/external stakeholders - Build collaborative relationships with Executive Assistants across the company and with critical external partners - Help us build a great team culture in the SF Lab!
US, CA, San Francisco
Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers to push the boundaries of what's possible in robotic intelligence. As an Applied Scientist, you'll be at the forefront of developing breakthrough foundation models that enable robots to perceive, understand, and interact with the world in unprecedented ways. You'll drive independent research initiatives in areas such as perception, manipulation, science understanding, locomotion, manipulation, sim2real transfer, multi-modal foundation models and multi-task robot learning, designing novel frameworks that bridge the gap between state-of-the-art research and real-world deployment at Amazon scale. In this role, you'll balance innovative technical exploration with practical implementation, collaborating with platform teams to ensure your models and algorithms perform robustly in dynamic real-world environments. You'll have access to Amazon's vast computational resources, enabling you to tackle ambitious problems in areas like very large multi-modal robotic foundation models and efficient, promptable model architectures that can scale across diverse robotic applications. Key job responsibilities - Drive independent research initiatives across the robotics stack, including robotics foundation models, focusing on breakthrough approaches in perception, and manipulation, for example open-vocabulary panoptic scene understanding, scaling up multi-modal LLMs, sim2real/real2sim techniques, end-to-end vision-language-action models, efficient model inference, video tokenization - Design and implement novel deep learning architectures that push the boundaries of what robots can understand and accomplish - Lead full-stack robotics projects from conceptualization through deployment, taking a system-level approach that integrates hardware considerations with algorithmic development, ensuring robust performance in production environments - Collaborate with platform and hardware teams to ensure seamless integration across the entire robotics stack, optimizing and scaling models for real-world applications - Contribute to the team's technical strategy and help shape our approach to next-generation robotics challenges A day in the life - Design and implement novel foundation model architectures and innovative systems and algorithms, leveraging our extensive infrastructure to prototype and evaluate at scale - Collaborate with our world-class research team to solve complex technical challenges - Lead technical initiatives from conception to deployment, working closely with robotics engineers to integrate your solutions into production systems - Participate in technical discussions and brainstorming sessions with team leaders and fellow scientists - Leverage our massive compute cluster and extensive robotics infrastructure to rapidly prototype and validate new ideas - Transform theoretical insights into practical solutions that can handle the complexities of real-world robotics applications About the team At Frontier AI & Robotics, we're not just advancing robotics – we're reimagining it from the ground up. Our team is building the future of intelligent robotics through innovative foundation models and end-to-end learned systems. We tackle some of the most challenging problems in AI and robotics, from developing sophisticated perception systems to creating adaptive manipulation strategies that work in complex, real-world scenarios. What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich real-world datasets to train and deploy state-of-the-art foundation models. Our work spans the full spectrum of robotics intelligence – from multimodal perception using images, videos, and sensor data, to sophisticated manipulation strategies that can handle diverse real-world scenarios. We're building systems that don't just work in the lab, but scale to meet the demands of Amazon's global operations. Join us if you're excited about pushing the boundaries of what's possible in robotics, working with world-class researchers, and seeing your innovations deployed at unprecedented scale.
US, CA, San Francisco
Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers to push the boundaries of what's possible in robotic intelligence. As a Senior Applied Scientist, you'll spearhead the development of breakthrough foundation models and full-stack robotics systems that enable robots to perceive, understand, and interact with the world in unprecedented ways. You'll drive technical excellence in areas such as perception, manipulation, science understanding, locomotion, manipulation, sim2real transfer, multi-modal foundation models and multi-task robot learning, designing novel frameworks that bridge the gap between state-of-the-art research and real-world deployment at Amazon scale. In this role, you'll combine hands-on technical work with scientific leadership, ensuring your team delivers robust solutions for dynamic real-world environments. You'll leverage Amazon's vast computational resources to tackle ambitious problems in areas like very large multi-modal robotic foundation models and efficient, promptable model architectures that can scale across diverse robotic applications. Key job responsibilities - Lead technical initiatives across the robotics stack, driving breakthrough approaches through hands-on research and development in areas including robotics foundation models, focusing on breakthrough approaches in perception, and manipulation, for example open-vocabulary panoptic scene understanding, scaling up multi-modal LLMs, sim2real/real2sim techniques, end-to-end vision-language-action models, efficient model inference, video tokenization - Guide technical direction for full-stack robotics projects from conceptualization through deployment, taking a system-level approach that integrates hardware considerations with algorithmic development, ensuring robust performance in production environments - Mentor fellow scientists while maintaining strong individual technical contributions - Collaborate with platform and hardware teams to ensure seamless integration across the entire robotics stack - Influence technical decisions and implementation strategies within your area of focus A day in the life - Design and implement novel foundation model architectures and innovative systems and algorithms, leveraging our extensive infrastructure to prototype and evaluate at scale - Guide fellow scientists in solving complex technical challenges across the full robotics stack - Lead focused technical initiatives from conception through deployment, ensuring successful integration with production systems - Drive technical discussions within your team and with key stakeholders - Conduct experiments and prototype new ideas using our massive compute cluster and extensive robotics infrastructure - Mentor team members while maintaining significant hands-on contribution to technical solutions About the team At Frontier AI & Robotics, we're not just advancing robotics – we're reimagining it from the ground up. Our team is building the future of intelligent robotics through innovative foundation models and end-to-end learned systems. We tackle some of the most challenging problems in AI and robotics, from developing sophisticated perception systems to creating adaptive manipulation strategies that work in complex, real-world scenarios. What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich real-world datasets to train and deploy state-of-the-art foundation models. Our work spans the full spectrum of robotics intelligence – from multimodal perception using images, videos, and sensor data, to sophisticated manipulation strategies that can handle diverse real-world scenarios. We're building systems that don't just work in the lab, but scale to meet the demands of Amazon's global operations. Join us if you're excited about pushing the boundaries of what's possible in robotics, working with world-class researchers, and seeing your innovations deployed at unprecedented scale.
US, CA, San Francisco
Amazon AGI Autonomy develops foundational capabilities for useful AI agents. We are the research lab behind Amazon Nova Act, a state-of-the-art computer-use agent. Our work combines Large Language Models (LLMs) with Reinforcement Learning (RL) to solve reasoning, planning, and world modeling in the virtual world. We are a small, talent-dense team with the autonomy to move fast and the long-term commitment to pursue high-risk, high-payoff research. Come be a part of our journey! --- About the team We’re looking for a generalist software engineer to build and evolve our internal data platform. The team builds data-intensive services that ingest, process, store, and distribute multi-modal training data across multiple internal and external sources. This work emphasizes data integrity, reliability, and extensibility in support of large-scale training and experimentation workloads. The team also builds and maintains APIs and SDKs that enable product engineers and researchers to build on top of the platform. As research directions change, so does our data, and today the team is focused on hardening the platform to reliably deliver an evolving set of data schemas, sources, and modalities. By building strong foundations and durable abstractions, we aim to enable new kinds of tooling and workflows over time. The team will play a key role in shaping them as the research evolves. --- Key job responsibilities * Build and operate reliable, performant backend and data platform services that support continuous ingestion and use of multi-modal training data. * Identify and implement opportunities to accelerate data generation, validation, and usage across training and evaluation workflows from multiple internal and external sources. * Partner closely with Human Feedback, Data Generation, Product Engineering, and Research teams to evolve and scale the data platform, APIs, and SDKs. * Own projects end to end, from technical design and implementation through deployment, observability, and long-term maintainability. * Write clear technical documentation and communicate design decisions and tradeoffs to stakeholders across multiple teams. * Raise the team’s technical aptitude through thoughtful code reviews, knowledge sharing, and mentorship.