Human Trafficking
The International Labor Organization estimates that today, more than 24.9 million people around the world are victims of human trafficking.
Credit: kgtoh

How Marinus Analytics uses knowledge graphs powered by Amazon Neptune to combat human trafficking

Traffic Jam leverages machine learning technologies from Amazon Web Services to find patterns in ads posted by sexual traffickers on the internet every day.

The International Labor Organization estimates that today, more than 24.9 million people around the world are victims of human trafficking. Nearly 20% of these victims are sexually exploited.

According to the U.S. State Department 2019 Trafficking in Persons Report, 7,481 traffickers were convicted worldwide in 2018. These numbers may appear low, but they represent an increase of 68% from 2014.

Organizations like Marinus Analytics that leverage the power of machine learning to analyze patterns in the advertisements offering sexual services on the internet are helping increase the number of convictions by providing actionable insights to law enforcement organizations.

Emily Kennedy started working on the idea that would eventually become Marinus Analytics when she was an undergraduate student at Pittsburgh’s Carnegie Mellon University (CMU). Kennedy decided to fight the scourge of human trafficking after a trip to Eastern Europe as a teenager, where she came across orphans believed to be controlled by the Russian mafia begging on the streets.

Marinus Analytics Leaders
Emily Kennedy (l), and Cara Jones are the co-founders of Marinus Analytics. The company focuses on how AI can turn big data online into actionable intelligence.
Credit: Marinus Analytics

Kennedy wanted to leverage the power of big data to help rescue victims of human trafficking. She pitched her idea to researchers at CMU’s machine learning- focused Auton Lab, who were intrigued by Kennedy’s vision. At the Auton Lab, Kennedy connected with researcher and engineer Cara Jones to make the then nascent Traffic Jam product operational.

Traffic Jam leverages machine learning technologies from Amazon Web Services to find patterns in the 300,000 plus ads, many of which are posted by sexual traffickers on the internet every day. Viswanathan’s team at AWS conducted a deep dive exploration of Traffic Jam’s data to arrive at the optimal for storage of crawled ad networks’ data in Amazon Neptune. The team also developed a knowledge graph to capture the information found in online classifieds websites, uncover underlying patterns, surface insights to investigators, and bring criminals to justice.

Today, law enforcement officials use Traffic Jam to find victims of human trafficking and dismantle organized crime rings. In 2019 alone, Traffic Jam was used to identify and rescue an estimated 3,800 victims of sex trafficking.

Small needles in especially large haystacks

Prem Viswanathan is a data scientist with AWS Professional Services, and also an adjunct professor at CMU. At CMU he had met Emily Kennedy, during one of her guest lectures, when she was working on Traffic Jam. Today, at AWS Professional Services, Viswanathan is helping organizations like Marinus Analytics harness the power of machine learning to meet their objectives.

“Identifying an ad posted by an organized crime network is challenging,” Viswanathan says. “First, most of the ads posted on the Internet don’t have structured data. To analyze information effectively, it is necessary to sift through the text of every ad to pull out relevant information like the location, date of posting, images, social media handles and other pertinent information.”

To complicate matters even more, there are millions of ads offering sexual services posted on the internet every day. A majority of these ads are placed by people who are offering these services on their own accord. Traffic Jam is focused on finding victims of human trafficking who are forced into the trade against their will.

Traffic Jam uses knowledge graphs to accomplish this objective. Knowledge graphs comprise entities or nodes. Nodes are distinct entities that hold a piece of information. For example, in Traffic Jam, each ad is represented as a distinct node, as are other criteria such as the ad location, phone number, and the month in which the ad was posted.

Traffic Jam know
Traffic Jam utilizes knowledge graphs to help find human traffickers. The knowledge graph for human trafficking contains more than 1 billion edges connecting ads, phone numbers, images, and other entities.
Credit: Marinus Analytics

Knowledge graphs also store the relationships among these different nodes. They do this in the form of edges. With the rapidly growing number of ads added to the internet every day, the knowledge graph utilized by Traffic Jam contains more than a billion edges connecting ads, phone numbers, images and other entities.

“Traffic Jam sifts through the information contained in these large number of nodes to uncover suspicious patterns,” says Viswanathan. “Consider an example of two ads that have different images, and posted from different locations, but share the same phone number. If you combine text indicators of potential human trafficking to these signals, you arrive at a movement pattern that analysts might identify as problematic, and surface to law enforcement for further review.”

AWS also developed a custom user interface using ReactJS and D3. The user interface enables investigators to visualize the patterns. The knowledge graph-based setup also enables investigators to query up to four times more information than previously feasible, while performing their analysis. This allows them to find prior ads more easily, where a member of a human trafficking network might have used a real phone number or revealed other identifying information.

Deep Graph Learning – an area ripe for innovation

George Karypis is a professor within the Department of Computer Science & Engineering at the University of Minnesota. In the course of his career, Karypis has focused on a variety of areas related to big data including data mining, recommender systems, and high-performance computing. Karypis joined Amazon in 2019 as an Amazon Scholar—a select group of academic professionals that work on large-scale technical challenges while continuing to teach and conduct research at their universities. "The opportunity to help organizations like Marinus Analytics to harness the power of big data, and have a real-world impact is deeply meaningful to me," Karypis said.

George Karypis
Amazon Scholar George Karypis is a professor at the University of Minnesota.

At Amazon, Karypis’ team is focused on unlocking innovations that drive efficient and scalable deep learning on knowledge graphs. The team has been responsible for developing the Deep Graph Library (DGL), an easy-to-use, high performance and scalable Python package for deep learning on graphs. DGL is a framework that allows developers to program a class of machine learning models called graph neural networks (GNN). DGL supplements existing tensor-based frameworks such as Tensorflow, PyTorch, and MXNet to support the growing area of deep graph learning.

The adoption of GNNs has exploded in recent years, as data scientists move beyond developing deep learning models for 2D signals (such as images) and 3D signals (such as video) to learning from structured, related data embedded in graphs.

Today, GNNs are used in a number of fields. For example, they play an increasingly important role in social networks, where graphs show connections among related people. At Amazon, they are used to develop recommender systems, build mechanisms for fraud and abuse detection and develop Alexa chatbots among other applications.

Organizations like Marinus Analytics use GNNs to contrast information between different nodes, and surface interesting insights, such as whether a particular ad has characteristics common with ads posted by organized crime rings.

For Karypis, GNNs represent one of the most exciting areas in the world of machine learning. More specifically, he believes there are three areas in the world of deep graph learning that are particularly ripe for innovation.

“At the most basic level, there are multiple experiments that are trying to determine the best way to express machine learning models in deep graph learning,” says Karypis. “What are the right models? What are the most appropriate abstractions?”

The integration with Amazon Neptune has been a game changer for Traffic Jam
Cara Jones, CEO, Marinus Analytics

The second challenge pertains to the training of these models. GNN training requires irregular memory accesses. In addition, the training involves fewer operations for each word of memory that it accesses and is computationally demanding. Moreover, knowledge graphs such as the one used by Traffic Jam have billions of data points. “In order to realize the benefits afforded by GNNs, it is critical to develop efficient and scalable distributed GNN training approaches for large graphs,” says Karypis.

Finally, Karypis and his team are intrigued by the most effective ways to compute knowledge graph embeddings. This involves embedding both the entities of a graph and underlying relations in a vector form in a d-dimensional space. For Traffic Jam, representing nodes and their relations in a vector form is what enables the comparison of different ad networks, each of which is represented as a sub-graph.

“Language modelling is a very well understood problem, as are various facets related to computer vision,” he says. “However, it’s still early days when it comes to GNNs, and I’m excited to be at AWS where a lot of the innovation is happening.”

Traffic Jam’s new offerings that use Amazon Neptune and advanced ML techniques to track different ad networks and analyze their likelihood of belonging to an existing crime group is currently in beta. The new features are expected to be made generally available to users soon.

“The integration with Amazon Neptune has been a game changer for Traffic Jam,” says Cara Jones, CEO and co-founder of Marinus Analytics. “Using the knowledge graph and associated sub-graphs, we are now able to capture four times as much information as previously possible. More importantly, we are able to analyze data and identify potential crime groups in real-time, even as new information comes in.”

Research areas

Related content

US, WA, Seattle
The Amazon Economics Team is hiring Economist Interns. We are looking for detail-oriented, organized, and responsible individuals who are eager to learn how to work with large and complicated data sets to solve real-world business problems. Some knowledge of econometrics, as well as basic familiarity with Stata, R, or Python is necessary. Experience with SQL, UNIX, Sawtooth, and Spark would be a plus. These are full-time positions at 40 hours per week, with compensation being awarded on an hourly basis. You will learn how to build data sets and perform applied econometric analysis at Internet speed collaborating with economists, data scientists and MBAʼs. These skills will translate well into writing applied chapters in your dissertation and provide you with work experience that may help you with future job market placement. Roughly 85% of interns from previous cohorts have converted to full-time economics employment at Amazon. If you are interested, please send your CV to our mailing list at econ-internship@amazon.com. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
GB, Cambridge
Our team undertakes research together with multiple organizations to advance the state-of-the-art in speech technologies. We not only work on giving Alexa, the ground-breaking service that powers Echo, her voice, but we also develop cutting-edge technologies with Amazon Studios, the provider of original content for Prime Video. Do you want to be part of the team developing the latest technology that impacts the customer experience of ground-breaking products? Then come join us and make history. We are looking for a passionate, talented, and inventive Senior Applied Scientist with a background in Machine Learning to help build industry-leading Speech, Language and Video technology. As a Senior Applied Scientist at Amazon you will work with talented peers to develop novel algorithms and modelling techniques to drive the state of the art in speech and vocal arts synthesis. Position Responsibilities: - Participate in the design, development, evaluation, deployment and updating of data-driven models for digital vocal arts applications. - Participate in research activities including the application and evaluation and digital vocal and video arts techniques for novel applications. - Research and implement novel ML and statistical approaches to add value to the business. - Mentor junior engineers and scientists. We are open to hiring candidates to work out of one of the following locations: Cambridge, GBR
US, VA, Arlington
The People eXperience and Technology Central Science Team (PXTCS) uses economics, behavioral science, statistics, and machine learning to proactively identify mechanisms and process improvements which simultaneously improve Amazon and the lives, wellbeing, and the value of work to Amazonians. We are an interdisciplinary team that combines the talents of science and engineering to develop and deliver solutions that measurably achieve this goal. We are looking for economists who are able to apply economic methods to address business problems. The ideal candidate will work with engineers and computer scientists to estimate models and algorithms on large scale data, design pilots and measure their impact, and transform successful prototypes into improved policies and programs at scale. We are looking for creative thinkers who can combine a strong technical economic toolbox with a desire to learn from other disciplines, and who know how to execute and deliver on big ideas as part of an interdisciplinary technical team. Ideal candidates will work in a team setting with individuals from diverse disciplines and backgrounds. They will work with teammates to develop scientific models and conduct the data analysis, modeling, and experimentation that is necessary for estimating and validating models. They will work closely with engineering teams to develop scalable data resources to support rapid insights, and take successful models and findings into production as new products and services. They will be customer-centric and will communicate scientific approaches and findings to business leaders, listening to and incorporate their feedback, and delivering successful scientific solutions. Key job responsibilities Use reduced-form causal analysis and/or structural economic modeling methods to evaluate the impact of policies on employee outcomes, and examine how external labor market and economic conditions impact Amazon's ability to hire and retain talent. A day in the life Work with teammates to apply economic methods to business problems. This might include identifying the appropriate research questions, writing code to implement a DID analysis or estimate a structural model, or writing and presenting a document with findings to business leaders. Our economists also collaborate with partner teams throughout the process, from understanding their challenges, to developing a research agenda that will address those challenges, to help them implement solutions. About the team We are a multidisciplinary team that combines the talents of science and engineering to develop innovative solutions to make Amazon Earth's Best Employer. We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA
US, WA, Seattle
We are expanding our Global Risk Management & Claims team and insurance program support for Amazon’s growing risk portfolio. This role will partner with our risk managers to develop pricing models, determine rate adequacy, build underwriting and claims dashboards, estimate reserves, and provide other analytical support for financially prudent decision making. As a member of the Global Risk Management team, this role will provide actuarial support for Amazon’s worldwide operation. Key job responsibilities ● Collaborate with risk management and claims team to identify insurance gaps, propose solutions, and measure impacts insurance brings to the business ● Develop pricing mechanisms for new and existing insurance programs utilizing actuarial skills and training in innovative ways ● Build actuarial forecasts and analyses for businesses under rapid growth, including trend studies, loss distribution analysis, ILF development, and industry benchmarks ● Design actual vs expected and other metrics dashboards to assist decision makings in pricing analysis ● Create processes to monitor loss cost and trends ● Propose and implement loss prevention initiatives with impact on insurance pricing in mind ● Advise underwriting decisions with analysis on driver risk profile ● Support insurance cost budgeting activities ● Collaborate with external vendors and other internal analytics teams to extract insurance insight ● Conduct other ad hoc pricing analyses and risk modeling as needed We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA | New York, NY, USA | Seattle, WA, USA
US, NY, New York
The Amazon SCOT Forecasting team seeks a Senior Applied Scientist to join our team. Our research team conducts research into the theory and application of reinforcement learning. This research is shared in top journals and conferences and has a significant impact on the field. Through our launch of several Deep RL models into production, our work also affects decision making in the real world. Members of our group have varied interests—from the mathematical foundations of reinforcement learning, to language modeling, to maintaining the performance of generative models in the face of copyrights, and more. Recent work has focused on sample efficiency of RL algorithms, treatment effect estimation, and RL agents integrating real-world constraints, as applied in supply chains. Previous publications include: - Linear Reinforcement Learning with Ball Structure Action Space - Meta-Analysis of Randomized Experiments with Applications to Heavy-Tailed Response Data - A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation - Deep Inventory Management - What are the Statistical Limits of Offline RL with Linear Function Approximation? Working collaboratively with a group of fellow scientists and engineers, you will identify complex problems and develop solutions in the RL space. We encourage collaboration across teammates and their areas of specialty, leading to creative and ambitious projects with the goal of publication and production. Key job responsibilities - Drive collaborative research and creative problem solving - Constructively critique peer research; mentor junior scientists - Create experiments and prototype implementations of new algorithms and techniques - Collaborate with engineering teams to design and implement software built on these new algorithms - Contribute to progress of the Amazon and broader research communities by producing publications We are open to hiring candidates to work out of one of the following locations: New York, NY, USA
US, CA, Virtual Location - California
If you are interested in this position, please apply on Twitch's Career site https://www.twitch.tv/jobs/en/ About Us: Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate and grow their personal interests and passions. We're always live at Twitch. About the Role: As a Data Scientist, Analytics member of the Data Platform - Insights team, you'll provide data analysis and support for platform, service, and operational engineering teams at Twitch, shaping the way success is measured. Defining what questions should be asked and scaling analytics methods and tools to support our growing business. Additionally, you will help support the vision for business analytics, solutions architecture for data related business constructs, as well as tactical execution such as experiment analysis and campaign performance reporting. You are paving the way for high-quality, high-velocity decisions and will report to the Manager, Data Science. For this role, we're looking for an experienced data staff who will oversee data instrumentation, dashboard/report building, metrics reviews, inform team investments, guidance on success/failure metrics and ad-hoc analysis. You will also work with technical and non-technical staff members throughout the company, and your effort will have an impact on hundreds of partners at Twitch You Will: - Work with members of Platforms & Services to guide them towards better decision making from the available data. - Promote data knowledge and insights through managing communications with partners and other teams, collaborate with colleagues to complete data projects and ensure all parties can use the insights to further improve. - Maintain a customer-centric focus while being a domain and product expert through data, develop trust amongst peers, and ensure that the teams and programs have access to data to make decisions - Manage ambiguous problems and adapt tools to answer complicated questions. - Identify the trade-offs between speed and quality of different approaches. - Create analytical frameworks to measure team success by partnering with teams to establish success metrics, create approaches to track the data and troubleshoot errors, measure and evaluate the data to develop a common language for all colleagues to understand these metrics. - Operationalize data processes to provide partners with ad-hoc analysis, automated dashboards, and self-service reporting tools so that everyone gets a good sense of the state of the business Perks: - Medical, Dental, Vision & Disability Insurance - 401(k), Maternity & Parental Leave - Flexible PTO - Commuter Benefits - Amazon Employee Discount - Monthly Contribution & Discounts for Wellness Related Activities & Programs (e.g., gym memberships, off-site massages), -Breakfast, Lunch & Dinner Served Daily - Free Snacks & Beverages We are open to hiring candidates to work out of one of the following locations: Irvine, CA, USA | Seattle, WA, USA | Virtual Location - CA
US, WA, Bellevue
Have you ever ordered a product on Amazon and when that box with the smile arrived you wondered how it got to you so fast? Have you wondered where it came from and how much it cost Amazon to deliver it to you? Have you also wondered what are different ways that the transportation assets can be used to delight the customer even more. If so, the Amazon transportation Services, Product and Science is for you . We manage the delivery of tens of millions of products every week to Amazon’s customers, achieving on-time delivery in a cost-effective manner. We are looking for an enthusiastic, customer obsessed Applied Scientist with strong scientific thinking, good software and statistics experience, skills to help manage projects and operations, improve metrics, and develop scalable processes and tools. The primary role of an Applied Scientist within Amazon is to address business challenges through building a compelling case, and using data to influence change across the organization. This individual will be given responsibility on their first day to own those business challenges and the autonomy to think strategically and make data driven decisions. Decisions and tools made in this role will have significant impact to the customer experience, as it will have a major impact on how we operate the middle mile network. Ideal candidates will be a high potential, strategic and analytic graduate with a PhD in (Operations Research, Statistics, Engineering, and Supply Chain) ready for challenging opportunities in the core of our world class operations space. Great candidates have a history of operations research, machine learning , and the ability to use data and research to make changes. This role requires robust skills in research and implementation of scalable products and models . This individual will need to be able to work with a team, but also be comfortable making decisions independently, in what is often times an ambiguous environment. Responsibilities may include: - Develop input and assumptions based preexisting models to estimate the costs and savings opportunities associated with varying levels of network growth and operations - Creating metrics to measure business performance, identify root causes and trends, and prescribe action plans - Managing multiple projects simultaneously - Working with technology teams and product managers to develop new tools and systems to support the growth of the business - Communicating with and supporting various internal stakeholders and external audiences We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA
US, CA, Los Angeles
The Alexa team is looking for a passionate, talented, and inventive Applied Scientist with a strong machine learning background, to help build industry-leading Speech and Language technology. Key job responsibilities As an Applied Scientist with the Alexa team, you will work with talented peers to develop novel algorithms and modeling techniques to advance the state of the art in spoken language understanding. Your work will directly impact our customers in the form of products and services that make use of speech and language technology. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in spoken language understanding. About the team The Alexa team has a mission to push the envelope in Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), and Audio Signal Processing, in order to provide the best-possible experience for our customers. We are open to hiring candidates to work out of one of the following locations: Los Angeles, CA, USA
US, WA, Seattle
Are you fascinated by the power of Natural Language Processing (NLP) and Large Language Models (LLM) to transform the way we interact with technology? Are you passionate about applying advanced machine learning techniques to solve complex challenges in the e-commerce space? If so, Amazon's International Seller Services team has an exciting opportunity for you as an Applied Scientist. At Amazon, we strive to be Earth's most customer-centric company, where customers can find and discover anything they want to buy online. Our International Seller Services team plays a pivotal role in expanding the reach of our marketplace to sellers worldwide, ensuring customers have access to a vast selection of products. As an Applied Scientist, you will join a talented and collaborative team that is dedicated to driving innovation and delivering exceptional experiences for our customers and sellers. You will be part of a global team that is focused on acquiring new merchants from around the world to sell on Amazon’s global marketplaces around the world. The position is based in Seattle but will interact with global leaders and teams in Europe, Japan, China, Australia, and other regions. Join us at the Central Science Team of Amazon's International Seller Services and become part of a global team that is redefining the future of e-commerce. With access to vast amounts of data, cutting-edge technology, and a diverse community of talented individuals, you will have the opportunity to make a meaningful impact on the way sellers engage with our platform and customers worldwide. Together, we will drive innovation, solve complex problems, and shape the future of e-commerce. Please visit https://www.amazon.science for more information Key job responsibilities - Apply your expertise in LLM models to design, develop, and implement scalable machine learning solutions that address complex language-related challenges in the international seller services domain. - Collaborate with cross-functional teams, including software engineers, data scientists, and product managers, to define project requirements, establish success metrics, and deliver high-quality solutions. - Conduct thorough data analysis to gain insights, identify patterns, and drive actionable recommendations that enhance seller performance and customer experiences across various international marketplaces. - Continuously explore and evaluate state-of-the-art NLP techniques and methodologies to improve the accuracy and efficiency of language-related systems. - Communicate complex technical concepts effectively to both technical and non-technical stakeholders, providing clear explanations and guidance on proposed solutions and their potential impact. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
US, CA, Palo Alto
We’re working to improve shopping on Amazon using the conversational capabilities of large language models. We are open to hiring candidates to work out of one of the following locations: Palo Alto, CA, USA