A hybrid approach to cross-lingual product review summarization

Saleh Soltan; Victor Soto; Ke Tran; Wael Hamza


2022


We present a hybrid approach to product review summarization that consists of (i) an unsupervised extractive step that selects the most important sentences from all the reviews, and (ii) a supervised abstractive step that condenses the extracted sentences into a short, coherent summary. This approach lets us build an efficient cross-lingual abstractive summarizer that can generate summaries in any language from the sentences extracted out of thousands of reviews in a source language. To train and test the abstractive model, we create the Cross-lingual Amazon Reviews Summarization (CARS) dataset, which provides English summaries for training and English, French, Italian, Arabic, and Hindi summaries for testing, all based on selected English reviews. We show that the summaries generated by our model are as good as human-written summaries in coherence, informativeness, non-redundancy, and fluency.
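The two-step structure described above can be illustrated with a minimal sketch. The abstract does not say how sentences are scored or which abstractive model is used, so everything here is an assumption: the frequency-based scoring is a simple stand-in for the unsupervised extractive step, the abstractive step is represented by a caller-supplied function, and all names (`extract_top_sentences`, `summarize`, `abstractive_model`) are hypothetical.

```python
import re
from collections import Counter
from typing import Callable, List


def split_sentences(text: str) -> List[str]:
    # Naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence: str) -> List[str]:
    # Lowercased alphabetic tokens (ASCII letters and apostrophes only).
    return re.findall(r"[a-z']+", sentence.lower())


def extract_top_sentences(reviews: List[str], k: int = 3) -> List[str]:
    """Assumed stand-in for the unsupervised extractive step.

    Scores each sentence by the average number of sentences in which its
    words appear, so sentences that echo common opinions rank higher.
    """
    sentences = [s for review in reviews for s in split_sentences(review)]
    # Count, for every word, how many sentences contain it.
    freq = Counter(w for s in sentences for w in set(tokenize(s)))

    def score(sentence: str) -> float:
        tokens = tokenize(sentence)
        return sum(freq[w] for w in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(sentences, key=score, reverse=True)
    selected: List[str] = []
    seen = set()
    for sentence in ranked:
        key = sentence.lower()
        if key in seen:  # skip verbatim duplicates
            continue
        seen.add(key)
        selected.append(sentence)
        if len(selected) == k:
            break
    return selected


def summarize(
    reviews: List[str],
    abstractive_model: Callable[[List[str], str], str],
    target_lang: str = "en",
    k: int = 3,
) -> str:
    # Step (i): reduce many reviews to a few salient sentences.
    extracted = extract_top_sentences(reviews, k)
    # Step (ii): hand them to a (hypothetical) supervised abstractive model
    # that writes the summary in the requested target language.
    return abstractive_model(extracted, target_lang)
```

In this sketch only the short list of extracted sentences reaches the abstractive model, which is one plausible reading of why the hybrid design is described as efficient for thousands of reviews. A real `abstractive_model` would be a trained sequence-to-sequence system; any callable with the same signature can be plugged in for experimentation.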
