Arab Press

بالشعب و للشعب
Saturday, Feb 22, 2025

Google’s SummAE AI generates abstract summaries of paragraphs

Google’s SummAE AI generates abstract summaries of paragraphs

Google researchers propose a novel AI summarization model - SummAE- capable of generating abstract summaries of paragraphs.
Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.


Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.

Recommended videosPowered by AnyClip
Go Eat A McRib
Play

Unmute
Duration
0:59
/
Current Time
0:17

Fullscreen
Up Next

NOW PLAYINGGo Eat A McRib
Scientists Discover What Makes 'Water Bears' Virtually Indestructible
Doctor diagnoses his own cancer with an app
There's A Bigger Danger To Pedestrians Than Walking While Distracted
Prince Harry to edit National Geographic's Instagram
The Secret Culprit Of America's Student Debt Crisis
5 Quotes About The Power of Books

The data set and code are freely available on GitHub, along with the configuration settings for the best model.

“As one of the very first works approaching single-document [abstract summarization], we propose a novel neural model — SummAE,” wrote the coauthors. “[We believe it] is therefore desirable to have models capable of automatically summarizing documents abstractively with little to no supervision.”

SummAE contains a denoising autoencoder that encodes (that is, generates numerical representations of) sentences and paragraphs of the target text in a shared space. Guided by a decoder whose input is prepended with a token signaling whether to decode a sentence or a paragraph, the system generates summaries by decoding each sentence from the encoded paragraphs.

The researchers discovered that most traditional approaches to training the auto-encoder resulted in long, multi-sentence summaries. To encourage it to learn higher-level concepts disentangled from their original expression, the team employed two denoising approaches — randomly masking tokens and permuting the order of sentences within paragraphs — that increased the number of training examples substantially. They also experimented with an adversarial critic component that could distinguish between sentences and paragraphs, in addition to two pretraining tasks that encouraged the encoder to learn how sentences narratively followed within a paragraph.

The researchers trained three different variations of SummAE on the ROCStories, a corpus of self-contained, diverse, non-technical, and concise prose. They split the original 98,159 training stories into three separate collections — a training set, a validation set, and a test set — and collected three human summaries each for 500 validation examples and 500 test examples.

After 100,000 training steps with pretraining, the team reports that the best model significantly outperformed a baseline extractive sentence generator on the Recall-Oriented Understudy for Gisting Evaluation (ROUGE), a set of metrics devised to evaluate automatic summarization. Moreover, they say that in a qualitative study involving evaluators recruited through Amazon’s Mechanical Turk, volunteers rated one of the three SummAE models’ summaries “fluent” and “information-relevant” 80% of the time.

“The paragraph reconstructions show some coherence, although with some disfluencies and factual inaccuracies that are common with neural generative models,” wrote the coauthors. “Since the summaries are decoded from the same latent vector as the reconstructions, improving them could lead to more accurate summaries.”
Newsletter

Related Articles

Arab Press
0:00
0:00
Close
The negotiation teams of Trump and Putin meet directly, establishing the groundwork for a significant advance.
Israeli Minister Urges Hamas to Surrender and Depart from Gaza.
Iran Considers Moving Its Capital Due to Urban Difficulties
Israel and Hamas Finalize Sixth Exchange of Hostages and Prisoners During Continuing Gaza Ceasefire
Leaders of BRICS to Gather in Rio de Janeiro for July Summit
Muhsin Hendricks, a trailblazing openly gay imam, was killed in South Africa.
Trump's special envoy for hostage affairs cautions Hamas against challenging Trump before Saturday's deadline.
Two British citizens apprehended in Iran amid escalating tensions.
Israel Issues Threat of Military Action as Hostage Negotiations with Hamas Continue
Hamas Coordinates Worldwide Solidarity Marches in Reaction to U.S. and Israeli Initiative
Israel Warns of Ending Gaza Ceasefire Due to Hostage Situation
King Abdullah II Dismisses US Proposal to Relocate Palestinians, Commits to Welcoming Gaza Children.
Lebanon Installs New Government with Hezbollah's Impact on Key Ministries
Report: Iran Attempted to Assassinate Trump During Election Campaign
U.S. Authorizes $7.4 Billion Arms Sale to Israel
Iran's Supreme Leader Rejects Nuclear Negotiations with the U.S.
UN Chief Denounces Trump's Gaza Plan, Cautions Against Ethnic Cleansing
Pressure Intensifies for a Free Trade Agreement between the UK and GCC in Light of Economic Difficulties
Israel to Withdraw from UN Human Rights Council Due to Accusations of Anti-Semitism
EU Reaffirms Gaza's Essential Role in Future Palestinian State Following Trump's Proposal
Iranian Currency Reaches All-Time Low Amid US 'Maximum Pressure' Initiative.
UN Reaffirms Ban on Deportation from Occupied Territories Amid US Gaza Proposal
Palestinians Fear Repeat of 'Nakba' Amid Ongoing Crisis in Gaza
UAE Aids in the Exchange of 300 Prisoners Between Russia and Ukraine
Egypt Seeks Global Backing for Two-State Solution Following US Proposal for Gaza Plan
Trump's Suggestion to 'Seize Control' of Gaza Represents a Significant Shift in US Policy
French President is the first EU leader to extend congratulations to the new Syrian President.
Tunisian President Appoints New Finance Minister Amid Economic Crisis
Trump Suggests U.S. 'Takeover' of Gaza, Prompting Global Worries
Trump's Proposal for Gaza Provokes Global Debate
President Trump Suggests Moving Gaza's Palestinian Population
Aga Khan IV, Spiritual Leader and Philanthropist, Dies at 88
Erdogan and Syria's Sharaa Talk About Collaboration to Counter Kurdish Militants
Trump Suggests U.S. Control of Gaza Strip Amid Ongoing Conflict
Trump Resumes 'Maximum Pressure' Strategy to Limit Iran's Oil Exports.
Ex-British Soldier Sentenced for Espionage on Behalf of Iran and Fleeing from Prison
Gazans in Egypt Reject Displacement, Struggle with Return to War-Torn Home
Queen Rania Urges Protection of Children’s Rights at Vatican Summit
Hamas Officials Ready to Begin Negotiations for Phase Two of Gaza Truce
Trump Expresses Caution Over Gaza Ceasefire as Netanyahu Visits Washington
Oman to Host 18th Indian Ocean Conference on Maritime Security and Trade
Emir of Kuwait Meets BlackRock CEO for Talks on Investment Opportunities
Queen Rania of Jordan Calls for Global Action on Children’s Rights at Vatican Summit
Egyptian President El-Sisi Invited for White House Meeting Following Jordanian King’s Visit
Queen Rania Calls for Protection of Children’s Rights at Vatican Summit
Israeli Military Operations Continue on Lebanon Border Amid Ceasefire Tensions
Israeli Hostage's Release Highlights Uncertainty Over Family's Fate
Israeli Military Operations Escalate in Southern Lebanon Amid Hezbollah Tensions
Zayed Award for Human Fraternity Announces 2025 Honorees
Kuwait Anticipates a 12% Increase in Budget Deficit for the 2025-2026 Fiscal Year
×