A* Beam Search

Clara Meister; Ryan Cotterell; Tim Vieira

A* Beam Search

Clara Meister, Ryan Cotterell, Tim Vieira

Abstract Connected Papers Add to Favorites

Language Generation Tacl Paper

Zoom-5B: Nov 17, Zoom-5B: Nov 17 (08:00-09:00 UTC) [Join Zoom Meeting]

You can open the pre-recorded video in a separate window.

Abstract: Decoding for many NLP tasks requires an effective heuristic algorithm for approximating exact search since the problem of searching the full output space is often intractable, or impractical in many settings. The default algorithm for this job is beam search--a pruned version of breadth-first search. Quite surprisingly, beam search often returns better results than exact inference due to beneficial search bias for NLP tasks. In this work, we show that the standard implementation of beam search can be made up to 10x faster in practice. Our method assumes that the scoring function is monotonic in the sequence length, which allows us to safely prune hypotheses that cannot be in the final set of hypotheses early on. We devise effective monotonic approximations to popular nonmonontic scoring functions, including length normalization and mutual information decoding. Lastly, we propose a memory-reduced variant of Best-First Beam Search, which has a similar beneficial search bias in terms of downstream performance, but runs in a fraction of the time.

NOTE: Video may display a random order of authors. Correct author list is at the top of this page.

Connected Papers in EMNLP2020

A* Beam Search

Clara Meister, Ryan Cotterell, Tim Vieira

Connected Papers in EMNLP2020

Similar Papers

A Streaming Approach For Efficient Batched Beam Search

Kevin Yang, Violet Yao, John DeNero, Dan Klein,

Training for Gibbs Sampling on Conditional Random Fields with Neural Scoring Factors

Sida Gao, Matthew R. Gormley,

Consistency of a Recurrent Language Model With Respect to Incomplete Decoding

Sean Welleck, Ilia Kulikov, Jaedeok Kim, Richard Yuanzhe Pang, Kyunghyun Cho,

Gradient-guided Unsupervised Lexically Constrained Text Generation

Lei Sha,