BERTweet: A pre-trained language model for English Tweets

Dat Quoc Nguyen, Thanh Vu, Anh Tuan Nguyen

Demo Paper

Gather-2M: Nov 17 (10:00-12:00 UTC)

Abstract: We present BERTweet, the first public large-scale pre-trained language model for English Tweets. Our BERTweet, having the same architecture as BERT-base (Devlin et al., 2019), is trained using the RoBERTa pre-training procedure (Liu et al., 2019). Experiments show that BERTweet outperforms the strong baselines RoBERTa-base and XLM-R-base (Conneau et al., 2020), producing better performance than the previous state-of-the-art models on three Tweet NLP tasks: part-of-speech tagging, named-entity recognition and text classification. We release BERTweet under the MIT License to facilitate future research and applications on Tweet data. Our BERTweet is available at https://github.com/VinAIResearch/BERTweet
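
The released model can be used as a drop-in encoder with the `transformers` library. The minimal sketch below assumes the checkpoint is published on the Hugging Face Hub under the identifier `vinai/bertweet-base` (as documented in the linked repository) and that, following the repository's pre-processing convention, URLs and user mentions in the input Tweet have already been replaced by the placeholders HTTPURL and @USER; the example Tweet itself is illustrative.

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Assumption: the BERTweet checkpoint is hosted on the Hugging Face Hub
# under the identifier "vinai/bertweet-base" (see the linked repository).
tokenizer = AutoTokenizer.from_pretrained("vinai/bertweet-base")
model = AutoModel.from_pretrained("vinai/bertweet-base")

# Illustrative, pre-processed Tweet: URLs and user mentions are assumed to be
# normalized to the HTTPURL and @USER placeholders beforehand.
tweet = "SC has first two presumptive cases of coronavirus , DHEC confirms HTTPURL via @USER"

# Tokenize and extract contextual features from the last hidden layer.
input_ids = torch.tensor([tokenizer.encode(tweet)])
with torch.no_grad():
    features = model(input_ids).last_hidden_state  # shape: (1, sequence_length, 768)
```

These features can then feed a task-specific head (e.g., for POS tagging, NER or text classification), mirroring how the baselines are evaluated in the paper.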
