Posted in Web
RoBERTa: A Robustly Optimized BERT Pretraining Approach
May 12, 2026

We find that BERT was significantly undertrained and propose an improved recipe for training BERT models, which we call RoBERTa, that can match or exceed the performance of all of the post-BERT methods.

https://arxiv.org/pdf/1907.11692