BERT tags Transformers, NLP paper (Devlin et al. 2019) Parameter count Base = 110M Large = 340M Bibliography Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova. May 24, 2019. May 24, 2019DOI. Links to this note ALBERT BART Big bird DistillBERT ERNIE Imagen Megatron RoBERTa Semantic similarity Vision transformer Last changed 2022.07.22 | authored by Hugo Cisneros
Loading comments...