BERT

tags
Transformers, NLP
paper
(Devlin et al. 2019)

Parameter count

  • Base = 110M
  • Large = 340M

Bibliography

  1. . . May 24, 2019DOI.
Last changed | authored by

Comments

Loading comments...

Leave a comment

Back to Notes