European Open Source AI Index

BERT

by Google AI

This entry collates base BERT and its many derivatives.
Text
Limited
https://research.google/blog/open-sourcing-bert-state-of-the-art-pre-training-for-natural-language-processing/
BERT
Apache-2.0
Major technology company, operator of Google Search.
https://ai.google
November 2018
Availability
Base Model Data
English Wikipedia, BookCorpus (no longer available). See also: https://en.wikipedia.org/wiki/BookCorpus
https://github.com/google-research/bert
End User Model Data
English Wikipedia, BookCorpus (no longer available). See also: https://en.wikipedia.org/wiki/BookCorpus
https://github.com/google-research/bert
Base Model Weights
https://github.com/google-research/bert
End User Model Weights
https://github.com/google-research/bert
Training Code
https://github.com/google-research/bert
Documentation
Code Documentation
https://github.com/google-research/bert
Hardware Architecture
https://github.com/google-research/bert
Preprint
https://arxiv.org/abs/1810.04805
Paper
https://aclanthology.org/N19-1423.pdf
Modelcard
https://github.com/google-research/bert
Datasheet
https://github.com/google-research/bert
Access
Package
API and Meta Prompts
Licenses
Weights under Apache-2.0; training data licensing unclear
Supported by the Centre for Language Studies and the Dutch Research Council. Website design & development © 2024 by BSTN. This version of the index generated 10 March 2026, website content last updated 11 March 2026.