European Open Source AI Index
DatabaseNewsGuidesAboutContribute

MPT

by Databricks

Open LLM by Databricks which has been taken down.
Text
Full
https://huggingface.co/mosaicml/mpt-30b-instruct
MPT-30B
MPT-30B-Instruct
Apache-2.0
Databricks, a data platform.
https://www.databricks.com
June 2023
Availability
Base Model Data
C4 is part of the dataset but a precise specification of source data is hard to find
https://huggingface.co/datasets/c4
End User Model Data
dolly-hhrlhf, combination of Databrick dolly-15k dataset and a filtered subset of Anthropic HH-RLHF
https://huggingface.co/datasets/mosaicml/dolly_hhrlhf
Base Model Weights
Weights removed from HuggingFace.
End User Model Weights
Weights removed from HuggingFace.
Training Code
Codebase part of LLM foundry
https://github.com/mosaicml/llm-foundry/tree/main/llmfoundry/models/mpt
Documentation
Code Documentation
LLM Foundry codebase is well-documented and in active development.
https://github.com/mosaicml/llm-foundry/
Hardware Architecture
Architecture reasonably well-documented
https://huggingface.co/mosaicml/mpt-30b-instruct
Preprint
Paper
Modelcard
Modelcard is somewhat lacking in detail
https://huggingface.co/mosaicml/mpt-30b-instruct
Datasheet
Datasheet not available; data somewhat documented in blog post at link
https://www.mosaicml.com/blog/mpt-30b
Access
Licenses
Apache 2.0
Is this information not up to date?
Contribute here ->
Supported by the Centre for Language Studies and the Dutch Research Council. Website design & development © 2024 by BSTN. This version of the index generated 09 April 2026, website content last updated 11 March 2026.