European Open Source AI Index
DatabaseNewsGuidesAboutContribute

Open Assistant

by Open Assistant

Community-built model aiming to bridge the gap between open-source models and ChatGPT.
Text
Limited
https://open-assistant.io/
Pythia-12B
Pythia-12B-SFT-v8-7K-Steps
Apache-2.0
Early intiative to bolster the open-source community.
https://open-assistant.io/
February 2023
Availability
Base Model Data
Training data published on HuggingFace for Pythia-based models.
https://huggingface.co/datasets/EleutherAI/the_pile_deduplicated
End User Model Data
OpenAssistant Conversations is 'a human-generated, human-annotated assistant-style conversation corpus consisting of 161443 messages distributed across 66497 conversation trees, in 35 different languages, annotated with 461292 quality ratings' (preprint).
https://huggingface.co/datasets/OpenAssistant/oasst1
Base Model Weights
Model weights available via HuggingFace.
https://huggingface.co/EleutherAI/pythia-12b-deduped
End User Model Weights
Model weights available via HuggingFace.
https://huggingface.co/OpenAssistant/pythia-12b-sft-v8-7k-steps
Training Code
GitHub containing training code available.
https://github.com/LAION-AI/Open-Assistant
Documentation
Code Documentation
Separate website provides entry point to comprehensive documentation.
https://projects.laion.ai/Open-Assistant/docs/intro
Hardware Architecture
Hardware architecture is listed on WandB pages. Some pages are no longer accessible.
https://wandb.ai/open-assistant/supervised-finetuning/runs/pcw1ejda
Preprint
Preprint describes creation of OpenAssistant Conversations corpus for instruction tuning, but not the base LLM, hence partial.
https://arxiv.org/abs//2304.07327
Paper
Preprint was published in NeurIPS.
https://proceedings.neurips.cc/paper_files/paper/2023/hash/949f0f8f32267d297c2d4e3ee10a2e7e-Abstract-Datasets_and_Benchmarks.html
Modelcard
Various model cards exist
https://huggingface.co/OpenAssistant
Datasheet
Most data sets are linked and some contain a data sheet.
https://docs.google.com/spreadsheets/d/1NYYa6vHiRnk5kwnyYaCT0cBO62--Tm3w4ihdBtp4ISk/edit?pli=1&gid=1537161081#gid=1537161081
Access
Licenses
Apache 2.0
https://projects.laion.ai/Open-Assistant/docs/faq#what-license-does-open-assistant-use
Is this information not up to date?
Contribute here ->
Supported by the Centre for Language Studies and the Dutch Research Council. Website design & development © 2024 by BSTN. This version of the index generated 09 April 2026, website content last updated 11 March 2026.