We used distilabel
to create several pipelines for generating instruction-following and multi-turn datasets for the post-training of SmolLM2.
Note
This section is still in WIP. We will upload the rest of the pipelines soon. Thanks for your patience!