eryk-mazus committed
Commit 4074583
Parent(s): a0d9543

Update README.md

Files changed (1)
  1. README.md +3 -4
README.md CHANGED
@@ -13,16 +13,15 @@ pipeline_tag: text-generation
 
 # polka-1.1B-dpo
 
-`eryk-mazus/polka-1.1b-dpo` is the first Polish model trained to act as a helpful, conversational assistant that can be run locally.
+`eryk-mazus/polka-1.1b-dpo` is **the first Polish model trained to act as a helpful, conversational assistant that can be run locally.**
 
-This model is based on [TinyLlama-1.1B](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) with an extended tokenizer for more efficient Polish text generation, pretrained on an additional 6 billion Polish tokens. It was then fine-tuned using synthetically created and machine-translated multi-turn conversations with the [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290) performed on top of it.
+This model is based on [TinyLlama-1.1B](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) with an extended tokenizer for more efficient Polish text generation, pretrained on an additional 6 billion Polish tokens. It was then fine-tuned using synthetically generated and machine-translated multi-turn conversations, with [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290) performed on top of it.
 
 Context size: 4,096 tokens
 
-In addition, we've releasing:
+In addition, we're releasing:
 * [polka-1.1b](https://huggingface.co/eryk-mazus/polka-1.1b) - our base model with an extended tokenizer and additional pre-training on Polish corpus sampled using [DSIR](https://github.com/p-lambda/dsir)
 * [polka-pretrain-en-pl-v1](https://huggingface.co/datasets/eryk-mazus/polka-pretrain-en-pl-v1) - the pre-training dataset
-* [polka-1.1b-sft](https://huggingface.co/eryk-mazus/polka-1.1b-sft) - SFT version of the base model trained on Polish conversations
 * [polka-dpo-v1](https://huggingface.co/datasets/eryk-mazus/polka-dpo-v1) - dataset of DPO pairs
 
 ## Usage
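
The Usage section itself is not shown in this hunk. As a conversational model, it would typically be driven through the standard 🤗 `transformers` chat-template API; below is a minimal sketch along those lines. The assumption that the tokenizer ships a chat template, plus the example prompt and sampling settings, are illustrative and not taken from this commit.

```python
# Minimal sketch: chatting with polka-1.1b-dpo via transformers.
# Assumes the tokenizer provides a chat template; prompt and sampling
# parameters below are illustrative placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "eryk-mazus/polka-1.1b-dpo"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 1.1B params fits comfortably on a consumer GPU
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Napisz krótki wiersz o wiośnie."},  # "Write a short poem about spring."
]

# Build the prompt with the tokenizer's chat template and generate a reply.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(
    input_ids,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
)

# Decode only the newly generated tokens.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```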