arxiv:2307.01403
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Organizations
Papers
1
models
136
mnoukhov/pythia2.8b-rm-tldr6.9b
Text Classification
•
Updated
•
146
mnoukhov/pythia2.8b-sft-tldr
Text Generation
•
Updated
•
346
mnoukhov/pythia160m-sft-tldr
Text Generation
•
Updated
•
4
mnoukhov/pythia160m-rm-tldr6.9b
Text Classification
•
Updated
•
6
mnoukhov/pythia1b-rm-tldr6.9b
Text Classification
•
Updated
•
37
mnoukhov/pythia1b-sft-tldr
Text Generation
•
Updated
•
137
mnoukhov/EleutherAI_pythia-1b-deduped__sft__tldr-rm-tldr6.9b
Text Classification
•
Updated
•
5
mnoukhov/EleutherAI_pythia-1b-deduped__sft__tldr
Text Generation
•
Updated
•
3
mnoukhov/pythia410m-rm-tldr6.9b
Text Classification
•
Updated
•
655
mnoukhov/pythia160m-rm-tldr
Text Classification
•
Updated
•
8
datasets
49
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b
Viewer
•
Updated
•
177k
•
2
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel2_llama8b
Viewer
•
Updated
•
92.1k
•
2
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_llama8b
Viewer
•
Updated
•
176k
•
2
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr_relabel_pythia1b
Viewer
•
Updated
•
107k
•
2
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr
Viewer
•
Updated
•
107k
•
2
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia1b
Viewer
•
Updated
•
177k
•
2
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144
Viewer
•
Updated
•
179k
•
2
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873_relabel_pythia1b
Viewer
•
Updated
•
20k
•
2
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873
Viewer
•
Updated
•
20k
•
2
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_dpo_costa_2.8b_bf16.yml_6e799_new
Viewer
•
Updated
•
20k
•
2