iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 1.92k • 231
Malikeh1375/medical-question-answering-datasets Viewer • Updated Nov 2, 2023 • 1.26M • 645 • 27
llm-wizard/dolly-15k-instruction-alpaca-format Viewer • Updated Apr 13, 2023 • 15k • 130 • 29
Telugu-LLM-Labs/marathi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 39 • 1
Telugu-LLM-Labs/nepali_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 49 • 5
generative-technologies/synth-ehr-icd10-alpaca-format Viewer • Updated Jun 24 • 379k • 150 • 1
Vanessasml/cybersecurity_32k_instruction_input_output Viewer • Updated Apr 19 • 32.6k • 101 • 11
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C25-L25-E25-R05 Viewer • Updated Nov 29, 2023 • 10.1M • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 109
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 119
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3 Viewer • Updated Mar 25 • 40k • 37
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3_16 Viewer • Updated Mar 26 • 20k • 35
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft Viewer • Updated May 20 • 6.37M • 54 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_xlarge__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1M • 39
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions_gen_eval_sft Viewer • Updated Mar 7 • 1.2k • 72
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 79
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 51
y1xing/natural_language_prompt_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 276 • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16 Viewer • Updated Mar 26 • 20k • 37
phyloforfun/HLT_MICH_Angiospermae_SLTPvC_v1-0_medium_OCR-C25-L25-E50-R05 Viewer • Updated Mar 15 • 10k • 35 • 1
somosnlp-hackathon-2023/ask2democracy-cfqa-salud-pension Viewer • Updated Apr 11, 2023 • 3.81k • 62 • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 46
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 94.6k • 40
y1xing/orpo_llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 23 • 568k • 118
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 65
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 568k • 224
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 51
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 66
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2 Viewer • Updated Mar 7 • 60k • 43
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2_random Viewer • Updated Mar 10 • 60k • 53
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-8_random Viewer • Updated Mar 10 • 60k • 43
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 320
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 22 • 568k • 181
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 21 • 568k • 200
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 22 • 568k • 260
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 74
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 511k • 153
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 189k • 53
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 103
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.5 Viewer • Updated Mar 27 • 568k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 125
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 49
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 81
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 118
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 26 • 94.6k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.5 Viewer • Updated Mar 26 • 568k • 85
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.25 Viewer • Updated Mar 26 • 568k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.7 Viewer • Updated Mar 27 • 568k • 86
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 61
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 96
vinhtran2611/ArtifactAI_arxiv-physics-instruct-tune-30k_formated Viewer • Updated Jun 7 • 30.2k • 36
vinhtran2611/arxiv-physics-instruct-tune-30k_filtered_formated Viewer • Updated Jun 17 • 324 • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 14 • 37.9k • 43
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 193
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 568k • 119
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 59
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.25 Viewer • Updated Mar 26 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.9 Viewer • Updated Mar 26 • 568k • 67
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.9 Viewer • Updated Mar 26 • 568k • 86
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 27 • 568k • 236
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.3 Viewer • Updated Mar 27 • 568k • 180
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.5 Viewer • Updated Mar 27 • 568k • 78
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 136
Telugu-LLM-Labs/sindhi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 41 • 2
Telugu-LLM-Labs/assamese_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 54 • 1
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_tiny__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 14 • 37.9k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 21 • 37.9k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 21 • 568k • 194
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 158
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 120
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 226
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 126
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 135
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 71
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 91
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 24 • 189k • 65
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 24 • 189k • 54
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 568k • 154
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 53
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 26 • 94.6k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 27 • 568k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.9 Viewer • Updated Mar 27 • 568k • 189
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.9 Viewer • Updated Mar 27 • 568k • 100
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 87
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Apr 26 • 303k • 98
gogo8232/experiment_perplexity_instruction_llama3_8b_response Viewer • Updated Jul 5 • 34.9k • 34
oliverwang15/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Jul 11, 2023 • 67.2k • 41 • 9
lucasmccabe-lmi/sql-create-context_alpaca_style Viewer • Updated May 15, 2023 • 78.6k • 46 • 5
japneets/Alpaca_instruction_fine_tune_Punjabi_small Viewer • Updated Apr 16, 2023 • 10k • 44 • 1
filopedraz/swedish-sentiment-instruction-fine-tuning Viewer • Updated Jun 13, 2023 • 164k • 38 • 1
anton96vice/samantha-1.1-uncensored-split-and-prepared Viewer • Updated Mar 7 • 2.04k • 45 • 1
Telugu-LLM-Labs/konkani_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 64 • 1
Hadnet/olavo-article-17k-llama2-chat-dataset-text Viewer • Updated Sep 25, 2023 • 17.4k • 37 • 1
UMCU/WikiDocPatientInformation_Dutch_translated_with_MariaNMT Viewer • Updated Jan 22 • 5.76k • 49
Cesar7980/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 8, 2023 • 76.8k • 41
rodrfons/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 18, 2023 • 76.8k • 36
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1.0_OCR-C25-L25-E50-R10 Viewer • Updated Nov 29, 2023 • 230 • 32
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10.1M • 136
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_tiny__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 87 • 33
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_large__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 100k • 35
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C25-L25-E50-R05 Viewer • Updated Nov 30, 2023 • 10k • 34
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_large__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100k • 34
mfmezger/sandboxai_german_to_english_translations_seperated Viewer • Updated Feb 15 • 1.35M • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_160m Viewer • Updated Mar 14 • 37.9k • 54
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.3_self_160m Viewer • Updated Mar 21 • 37.9k • 35
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_1.0_self_160m Viewer • Updated Mar 21 • 18.9k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 94
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 21 • 568k • 189
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 339
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 219
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 21 • 568k • 82
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 24 • 568k • 234
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 169
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 87
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 186
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 148
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 107
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 189k • 54
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 189k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 189k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 189k • 59
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1 Viewer • Updated Mar 25 • 40k • 35
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 70
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 125
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 568k • 140
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.75 Viewer • Updated Mar 26 • 568k • 125
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0_eval Viewer • Updated Mar 28 • 568k • 253
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0_eval Viewer • Updated Mar 29 • 568k • 91
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0_eval Viewer • Updated Mar 29 • 568k • 242
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 271
thusinh1969/llama-2-7b-LongContext-mixed-64k-30APRIL2024 Viewer • Updated May 1 • 81.8k • 50 • 1
HachiML/oasst1_for_self-rewarding_EFT_Mixtral-8x22B-Instruct Viewer • Updated May 29 • 5.24k • 39
murugeshmarvel/a5d87d8c1326b4f0c531065dbe7f5068a2bab8a56edc9a9d4aab95be427bb171 Viewer • Updated Jun 5 • 95k • 32
generative-technologies/synth-ehr-icd10-llama3-format Viewer • Updated Jun 23 • 379k • 98 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_small__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1.01k • 39
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_medium__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 10k • 38
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_full__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1.42M • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 14 • 37.9k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 14 • 37.9k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 179
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 79
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 120
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 136
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 68
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 568k • 243
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 51
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 65
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2 Viewer • Updated Mar 25 • 40k • 40
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Apr 19 • 568k • 152
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 160
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 153
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.9 Viewer • Updated Mar 26 • 568k • 91
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.5 Viewer • Updated Mar 26 • 568k • 150
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.5 Viewer • Updated Mar 26 • 568k • 186
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 27 • 568k • 54
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.1 Viewer • Updated Mar 27 • 568k • 134
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.3 Viewer • Updated Mar 27 • 568k • 130
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.1 Viewer • Updated Mar 27 • 568k • 146
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.7 Viewer • Updated Mar 27 • 568k • 198
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.9 Viewer • Updated Mar 27 • 568k • 106
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0_eval Viewer • Updated Mar 28 • 568k • 213
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 135
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 198
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 155
Mitsuki-Sakamoto/alpaca_farm-RM-Mistral-7B-re-preference-256-nsample-2 Viewer • Updated Apr 15 • 20k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Apr 26 • 303k • 123
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_5 Viewer • Updated Apr 26 • 303k • 103
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10k • 41
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_small__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 14 • 37.9k • 40
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 107
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 152
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 22 • 568k • 98
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 94
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 149
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 568k • 121
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 64
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Apr 19 • 568k • 202
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 158
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 82
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 112
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.75 Viewer • Updated Mar 26 • 568k • 109
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.3 Viewer • Updated Mar 27 • 568k • 165
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.5 Viewer • Updated Mar 27 • 568k • 137
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 216
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 121
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 391
y1xing/natural_language_prompt_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 85
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1_16 Viewer • Updated Mar 26 • 20k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.75 Viewer • Updated Mar 26 • 568k • 165
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.25 Viewer • Updated Mar 26 • 568k • 109
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.1 Viewer • Updated Mar 27 • 568k • 71
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.7 Viewer • Updated Mar 27 • 568k • 123
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0_eval Viewer • Updated Mar 28 • 568k • 114
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Apr 26 • 303k • 59
y1xing/llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 32
y1xing/llama_chris_examples_generated_synthetic_data_instruct_dataset Viewer • Updated Jul 13 • 1.85k • 32
y1xing/partially_correct_llama_all_synthetic_data_instruct_dataset Viewer • Updated Jul 14 • 1.53k • 33
y1xing/llama_all_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 33
Mitsuki-Sakamoto/alpaca_farm-alpaca_gpt4_preference-re-preference_eval Viewer • Updated Jan 15 • 197k • 31
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions-re-preference Viewer • Updated Jan 17 • 22k • 164
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-eval-preference Viewer • Updated Feb 5 • 2k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-are-preference-256 Viewer • Updated Mar 1 • 22k • 36
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test Viewer • Updated Apr 19 • 40 • 35
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-4 Updated Mar 6 • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-8 Viewer • Updated Mar 6 • 20k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-16 Viewer • Updated Mar 7 • 20k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-16_random Viewer • Updated Mar 10 • 60k • 156
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 15 • 37.9k • 153
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 18 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 18 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_160m Updated Mar 21 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_160m Updated Mar 18 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 15 • 37.9k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 18 • 189k • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_160m Viewer • Updated Mar 18 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 19 • 189k • 133
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 19 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 19 • 189k • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_160m Updated Mar 19 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_160m Updated Mar 19 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_160m Updated Mar 19 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.3_self_160m Updated Mar 21 • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_1.0_self_160m Updated Mar 21 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Apr 19 • 568k • 34
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0_eval Viewer • Updated Mar 29 • 568k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_4 Viewer • Updated Apr 25 • 40k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_5 Viewer • Updated Apr 25 • 40k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test-alpaca-gen Viewer • Updated May 12 • 20 • 34
Karmukilan/Malikeh1375_medical-question-answering-datasets Viewer • Updated Jul 16 • 1k • 42 • 2
y1xing/natural_language_prompt_w_correct_ans_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 276 • 35
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 435 • 35
y1xing/natural_language_prompt_w_correct_ans_dataset_json_evaluation_instruct_dataset Viewer • Updated Jul 29 • 276 • 42
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_json_instruct_dataset Viewer • Updated Jul 29 • 435 • 34
y1xing/natural_language_prompt_w_correct_ans_dataset_training_instruct_dataset Viewer • Updated Jul 30 • 2.99k • 35
UMCU/MedicalFlashCards_Dutch_translated_with_MariaNMT Viewer • Updated Oct 31, 2023 • 32.9k • 37
Mitsuki-Sakamoto/sft_alpaca_pythia-1.4b-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 33
Mitsuki-Sakamoto/sft_alpaca_pythia-160m-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 33
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned Viewer • Updated Aug 8 • 6.36M • 39
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned-Split Viewer • Updated Aug 8 • 6.36M • 40
louisbrulenaudet/code-pensions-civiles-militaires-retraite Viewer • Updated about 2 hours ago • 257 • 80
louisbrulenaudet/code-disciplinaire-penal-marine-marchande Viewer • Updated about 2 hours ago • 6 • 88
louisbrulenaudet/code-domaine-public-fluvial-navigation-interieure Viewer • Updated about 2 hours ago • 2 • 96
louisbrulenaudet/code-domaine-etat-collectivites-mayotte Viewer • Updated about 2 hours ago • 3 • 65
louisbrulenaudet/code-legion-honneur-medaille-militaire-ordre-national-merite Viewer • Updated about 2 hours ago • 224 • 73
louisbrulenaudet/code-propriete-personnes-publiques Viewer • Updated about 2 hours ago • 1.13k • 92
louisbrulenaudet/code-postes-communications-electroniques Viewer • Updated about 2 hours ago • 730 • 102
louisbrulenaudet/code-instruments-monetaires-medailles Viewer • Updated about 2 hours ago • 6 • 89
arcee-globe/Evaluated_CohereForAI-aya_collection-aya_dataset Viewer • Updated Aug 20 • 14k • 38
Epic3123/election_misinformation_sleeper_agents_dataset_llama27b Viewer • Updated Aug 29 • 733 • 38
FoxySapiens/teknofest-egitim-hukuk-tarim-surdurulebilirlik-dataset Viewer • Updated Sep 7 • 233k • 31
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted Viewer • Updated Sep 15 • 6.13k • 31
DLI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test Viewer • Updated Sep 18 • 6.13k • 31
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted-v2 Viewer • Updated Sep 19 • 6.13k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_instruct Viewer • Updated Sep 27 • 100k • 38 • 1
tayyibsupercool/resource_allocation_telecom_energy_efficiency_instruct Viewer • Updated Sep 27 • 100k • 36 • 1
DLI-Lab/Mind2Web-cleaned-lite-acctree-value-model-w-cot-formatted Viewer • Updated Sep 26 • 6.13k • 31
JiaweiGuo123/Alpaca-gpt4-English-with-gsm8k-semantic-similarity Viewer • Updated Oct 2 • 52k • 32
aamina/channel_gains_vs_tx_powers_ee_augmented_with_context_10k Viewer • Updated Oct 4 • 10k • 33
Self-GRIT/open-hermes-2.5-sft-llama3-inference-query-reformulation-tokens Viewer • Updated Oct 4 • 33.3k • 33
aamina/channel_gains_vs_tx_powers_ee_augmented_with_30_examples_context_10k Viewer • Updated Oct 5 • 10k • 40
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 9 • 52k • 38
zyusc/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 10 • 52k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_3_users_instruct Viewer • Updated Oct 13 • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_3_users_instruct Viewer • Updated Oct 13 • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 30
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-optimize Viewer • Updated Oct 10 • 802 • 37
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-code-sementic-similarity Viewer • Updated Oct 10 • 802 • 42
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-without-comment Viewer • Updated Oct 11 • 802 • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_5_instruct Viewer • Updated Oct 11 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_5_instruct Viewer • Updated Oct 11 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct Viewer • Updated Oct 12 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct Viewer • Updated Oct 12 • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct Viewer • Updated Oct 12 • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct Viewer • Updated Oct 12 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct Viewer • Updated Oct 12 • 1.25k • 43
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct Viewer • Updated Oct 12 • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct Viewer • Updated Oct 12 • 1.25k • 37
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct Viewer • Updated Oct 12 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct Viewer • Updated Oct 12 • 1.25k • 37
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct Viewer • Updated Oct 12 • 1.25k • 40
aamina/channel_gains_vs_tx_powers_ee_augmented_with_300_examples_context Viewer • Updated Oct 13 • 10k • 49
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_500_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_500_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_30_area_instruct Viewer • Updated Oct 13 • 12.5k • 31
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_30_area_instruct Viewer • Updated Oct 13 • 12.5k • 32
aamina/channel_gains_vs_tx_powers_ee_augmented_with_100_examples_context Viewer • Updated Oct 13 • 10k • 40
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_150_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_150_instruct Viewer • Updated Oct 13 • 12.5k • 31
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_250_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_250_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_350_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_350_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_10k Viewer • Updated 23 days ago • 12.5k • 38
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_10k Viewer • Updated 23 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_10k Viewer • Updated 23 days ago • 12.5k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_10k Viewer • Updated 23 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_10k Viewer • Updated 23 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_10k Viewer • Updated 23 days ago • 12.5k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_10k Viewer • Updated 23 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_10k Viewer • Updated 23 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_10k Viewer • Updated 23 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_10k Viewer • Updated 23 days ago • 12.5k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_10k Viewer • Updated 23 days ago • 12.5k • 42
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_10k Viewer • Updated 23 days ago • 12.5k • 37
aamina/channel_gains_vs_tx_powers_se_augmented_with_300_examples_context Viewer • Updated about 1 month ago • 10k • 49
aamina/channel_gains_vs_tx_powers_se_augmented_with_30_examples_context_10k Viewer • Updated 28 days ago • 10k • 53
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_1k Viewer • Updated 25 days ago • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_1k Viewer • Updated 25 days ago • 1.25k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_1k Viewer • Updated 25 days ago • 1.25k • 36
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_1k Viewer • Updated 25 days ago • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_1k Viewer • Updated 25 days ago • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_1k Viewer • Updated 25 days ago • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_1k Viewer • Updated 25 days ago • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_1k Viewer • Updated 25 days ago • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_1k Viewer • Updated 25 days ago • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_1k Viewer • Updated 25 days ago • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_1k Viewer • Updated 25 days ago • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_1k Viewer • Updated 25 days ago • 1.25k • 34
MakiAi/OKU_wiki_llama3.1_8b_inst_Reflexive_chunk200_overlap700 Viewer • Updated 16 days ago • 703 • 25
antash420/long-context-text-summarization-alpaca-format Viewer • Updated 13 days ago • 216k • 69
Gramacho/complete_pira_train_val_corpus1_ptbr_llama3_alpaca_1484 Viewer • Updated 11 days ago • 1.48k • 63
namejun12000/AW_finetuning_5core_split1_all_final_valid Viewer • Updated 9 days ago • 22.4k • 56
Gramacho/complete_pira_test_corpus1_ptbr_llama3_alpaca_181 Viewer • Updated 11 days ago • 181 • 27
namejun12000/AW_finetuning_5core_try1_all_final_valid_include Viewer • Updated 8 days ago • 22.4k • 31
namejun12000/AW_finetuning_5core_split1_all_final_valid_include Viewer • Updated 8 days ago • 22.4k • 99
namejun12000/AW_finetuning_5core_split1_all_final_final Viewer • Updated 9 days ago • 22.4k • 22
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_50 Viewer • Updated 8 days ago • 22.4k • 80
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_10 Viewer • Updated 8 days ago • 22.4k • 48
mlfoundations-dev/unnatural_instructions_gpt-4o-mini_test Viewer • Updated 3 days ago • 100 • 36
zsj999999999/llama3_medical_meadow_wikidoc_instruct_dataset Viewer • Updated 4 days ago • 10k • 11
sert121/synthetic_data_textual_leavingT_Q_W_O_V_U_X Viewer • Updated about 22 hours ago • 9.54k • 6
sert121/synthetic_data_textual_leaving_T_Q_W_O_V_U_X Viewer • Updated about 22 hours ago • 9.54k • 5