TAUR-dev/testing__pvv2_lora • 2B • Updated • 5
TAUR-dev/testing__lf_pvv2_resume • 2B • Updated • 6
TAUR-dev/M-1023_longmult__0epoch_longmult3dig-rl • 2B • Updated • 4
TAUR-dev/M-1022_longcontext__maxlen8192_1e_3args-rl • Updated
TAUR-dev/M-1022_longcontext__maxlen8192_1e_3and4arg-rl • Updated
TAUR-dev/sf_pvv2_cd3arg_10resps_sft • 2B • Updated • 5
TAUR-dev/M-1022_longcontext__maxlen8192_0epoch_3args-rl • Updated
TAUR-dev/pvv2_longmult3dig_sft • 2B • Updated • 4
TAUR-dev/M-1022_longcontext__maxlen4096_1e_3args-rl • Updated
TAUR-dev/M-1022_longcontext__maxlen4096_0epoch_3args-rl • Updated
TAUR-dev/M-1022_longcontext__maxlen8192_0epoch_3and4arg-rl • Updated
TAUR-dev/M-1022_longcontext__maxlen4096_0epoch_3and4arg-rl • Updated
TAUR-dev/M-1022_longcontext__maxlen4096_1e_3and4arg-rl • Updated
TAUR-dev/testing_llamafactory_helper_quick_test • 0.5B • Updated • 12
TAUR-dev/testing_llamafactory_helper_quick_test__interactive • 0.5B • Updated • 5
TAUR-dev/testing_llamafactory_helper_quick_test__local • 0.5B • Updated • 5
TAUR-dev/testing_llamafactory_helper_quick_test1 • 0.5B • Updated • 2
TAUR-dev/qwen25_vl_7b_element_lookup_01format_09_coordinate_02reflect_thrsh20_no_feedback_10_20 • Updated
TAUR-dev/M-rl_ours_AT_fixed-rl • Updated
TAUR-dev/M-sft_exp_AT_pvv2__fixed-sft • 2B • Updated • 2
TAUR-dev/M-rl_1e_v2__pv__4ominireflections-rl • 2B • Updated • 5
TAUR-dev/M-sft_exp_pvv2__gpt4ominiref-sft • 2B • Updated • 5
TAUR-dev/M-rl_1e_v2__pv_v2__32k-rl • 2B • Updated • 3
TAUR-dev/M-rl_rlonly__32k-rl • Updated
TAUR-dev/M-0921__zayne1_alltask1_grpo-rl • 2B • Updated • 4
TAUR-dev/M-0921__zayne1_alltask2_grpo-rl • 2B • Updated • 3
TAUR-dev/M-R1_distilled_baseline_cd3args_only • 2B • Updated • 2
TAUR-dev/M-0921__zayne1_alltask1_grpo_resume-rl • Updated
TAUR-dev/M-0921__zayne1_alltask2_grpo_resume-rl • Updated