arxiv:2309.17382
Shenao Zhang
ShenaoZhang
AI & ML interests
None yet
Organizations
None yet
models 123
ShenaoZhang/0.01_version_debug_iter_1
Text Generation • 7B • Updated
• 1
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated
• 1
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_3
Text Generation • 7B • Updated
• 3
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_2
Text Generation • 7B • Updated
• 5
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_1
Text Generation • 7B • Updated
• 7
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated
• 1
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_3
Text Generation • 7B • Updated
• 5
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_2
Text Generation • 7B • Updated
• 6
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated
• 7
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_1
Text Generation • 7B • Updated
• 4
datasets 37
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 9
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 7
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 9
ShenaoZhang/0.0_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 2k • 6
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated
• 46.8k • 7
ShenaoZhang/0.0_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 44.8k • 9
ShenaoZhang/0.01_4iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated
• 34.6k • 8
ShenaoZhang/0.001_2iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 65.1k • 16
ShenaoZhang/0.01_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 67.1k • 28
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 67.1k • 16