bpHigh/financial-task-env

322 MB

4 contributors

History: 36 commits

Latest commit: Add blog and final data (18d1157, 3 months ago)

Name                Size        Updated        Last commit
data                -           3 months ago   Add SFT corpus + Kimi-K2.5 teacher run + Kimi eval run
data_pipeline       -           3 months ago   SFT run #2: 8K-context Qwen2.5-Coder-3B (qwen3b-office-sft-kimi-long)
graders             -           3 months ago   Phase 7: close the 'submit source unchanged' exploit Kimi-K2.5 found
runs                -           3 months ago   SFT eval on 22-task held-out split — fill in leaderboard
server              -           3 months ago   SFT eval on 22-task held-out split — fill in leaderboard
.dockerignore       87 Bytes    3 months ago   Financial Task Environment — code execution with real xlsx
.gitattributes      256 Bytes   3 months ago   Round 2 README, Qwen2.5-Coder-3B baseline, missing data_pipeline pullers
.gitignore          4.79 kB     3 months ago   Add SFT corpus + Kimi-K2.5 teacher run + Kimi eval run
Blog.md             14.1 kB     3 months ago   Add blog and final data
Dockerfile          1.14 kB     3 months ago   Phase 11.5: Gradio dashboard at /dashboard (now the Space's base_path)
LICENSE             1.07 kB     3 months ago   Initial commit
README.md           19 kB       3 months ago   SFT eval on 22-task held-out split — fill in leaderboard
__init__.py         391 Bytes   3 months ago   Financial Task Environment — code execution with real xlsx
client.py           2.08 kB     3 months ago   Fix client._parse_result to unwrap {observation,reward,done} payload
edits.md            57.5 kB     3 months ago   GRPO Phase 13: custom rollout_func for markdown JSON tool calls
eval_lora.py        20.8 kB     3 months ago   eval_lora: fix truncation drop-direction bug + add subprocess preflight
inference.py        30 kB       3 months ago   Phase 9.1: --skip-completed flag for cheap re-runs
models.py           1.85 kB     3 months ago   Financial Task Environment — code execution with real xlsx
openenv-grpo.ipynb  7.64 MB     3 months ago   Add blog and final data
openenv.yaml        58 kB       3 months ago   Attribute hand-curated Round-1 tasks to Finch + list ALL 119 tasks in openenv.yaml
pyproject.toml      775 Bytes   3 months ago   Phase 11.5: Gradio dashboard at /dashboard (now the Space's base_path)
rewards.py          11.9 kB     3 months ago   Add extended arena stuff
tasks.py            15.7 kB     3 months ago   Add extended arena stuff
train_grpo.py       23 kB       3 months ago   Add blog and final data
train_sft.py        9.38 kB     3 months ago   train_sft: drop fp16, prefer bf16 (MPS-compatible without grad scaler)
uv.lock             542 kB      3 months ago   Financial Task Environment — code execution with real xlsx