Spaces: bpHigh / financial-task-env
main
financial-task-env
322 MB
4 contributors
History:
36 commits
bpHigh
Add blog and final data
18d1157
29 days ago
data
Add SFT corpus + Kimi-K2.5 teacher run + Kimi eval run
29 days ago
data_pipeline
SFT run #2: 8K-context Qwen2.5-Coder-3B (qwen3b-office-sft-kimi-long)
29 days ago
graders
Phase 7: close the 'submit source unchanged' exploit Kimi-K2.5 found
30 days ago
runs
SFT eval on 22-task held-out split - fill in leaderboard
29 days ago
server
SFT eval on 22-task held-out split - fill in leaderboard
29 days ago
.dockerignore
87 Bytes
Financial Task Environment - code execution with real xlsx
about 2 months ago
.gitattributes
256 Bytes
Round 2 README, Qwen2.5-Coder-3B baseline, missing data_pipeline pullers
30 days ago
.gitignore
4.79 kB
Add SFT corpus + Kimi-K2.5 teacher run + Kimi eval run
29 days ago
Blog.md
14.1 kB
Add blog and final data
29 days ago
Dockerfile
1.14 kB
Phase 11.5: Gradio dashboard at /dashboard (now the Space's base_path)
29 days ago
LICENSE
1.07 kB
Initial commit
about 2 months ago
README.md
19 kB
SFT eval on 22-task held-out split - fill in leaderboard
29 days ago
__init__.py
391 Bytes
Financial Task Environment - code execution with real xlsx
about 2 months ago
client.py
2.08 kB
Fix client._parse_result to unwrap {observation,reward,done} payload
29 days ago
edits.md
57.5 kB
GRPO Phase 13: custom rollout_func for markdown JSON tool calls
29 days ago
eval_lora.py
20.8 kB
eval_lora: fix truncation drop-direction bug + add subprocess preflight
29 days ago
inference.py
30 kB
Phase 9.1: --skip-completed flag for cheap re-runs
29 days ago
models.py
1.85 kB
Financial Task Environment - code execution with real xlsx
about 2 months ago
openenv-grpo.ipynb
7.64 MB
Add blog and final data
29 days ago
openenv.yaml
58 kB
Attribute hand-curated Round-1 tasks to Finch + list ALL 119 tasks in openenv.yaml
29 days ago
pyproject.toml
775 Bytes
Phase 11.5: Gradio dashboard at /dashboard (now the Space's base_path)
29 days ago
rewards.py
11.9 kB
Add extended arena stuff
30 days ago
tasks.py
15.7 kB
Add extended arena stuff
30 days ago
train_grpo.py
23 kB
Add blog and final data
29 days ago
train_sft.py
9.38 kB
train_sft: drop fp16, prefer bf16 (MPS-compatible without grad scaler)
29 days ago
uv.lock
542 kB
Financial Task Environment - code execution with real xlsx
about 2 months ago