Base GPT-2 XL Better SFT — HF-Only Assistant Coherence Continuation

Continuation checkpoint trained from Darkstorm1826/base-gpt2xl-better-sft-experimental using a JAX/Optax TPU full fine-tune pass.

This experimental continuation was trained on /kaggle/working/distilled.jsonl, a dataset built only from Hugging Face datasets.

Focus:

Prompt format:

<|system|> You are a helpful, honest, and careful AI assistant.<|end|> <|user|> Question here<|end|> <|assistant|> Answer here<|end|>
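A minimal sketch of how the prompt format above might be applied at inference time. It assumes the tags are separated by single spaces exactly as shown (the original layout may have used line breaks), that generation should start right after an open `<|assistant|>` tag, and that the checkpoint loads through the standard `transformers` causal-LM classes. The helper names `build_prompt`, `extract_answer`, and `generate` are hypothetical, as are the default repo id and decoding settings.

```python
DEFAULT_SYSTEM = "You are a helpful, honest, and careful AI assistant."


def build_prompt(user_message: str, system_message: str = DEFAULT_SYSTEM) -> str:
    """Format one turn and leave the assistant slot open for generation."""
    return (
        f"<|system|> {system_message}<|end|> "
        f"<|user|> {user_message}<|end|> "
        "<|assistant|>"
    )


def extract_answer(completion: str) -> str:
    """Keep only the text before the first <|end|> marker in the completion."""
    return completion.split("<|end|>", 1)[0].strip()


def generate(
    user_message: str,
    repo_id: str = "Darkstorm1826/base-gpt2xl-better-sft-experimental",  # assumed repo id
    max_new_tokens: int = 128,  # assumed value, not from the model card
) -> str:
    """Load the checkpoint and answer one question with greedy decoding."""
    # Imported lazily so the formatting helpers work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Decode only the newly generated tokens, keeping any marker tokens
    # so that extract_answer can cut at <|end|>.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    completion = tokenizer.decode(new_tokens, skip_special_tokens=False)
    return extract_answer(completion)
```

Calling `generate("Question here")` downloads the weights on first use. Cutting the completion at the first `<|end|>` is a design choice for the case where the model keeps writing further turns after its answer.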

Safetensors

Model size: 2B params

Tensor type: F32
