You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Deeplm โ€” Auto-upload (step 200)

Model checkpoint otomatis dari training loop. Setiap 100 langkah, seluruh state dikirim ke HF.

Training Metrics

  • step: 200
  • loss: 2.6842
  • lr: 0.0000
  • grad_norm: 0.2827
  • phase:
  • state:
  • confidence: 0

Charts

grad_norm.png

loss.png

loss_analysis.png

lr.png

Included Files (setiap upload)

File Deskripsi
model.safetensors BitNet ternary weights
config.json Model configuration
tokenizer.json BPETokenizer
tokenizer_config.json Tokenizer config
checkpoint-200/ Full checkpoint (model.pt, optimizer.pt, training_state.json)
charts/ Training visualization PNGs
metrics.jsonl Full training log (all steps)
tuner_state.json AutoTuner internal state
README.md This file

Penggunaan

import sys; sys.path.insert(0, 'deeplm')
from deeplm.config import DeeplmConfig
from deeplm.model.deeplm import DeeplmModel
from safetensors.torch import load_file

config = DeeplmConfig()
model = DeeplmModel(config)
state_dict = load_file('model.safetensors')
model.load_state_dict(state_dict, strict=False)
Downloads last month
4,033
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support