What is the context length of this model?

by mstachow - opened Jul 28, 2024

Jul 28, 2024

I can't seem to find details in the model card. What is the context length? Any ideas for how to use it beyond the length?

guychuk

Aug 3, 2024

@mstachow usually you can find it using
max_tokens = tokenizer.model_max_length

armorerlabs

16 days ago

For BERT/DistilBERT-style prompt-injection classifiers the practical ceiling is usually the tokenizer/model max length, commonly 512 tokens. You can confirm with tokenizer.model_max_length, as noted above.

For longer inputs, I would avoid head-only truncation. The failure mode is that an injection appended after benign content disappears before classification. A safer runtime pattern is:

split into overlapping windows near the model max length
score every window
aggregate with max-risk / any-risk semantics
keep the triggering span or window in the result so the caller can explain why it blocked

If this is going into a tool-calling agent, it also helps to scan by surface: retrieved content, model output, tool-call args, and outbound payloads should not necessarily share the same threshold. We are taking that staged approach in Armorer Guard as a fast local pre-tool-call gate: https://huggingface.co/armorer-labs/armorer-guard-semantic-classifier

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment