Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rameyjm7
/
llm-preference-unlearning
like
1
Transformers
unlearning
alignment
large-language-models
qwen2.5
lora
fine-tuning
safety
preference-modeling
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llm-preference-unlearning
/
notebooks
1 MB
1 contributor
History:
1 commit
rameyjm7
Made one unified unlearning notebook activation_based_unlearning.ipynb
c0f546b
12 days ago
datasets
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
00_recommender.ipynb
Safe
12.5 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
01_activation_probe.ipynb
Safe
7.46 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
02_activation_overlap.ipynb
Safe
146 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
03_saliency_maps.ipynb
Safe
105 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
04_gradient_analysis.ipynb
Safe
231 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
05_fisher_information.ipynb
Safe
214 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
06_drift_analysis.ipynb
Safe
132 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
07_activation_unlearning.ipynb
Safe
95.1 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
08_activation_guided_masked_lora_unlearning.ipynb
Safe
37.2 kB
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago
qwen_unlearn.yaml
Safe
548 Bytes
Made one unified unlearning notebook activation_based_unlearning.ipynb
12 days ago