OPTML-Group
/

SimNPO-TOFU-forget10-Llama-2-7b-chat

Text Generation

machine-unlearning

large-language-models

trustworthy-machine-learning

text-generation-inference

Model card Files Files and versions

a-F1 commited on Oct 28, 2024

Commit

b29fb81

·

verified ·

1 Parent(s): 2516497

Update README.md

Files changed (1) hide show

README.md +47 -3

README.md CHANGED Viewed

@@ -1,3 +1,47 @@
----
-license: mit
----

+---
+license: mit
+---
+# LLaMA-2-chat 7B unlearned using SimNPO on TOFU Forget10
+## Model Details
+- **Base Model**: LLaMA-2-chat 7B
+- **Training**: Fine-tuned on TOFU dataset
+- **Unlearning**: SimNPO on TOFU Forget10
+## Unlearning Algorithm
+This model uses the `SimNPO` unlearning algorithm with the following parameters:
+- Learning Rate: `1e-5`
+- beta: `4.5`
+- lambda: `0.125`
+- gamma: `0.0`
+## Loading the Model
+```python
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("OPTML-Group/SimNPO-TOFU-forget10-Llama-2-7b-chat", use_flash_attention_2=True, torch_dtype=torch.bfloat16, trust_remote_code=True)
+```
+## Citation
+If you use this model in your research, please cite:
+```
+@misc{fan2024simplicityprevailsrethinkingnegative,
+      title={Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning},
+      author={Chongyu Fan and Jiancheng Liu and Licong Lin and Jinghan Jia and Ruiqi Zhang and Song Mei and Sijia Liu},
+      year={2024},
+      eprint={2410.07163},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2410.07163},
+}
+```
+## Contact
+For questions or issues regarding this model, please contact [email protected].