LoRA adapter for kshitijthakkar/deepseek-v4-mini-300M-from-flash

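This is a LoRA adapter trained with supervised fine-tuning (SFT) on top of kshitijthakkar/deepseek-v4-mini-300M-from-flash. Load it with: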

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model; trust_remote_code=True lets the code in its repository run.
base = AutoModelForCausalLM.from_pretrained(
    "kshitijthakkar/deepseek-v4-mini-300M-from-flash", trust_remote_code=True
)
# Attach the LoRA adapter weights to the base model.
model = PeftModel.from_pretrained(
    base, "kshitijthakkar/deepseek-v4-mini-300M-from-flash-sft-test-lora"
)
# The tokenizer comes from the base model repository.
tokenizer = AutoTokenizer.from_pretrained(
    "kshitijthakkar/deepseek-v4-mini-300M-from-flash", trust_remote_code=True
)
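Once the adapter is attached, generation might look like the minimal sketch below. It is an illustration under several assumptions, not a documented recipe for this model: the helper name `generate_text` and the constants `BASE_ID` and `ADAPTER_ID` are hypothetical, the prompt is passed as plain text (no chat template is assumed), and the base model's custom code is assumed to support the standard `generate` method.

```python
# Repository IDs taken from the loading snippet above.
BASE_ID = "kshitijthakkar/deepseek-v4-mini-300M-from-flash"
ADAPTER_ID = "kshitijthakkar/deepseek-v4-mini-300M-from-flash-sft-test-lora"


def generate_text(prompt: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: load base model + adapter and complete a prompt."""
    # Heavy dependencies are imported lazily, so defining this helper
    # does not require them to be loaded.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(BASE_ID, trust_remote_code=True)
    base = AutoModelForCausalLM.from_pretrained(BASE_ID, trust_remote_code=True)
    model = PeftModel.from_pretrained(base, ADAPTER_ID)
    model.eval()  # inference mode: disables dropout

    # Assumption: a plain-text prompt; no chat template is applied.
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights on first use):
# print(generate_text("Hello"))
```

The helper reloads the weights on every call to keep the sketch short; for repeated use, loading the model and tokenizer once and reusing them would be the more sensible design.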