SmolLM-135M: Base vs SFT vs SFT+DPO
Compare outputs across the full fine-tuning pipeline. Built for IBA NLP Assignment 04.
Your Prompt
Compare All 3 Models
Base Model (no tuning)
SFT Only (Trial 3)
SFT + DPO (Final)
Examples
What causes seasons on Earth?
What is the capital of Australia?
Explain photosynthesis briefly.
Give me 3 tips to reduce plastic waste.
What are the planets in our solar system?