Start with RAG
Retrieval is the right first move here. Build a solid retrieval layer, measure grounding, and only revisit fine-tuning if a concrete behaviour gap remains.
Elige esto cuando
- You're adding knowledge the model doesn't have
- The information changes, or answers must cite sources
- You don't yet have a large, clean training set
Compensaciones
- Longer prompts mean higher per-call cost and latency
- Retrieval quality becomes its own thing to design and measure
- Won't, on its own, change deeply ingrained style or format