Fin-R1: a Financial Reasoning LLM with Reinforcement Learning and CoT
Introduction
Fin-R1 is a recent model fine-tuned specifically for financial reasoning. Its authors report performance that rivals or beats much larger models such as DeepSeek-R1 on some financial reasoning benchmarks.
This post runs Fin-R1 and compares it with Phi-3 across a range of tasks.
- Phi-3 for comparison
Phi-3 is a lightweight, general-purpose model known for its efficiency and strong reasoning performance at small parameter scales. It serves as a useful baseline for assessing how the domain-specific tuning in Fin-R1 improves financial understanding and response structure.
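To make the comparison concrete, here is a minimal sketch of a side-by-side harness: it sends the same prompts to each model and collects the answers in one row per prompt. The names `compare_models`, `fake_fin_r1`, and `fake_phi3` are hypothetical and not part of either model's tooling. The two stub functions only echo the prompt, so they would need to be replaced with real calls to whichever runtime serves Fin-R1 and Phi-3.

```python
from typing import Callable, Dict, List

# A "model" here is any function that maps a prompt string to a response string.
ModelFn = Callable[[str], str]


def compare_models(models: Dict[str, ModelFn], prompts: List[str]) -> List[Dict[str, str]]:
    """Run every prompt through every model and collect the answers side by side."""
    rows = []
    for prompt in prompts:
        row = {"prompt": prompt}
        for name, generate in models.items():
            row[name] = generate(prompt)
        rows.append(row)
    return rows


# Hypothetical stand-ins: they only echo the prompt.
# Replace them with real inference calls for Fin-R1 and Phi-3.
def fake_fin_r1(prompt: str) -> str:
    return f"[Fin-R1 stub] {prompt}"


def fake_phi3(prompt: str) -> str:
    return f"[Phi-3 stub] {prompt}"


if __name__ == "__main__":
    models = {"fin-r1": fake_fin_r1, "phi3": fake_phi3}
    prompts = ["What is EBITDA?", "Explain the difference between a bond's coupon and its yield."]
    for row in compare_models(models, prompts):
        print(row["prompt"])
        for name in models:
            print(f"  {name}: {row[name]}")
```

Keeping each model behind a plain `prompt -> response` function means the same loop works regardless of how each model is actually served.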
