Error-Driven Prompt Optimization for Arithmetic Reasoning
Reading time: 1 minute
...
📝 Original Info
- Title: Error-Driven Prompt Optimization for Arithmetic Reasoning
- ArXiv ID: 2512.13323
- Date: 2025-12-15
- Authors: ** 논문에 저자 정보가 제공되지 않았습니다. **
📝 Abstract
Recent advancements in artificial intelligence have sparked interest in industrial agents capable of supporting analysts in regulated sectors, such as finance and healthcare, within tabular data workflows. A key capability for such systems is performing accurate arithmetic operations on structured data while ensuring sensitive information never leaves secure, on-premises environments. Here, we introduce an error-driven optimization framework for arithmetic reasoning that enhances a Code Generation Agent (CGA), specifically applied to on-premises small language models (SLMs). Through a systematic evaluation of a leading SLM (Qwen3 4B), we find that while the base model exhibits fundamental limitations in arithmetic tasks, our proposed error-driven method, which clusters erroneous predictions to refine prompt-rules iteratively, dramatically improves performance, elevating the model's accuracy to 70.8\%. Our results suggest that developing reliable, interpretable, and industrially deployable AI assistants can be achieved not only through costly fine-tuning but also via systematic, error-driven prompt optimization, enabling small models to surpass larger language models (GPT-3.5 Turbo) in a privacy-compliant manner.💡 Deep Analysis
📄 Full Content
Reference
This content is AI-processed based on open access ArXiv data.