Re-FORC: Adaptive Reward Prediction for Efficient Chain-of-Thought Reasoning

Reading time: 1 minute
...

📝 Original Info

  • Title: Re-FORC: Adaptive Reward Prediction for Efficient Chain-of-Thought Reasoning
  • ArXiv ID: 2511.02130
  • Date: 2025-11-03
  • Authors: 정보가 제공되지 않음

📝 Abstract

None

💡 Deep Analysis

Figure 1

📄 Full Content

📸 Image Gallery

combined_fig_core_cumulative.png early_stopping_1.7B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png early_stopping_4B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png early_stopping_8B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png early_stopping_by_model_size_5tick.png fig_core_cumulative.png modeling_selection_with_frequency.png routing_costaxis_AIME2024_3way.png routing_costaxis_AIME2025_3way.png routing_costaxis_AMC2024_3way.png routing_costaxis_MATH500_3way.png routing_costaxis_Minerva_3way.png test_time_scaling_all_three_5tick.png token_usage_1.7B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png token_usage_4B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png token_usage_8B_Minerva_MATH500_AMC2024_AIME2024_AIME2025.png token_useage_across_models.png tts_individual_dataset_1_7b.png tts_individual_dataset_4b.png tts_individual_dataset_8b.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut