Investigating Intra-Abstraction Policies For Non-exact Abstraction Algorithms

Reading time: 2 minute
...

📝 Original Info

  • Title: Investigating Intra-Abstraction Policies For Non-exact Abstraction Algorithms
  • ArXiv ID: 2510.24297
  • Date: 2025-10-28
  • Authors: 정보 없음 (제공되지 않음)

📝 Abstract

One weakness of Monte Carlo Tree Search (MCTS) is its sample efficiency which can be addressed by building and using state and/or action abstractions in parallel to the tree search such that information can be shared among nodes of the same layer. The primary usage of abstractions for MCTS is to enhance the Upper Confidence Bound (UCB) value during the tree policy by aggregating visits and returns of an abstract node. However, this direct usage of abstractions does not take the case into account where multiple actions with the same parent might be in the same abstract node, as these would then all have the same UCB value, thus requiring a tiebreak rule. In state-of-the-art abstraction algorithms such as pruned On the Go Abstractions (pruned OGA), this case has not been noticed, and a random tiebreak rule was implicitly chosen. In this paper, we propose and empirically evaluate several alternative intra-abstraction policies, several of which outperform the random policy across a majority of environments and parameter settings.

💡 Deep Analysis

Figure 1

📄 Full Content

📸 Image Gallery

1000its_scores.png 100its_scores.png 200its_scores.png 500its_scores.png abs_dropping.png asap_example.png intra_eps_comp_improv_eps.png intra_eps_comp_pairings_eps.png intra_pruned_comp_improv_pruned.png intra_pruned_comp_pairings_pruned.png intra_random_comp_improv_random.png intra_random_comp_pairings_random.png intraabs_legend2.png intraabs_optimized_aa.png intraabs_optimized_ct.png intraabs_optimized_eo.png intraabs_optimized_gol.png intraabs_optimized_man.png intraabs_optimized_navigation.png intraabs_optimized_recon.png intraabs_optimized_sa.png intraabs_optimized_saving.png intraabs_optimized_st.png intraabs_optimized_sw_15.png intraabs_optimized_tam.png intraabs_optimized_tr.png intraabs_optimized_trt.png its_scores.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut