Towards a Measure of Algorithm Similarity

Reading time: 1 minute
...

📝 Original Info

  • Title: Towards a Measure of Algorithm Similarity
  • ArXiv ID: 2510.27063
  • Date: 2025-10-31
  • Authors: 논문에 명시된 저자 정보가 제공되지 않았습니다.

📝 Abstract

Given two algorithms for the same problem, can we determine whether they are meaningfully different? In full generality, the question is uncomputable, and empirically it is muddied by competing notions of similarity. Yet, in many applications (such as clone detection or program synthesis) a pragmatic and consistent similarity metric is necessary. We review existing equivalence and similarity notions and introduce EMOC: An Evaluation-Memory-Operations-Complexity framework that embeds algorithm implementations into a feature space suitable for downstream tasks. We compile PACD, a curated dataset of verified Python implementations across three problems, and show that EMOC features support clustering and classification of algorithm types, detection of near-duplicates, and quantification of diversity in LLM-generated programs. Code, data, and utilities for computing EMOC embeddings are released to facilitate reproducibility and future work on algorithm similarity.

💡 Deep Analysis

Figure 1

📄 Full Content

📸 Image Gallery

clustering_reuslts.png model_size_v_ops_used.png model_size_vs_m_term_c_term.png orig_v_modified_prompt.png temp_comparison.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut