EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning
| Method | Avg. Tokens | Single Hop | Multi Hop | Temporal | Open Domain | Overall |
|---|---|---|---|---|---|---|
| GPT-4o-mini backbone | | | | | | |
| MemoryOS | 5.2k | 62.43 | 56.50 | 37.18 | 40.28 | 54.70 |
| Mem0 | 1.0k | 66.71 | 58.16 | 55.45 | 40.62 | 61.00 |
| MemU | 4.0k | 72.77 | 62.41 | 33.96 | 46.88 | 61.15 |
| MemOS | 2.5k | 81.45 | 69.15 | 72.27 | 60.42 | 75.87 |
| Zep | 1.4k | 88.11 | 71.99 | 74.45 | 66.67 | 81.06 |
| EverMemOS | 2.5k | 91.08 (↑3.4%) | 86.17 (↑19.7%) | 81.93 (↑10.0%) | 66.67 (↑0.0%) | 86.76 (↑7.0%) |
| GPT-4.1-mini backbone | | | | | | |
| MemoryOS | 5.5k | 67.30 | 59.34 | 42.26 | 59.03 | 60.11 |
| Mem0 | 1.0k | 68.97 | 61.70 | 58.26 | 50.00 | 64.20 |
| MemU | 4.0k | 74.91 | 72.34 | 43.61 | 54.17 | 66.67 |
| MemOS | 2.5k | 85.37 | 79.43 | 75.08 | 64.58 | 80.76 |
| Zep | 1.4k | 90.84 | 81.91 | 77.26 | 75.00 | 85.22 |
| EverMemOS | 2.3k | 96.67 (↑6.4%) | 91.84 (↑12.1%) | 89.72 (↑16.1%) | 76.04 (↑1.4%) | 93.05 (↑9.2%) |

| Method | Tokens | SS-User | SS-Asst | SS-Pref | Multi-S | Know. Upd. | Temp. Reas. | Overall |
|---|---|---|---|---|---|---|---|---|
| MemU | 0.5k | 67.14 | 19.64 | 76.67 | 42.10 | 41.02 | 17.29 | 38.40 |
| Zep | 1.6k | 92.90 | 75.00 | 53.30 | 47.40 | 74.40 | 54.10 | 63.80 |
| Mem0 | 1.1k | 82.86 | 26.78 | 90.00 | 63.15 | 66.67 | 72.18 | 66.40 |
| MemOS | 1.4k | 95.71 | 67.86 | 96.67 | 70.67 | 74.26 | 77.44 | 77.80 |
| EverMemOS | 2.8k | 97.14 (↑1.5%) | 85.71 (↑14.3%) | 93.33 (↓3.5%) | 73.68 (↑4.3%) | 89.74 (↑20.6%) | 77.44 (↑0.0%) | 83.00 (↑6.7%) |
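
The percentages in parentheses are not defined in this excerpt. They are numerically consistent with the relative change of the EverMemOS score against the strongest baseline in the same column, so the sketch below reproduces one of them under that assumption. The helper name `relative_change` and the `baselines` dictionary are hypothetical, introduced only for illustration; the scores are copied from the Overall column of the GPT-4o-mini block in the first table.

```python
def relative_change(score: float, best_baseline: float) -> float:
    """Percent change of `score` relative to `best_baseline` (assumed definition)."""
    return (score - best_baseline) / best_baseline * 100.0


# Overall scores with the GPT-4o-mini backbone, copied from the first table.
baselines = {
    "MemoryOS": 54.70,
    "Mem0": 61.00,
    "MemU": 61.15,
    "MemOS": 75.87,
    "Zep": 81.06,
}
evermemos = 86.76

# Pick the strongest baseline in this column, then compare against it.
best_name = max(baselines, key=baselines.get)
delta = relative_change(evermemos, baselines[best_name])
print(f"vs. {best_name}: {delta:+.1f}%")  # vs. Zep: +7.0%
```

Under the same assumption, a negative result corresponds to a ↓ entry: for SS-Pref in the second table, `relative_change(93.33, 96.67)` is roughly -3.5, which matches the reported ↓3.5%.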