EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning

EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning
Method Avg. Tokens Single Hop Multi Hop Temporal Open Domain Overall
GPT-4o-mini backbone
MemoryOS 5.2k 62.43 56.50 37.18 40.28 54.70
Mem0 1.0k 66.71 58.16 55.45 40.62 61.00
MemU 4.0k 72.77 62.41 33.96 46.88 61.15
MemOS 2.5k 81.45 69.15 72.27 60.42 75.87
Zep 1.4k 88.11 71.99 74.45 66.67 81.06
EverMemOS 2.5k 91.08 (3.4%) 86.17 (19.7%) 81.93 (10.0%) 66.67 (0.0%) 86.76 (7.0%)
GPT-4.1-mini backbone
MemoryOS 5.5k 67.30 59.34 42.26 59.03 60.11
Mem0 1.0k 68.97 61.70 58.26 50.00 64.20
MemU 4.0k 74.91 72.34 43.61 54.17 66.67
MemOS 2.5k 85.37 79.43 75.08 64.58 80.76
Zep 1.4k 90.84 81.91 77.26 75.00 85.22
EverMemOS 2.3k 96.67 (6.4%) 91.84 (12.1%) 89.72 (16.1%) 76.04 (1.4%) 93.05 (9.2%)
Method Token SS-User SS-Asst SS-Pref Multi-S Know. Upd Temp. Reas Overall
MemU 0.5k 67.14 19.64 76.67 42.10 41.02 17.29 38.40
Zep 1.6k 92.90 75.00 53.30 47.40 74.40 54.10 63.80
Mem0 1.1k 82.86 26.78 90.00 63.15 66.67 72.18 66.40
MemOS 1.4k 95.71 67.86 96.67 70.67 74.26 77.44 77.80
EverMemOS 2.8k 97.14 ($`\uparrow`$1.5%) 85.71 ($`\uparrow`$14.3%) 93.33 ($`\downarrow`$3.5%) 73.68 ($`\uparrow`$4.3%) 89.74 ($`\uparrow`$20.6%) 77.44 ($`\uparrow`$0.0%) 83.00 ($`\uparrow`$6.7%)