Correcting Mean Bias in Text Embeddings: A Refined Renormalization with Training-Free Improvements on MMTEB
We find that current text embedding models produce outputs with a consistent bias, i.e., each embedding vector e can be decomposed as tilde{e} + μ , where μ is almost identical across all sentences. We propose a plug-and-play, training-free and


























