Jack Morris
commited on
Commit
·
b74493f
1
Parent(s):
a8d6b4e
update again
Browse files
README.md
CHANGED
@@ -8650,6 +8650,12 @@ model-index:
|
|
8650 |
|
8651 |
# Contextual Document Embeddings (CDE)
|
8652 |
|
|
|
|
|
|
|
|
|
|
|
|
|
8653 |
<a href="github.com/jxmorris12/cde">Github</a>
|
8654 |
|
8655 |
Our new model that naturally integrates "context tokens" into the embedding process. As of January 13th, 2025, `cde-small-v2` is the best small model (under 400M params) on the [MTEB leaderboard](https://huggingface.co/spaces/mteb/leaderboard) for text embedding models, with an average score of 65.58.
|
|
|
8650 |
|
8651 |
# Contextual Document Embeddings (CDE)
|
8652 |
|
8653 |
+
<div style="background-color: #f8f9fa; border-left: 6px solid #007bff; padding: 10px 20px; margin: 20px; font-family: Arial, sans-serif; line-height: 1.6;">
|
8654 |
+
<p><strong>Note on parameter count: </strong>Although HuggingFace reports the size of this model as 281M params, it's really closer to 140M. That's because our weights actually contain the weights of two models (dubbed "first stage" and "second stage"), and only the second-stage model is used to compute embeddings at search time.</p>
|
8655 |
+
</div>
|
8656 |
+
|
8657 |
+
**Note on parameter count**:
|
8658 |
+
|
8659 |
<a href="github.com/jxmorris12/cde">Github</a>
|
8660 |
|
8661 |
Our new model that naturally integrates "context tokens" into the embedding process. As of January 13th, 2025, `cde-small-v2` is the best small model (under 400M params) on the [MTEB leaderboard](https://huggingface.co/spaces/mteb/leaderboard) for text embedding models, with an average score of 65.58.
|