Type: Task
Resolution: Unresolved
Priority: Unknown
Fix Version/s: None
Affects Version/s: None
Component/s: AI/ML, LangChain
Labels:
None

Quarter:
- FY27Q2-candidate
Confidence Status:
None

Assigned Teams:

Python Drivers

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Link:
None
Goal Name(s):
None

Context

Describe the background behind the problem.

All of our existing work simply stores embedding vectors as lists of Python floats. These are not contiguous in memory or on disk. They are also doubles (float64), whereas Lucene uses only float32, so each value in the list is downcast every time we perform a search.
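To make the size and precision difference concrete, here is a minimal sketch using only the standard library. The four-element `embedding` is a made-up example (real model outputs are much longer), and `struct` stands in for whatever encoder the target library ends up using.

```python
import struct
import sys

# Hypothetical 4-dimensional embedding, for illustration only.
embedding = [0.1, -0.25, 0.3333333333333333, 1e-3]
n = len(embedding)

# A Python list stores pointers to separately allocated float objects,
# each of which wraps a float64.
list_bytes = sys.getsizeof(embedding) + sum(sys.getsizeof(x) for x in embedding)

# Packed little-endian buffers: 8 bytes per float64, 4 bytes per float32.
packed_f64 = struct.pack(f"<{n}d", *embedding)
packed_f32 = struct.pack(f"<{n}f", *embedding)

# Round-tripping through float32 shows what the downcast discards.
as_f32 = list(struct.unpack(f"<{n}f", packed_f32))

print(len(packed_f64), len(packed_f32))  # 32 16
print(list_bytes > len(packed_f64))      # True
print(as_f32[0] == embedding[0])         # False: 0.1 is not exact in float32
print(as_f32[1] == embedding[1])         # True: -0.25 is exact in both widths
```

The packed float32 buffer is half the size of the packed float64 buffer, and both are far smaller than the list plus its float objects.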

Definition of done

What must be done to consider the task complete?

The target library is pymongo-search-utils.

Search and inventory our implementations of embedding vectors. Implement encoding of embedding-model outputs as binary vectors. Run benchmarks comparing the binary encoding with the current lists of Python floats.
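One possible shape for the encoding step is sketched below. The helper names `encode_float32` and `decode_float32` are hypothetical and do not exist in pymongo-search-utils. The sketch only produces a raw little-endian float32 payload, not a BSON value. If "binary vectors" here means BSON binary vectors, PyMongo's `bson.binary.Binary.from_vector` would likely be the call to wrap instead, but that is an assumption about the intended design.

```python
import sys
from array import array


def encode_float32(values):
    """Pack an iterable of Python floats into contiguous float32 bytes.

    Hypothetical helper: each float64 is downcast to float32 once, here,
    rather than on every search.
    """
    buf = array("f", values)
    if sys.byteorder == "big":
        buf.byteswap()  # normalize to little-endian on big-endian hosts
    return buf.tobytes()


def decode_float32(payload):
    """Inverse of encode_float32: bytes back to a list of Python floats."""
    buf = array("f")
    buf.frombytes(payload)
    if sys.byteorder == "big":
        buf.byteswap()
    return buf.tolist()


payload = encode_float32([0.5, -1.0, 2.0])
print(len(payload))             # 12
print(decode_float32(payload))  # [0.5, -1.0, 2.0]
```

The values in the usage lines are exactly representable in float32, so the round trip is lossless. Arbitrary embedding values would come back rounded to float32 precision.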

Pitfalls

Python is slow. The benefits of contiguous float32 storage may be offset by the cost of encoding the floats in Python.
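A rough way to measure that encoding cost is sketched below with `timeit`. The function name `time_encoding`, the default dimension of 1536, and the repeat count are assumptions chosen for illustration. A real benchmark would also need to time insertion and search end to end.

```python
import random
import timeit
from array import array


def encode_float32(values):
    """Hypothetical encoder: list of Python floats to float32 bytes."""
    return array("f", values).tobytes()


def time_encoding(dim=1536, repeats=200):
    """Return mean seconds per call to encode one vector of `dim` floats."""
    rng = random.Random(0)  # fixed seed so runs use the same input
    vector = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    total = timeit.timeit(lambda: encode_float32(vector), number=repeats)
    return total / repeats


per_call = time_encoding()
print(f"{per_call * 1e6:.1f} microseconds per vector")
```

Comparing this per-vector figure with the time saved per search would show whether the encoding overhead is actually a concern.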

Assignee: Unassigned
Reporter: Casey Clements
Votes: 0
Watchers: 3

Created: Jul 15 2025 08:40:39 PM UTC
Updated: Feb 09 2026 03:09:52 PM UTC
