You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We were eagerly loading the full dataset into GPU instead of letting CAGRA manage streaming it to GPU.
We were passing full datasets to createMultiLayerHnswGraph which crashed my java heap space when benchmarking 10M 1536d (~60GB) dataset; now we can stream manageable subsets instead.
This fixes two issues:
We were eagerly loading the full dataset into GPU instead of letting CAGRA manage streaming it to GPU.
We were passing full datasets to createMultiLayerHnswGraph which crashed my java heap space when benchmarking 10M 1536d (~60GB) dataset; now we can stream manageable subsets instead.
PR: #141