Cuvs-Lucene-139: Fix GPU OOMs && Java Heap OOMs #141

Open
nvzm123 wants to merge 2 commits into rapidsai:main from nvzm123:cuvslucene-139__zackm

nvzm123 commented Apr 29, 2026

GPU OOM fix:
Replaced CuVSMatrix.deviceBuilder with CuVSMatrix.hostBuilder. deviceBuilder was eagerly loading the full dataset onto the GPU. With hostBuilder, the data stays in host memory and CAGRA decides how best to stream it from host memory to the GPU.
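
To illustrate why keeping the dataset in host memory avoids the OOM, here is a minimal, self-contained sketch. It does not use the cuVS API and makes no claim about how CAGRA streams data internally. `FakeDevice`, `uploadEagerly`, and `streamFromHost` are hypothetical names that simulate a fixed device memory budget, counted in floats.

```java
public class HostStagingSketch {

    /** Hypothetical stand-in for a GPU with a fixed memory budget (in floats). */
    static final class FakeDevice {
        final long capacity;
        long used = 0;
        long peak = 0;

        FakeDevice(long capacity) {
            this.capacity = capacity;
        }

        void allocate(long n) {
            if (used + n > capacity) {
                throw new IllegalStateException("simulated device OOM");
            }
            used += n;
            peak = Math.max(peak, used);
        }

        void free(long n) {
            used -= n;
        }
    }

    /** Eager strategy: the whole dataset must fit on the device at once. */
    static void uploadEagerly(FakeDevice device, float[][] hostRows) {
        long total = 0;
        for (float[] row : hostRows) {
            total += row.length;
        }
        device.allocate(total);
    }

    /**
     * Host-staged strategy: the data stays on the host and at most
     * rowsPerChunk rows are resident on the device at any time.
     * Returns the number of chunks transferred.
     */
    static int streamFromHost(FakeDevice device, float[][] hostRows, int rowsPerChunk) {
        int chunks = 0;
        for (int start = 0; start < hostRows.length; start += rowsPerChunk) {
            int end = Math.min(start + rowsPerChunk, hostRows.length);
            long chunkSize = 0;
            for (int i = start; i < end; i++) {
                chunkSize += hostRows[i].length;
            }
            device.allocate(chunkSize);
            // A real consumer would process the chunk here before releasing it.
            device.free(chunkSize);
            chunks++;
        }
        return chunks;
    }

    public static void main(String[] args) {
        float[][] rows = new float[10][4];          // 40 floats in total
        try {
            uploadEagerly(new FakeDevice(16), rows); // budget of 16 floats
        } catch (IllegalStateException e) {
            System.out.println("eager upload failed: " + e.getMessage());
        }
        FakeDevice device = new FakeDevice(16);
        int chunks = streamFromHost(device, rows, 4);
        System.out.println("chunks=" + chunks + ", peak=" + device.peak);
    }
}
```

With 10 rows of 4 floats and a budget of 16 floats, the eager call exceeds the budget, while the chunked call stays within it because only one chunk is resident at a time.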

Java Heap OOM fix:
Stream subsets of the data during HNSW graph construction instead of loading the full dataset onto the Java heap.
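
The streaming idea can be sketched as a lazy batch iterator. This is an illustration only, not the code in this PR: `BatchedVectorStream`, `batches`, and the `reader` callback are hypothetical names, and the reader stands in for whatever source supplies vectors by ordinal. At most `batchSize` vectors are held on the heap per step.

```java
import java.util.Iterator;
import java.util.NoSuchElementException;
import java.util.function.IntFunction;

public class BatchedVectorStream {

    /**
     * Returns an iterator over batches of at most batchSize vectors.
     * Vectors are read lazily through reader, so only the current
     * batch is materialized on the heap.
     */
    static Iterator<float[][]> batches(
            final int totalVectors, final int batchSize, final IntFunction<float[]> reader) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        return new Iterator<float[][]>() {
            private int cursor = 0;

            @Override
            public boolean hasNext() {
                return cursor < totalVectors;
            }

            @Override
            public float[][] next() {
                if (!hasNext()) {
                    throw new NoSuchElementException();
                }
                int n = Math.min(batchSize, totalVectors - cursor);
                float[][] batch = new float[n][];
                for (int i = 0; i < n; i++) {
                    batch[i] = reader.apply(cursor + i);
                }
                cursor += n;
                return batch;
            }
        };
    }

    public static void main(String[] args) {
        // Hypothetical reader: vector i is {i, i, i}.
        IntFunction<float[]> reader = i -> new float[] {(float) i, (float) i, (float) i};
        Iterator<float[][]> it = batches(10, 4, reader);
        while (it.hasNext()) {
            float[][] batch = it.next();
            // A graph builder would consume the batch here, then drop the reference.
            System.out.println("batch of " + batch.length + " vectors");
        }
    }
}
```

The batch size is the knob that trades heap footprint against the number of read passes. The right value depends on vector dimension and available heap.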

Note: this PR also includes the commit from cuvs-lucene-137 (#137)

EC2 Default User added 2 commits April 28, 2026 21:13
without loading the full set of data on the Java Heap, but instead allows us
to stream the set of data to the Java Heap.
nvzm123 requested a review from a team as a code owner April 29, 2026 04:26
copy-pr-bot Bot commented Apr 29, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

nvzm123 commented Apr 29, 2026

narangvivek10 added the improvement (Improves an existing functionality) and non-breaking (Introduces a non-breaking change) labels Apr 30, 2026
Labels

improvement (Improves an existing functionality), non-breaking (Introduces a non-breaking change)

2 participants