Skip to content

Avoid GPU OOMs and also Java-heap OOMs #139

@nvzm123

Description

@nvzm123

This fixes two issues:

  1. We were eagerly loading the full dataset into GPU instead of letting CAGRA manage streaming it to GPU.

  2. We were passing full datasets to createMultiLayerHnswGraph which crashed my java heap space when benchmarking 10M 1536d (~60GB) dataset; now we can stream manageable subsets instead.

PR: #141

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions