Choice of np.uint64? #212

https://github.com/ekzhu/datasketch/blob/ebe4ca4a5ddf5763df8ea80a9b6851a6044b1fd0/datasketch/minhash.py#L12

In this implementation of MinHash, the hasher appears to produce 32-bit values (`sha1_hash32`).
Why does `_max_hash = np.uint64((1 << 32) - 1)` use `np.uint64`?
I ran experiments with `np.uint32` and the Mersenne prime `np.uint64((1 << 31) - 1)`, and there doesn't seem to be much difference in the results.
If I understand correctly, this would also halve memory consumption.
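
To make the memory point concrete, here is a minimal sketch comparing the storage of a hash-value array under the two dtypes. The array names and `num_perm = 128` are assumptions chosen for illustration, not taken from the library's code.

```python
import numpy as np

num_perm = 128  # assumed number of permutations, for illustration only

# Hash-value arrays initialised to the 32-bit maximum, stored in each dtype.
hashvalues_64 = np.full(num_perm, (1 << 32) - 1, dtype=np.uint64)
hashvalues_32 = np.full(num_perm, (1 << 32) - 1, dtype=np.uint32)

print(hashvalues_64.nbytes)  # 1024 bytes: 128 values * 8 bytes
print(hashvalues_32.nbytes)  # 512 bytes: 128 values * 4 bytes

# The 32-bit maximum is representable in both dtypes without loss.
assert hashvalues_32.max() == hashvalues_64.max() == (1 << 32) - 1
```

This only compares how the stored values are sized. It says nothing about whether the intermediate arithmetic in the permutation step behaves the same in 32 bits.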

Is there a reason to insist on `np.uint64`?
