Bug Fix - Fix checkpointing #87

Open. chhwang wants to merge 1 commit into dev from chhwang/ckpt

Conversation

@chhwang chhwang commented Dec 19, 2025

No description provided.

@chhwang chhwang requested a review from a team as a code owner December 19, 2025 23:18
"""Returns size in bytes of a given sharded tensor."""
if sh_ten.flattened_range is None:
-numel = np.product(sh_ten.local_shape)
+numel = np.prod(sh_ten.local_shape)

@yzygitzh yzygitzh Dec 20, 2025

Contributor

Is this because np.product is deprecated? Can we leave this unchanged?
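
For context on the question above: `np.product` was a deprecated alias of `np.prod` (deprecated in NumPy 1.25, removed in NumPy 2.0), so on a newer NumPy the old line would raise `AttributeError`. Whether that is the actual motivation is an assumption, since the PR has no description. A minimal sketch of the size computation follows; `tensor_nbytes` and its `itemsize` parameter are hypothetical names, not code from this PR, and the `math.prod` fallback is only there to keep the sketch runnable without NumPy.

```python
import math

try:
    import numpy as np
except ImportError:  # keep the sketch runnable without NumPy installed
    np = None


def tensor_nbytes(local_shape, itemsize):
    """Hypothetical helper: size in bytes of a dense tensor with this shape."""
    if np is not None:
        # np.prod exists on both old and new NumPy releases.
        numel = int(np.prod(local_shape))
    else:
        # Standard-library equivalent for integer shapes.
        numel = math.prod(local_shape)
    return numel * itemsize


print(tensor_nbytes((4, 8, 2), itemsize=4))  # 4 * 8 * 2 elements, 4 bytes each
```

Since `np.prod` is available on old and new NumPy alike, keeping the replacement should not break older environments.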

# Copyright (c) 2023, NVIDIA CORPORATION. All rights reserved.

-""" Storage writer for PyT Distributed format allowing asynchronous save. """
+"""Storage writer for PyT Distributed format allowing asynchronous save."""

Contributor

This is not related to the bug fix. Shall we drop it?

if "serialization_format" in inspect.signature(_write_item).parameters:
    from torch.distributed.checkpoint.filesystem import SerializationFormat

    extra_kwargs['serialization_format'] = SerializationFormat.TORCH_SAVE

Contributor

Previously there was no issue here. Is this due to a PyTorch upgrade? Shall we put the reason for the change in the PR description?
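
For readers of this thread, the guard in the diff reads as feature detection on the callee's signature: the keyword is passed only when `_write_item` accepts it. Presumably this lets the same code run against versions with and without the parameter, though the PR does not say so. The sketch below illustrates the pattern with stand-ins only: `write_item_old`, `write_item_new`, `call_write_item`, and the local `SerializationFormat` enum are hypothetical and do not reproduce PyTorch's real signatures.

```python
import inspect
from enum import Enum


class SerializationFormat(Enum):
    """Stand-in for the enum imported in the diff; not the real PyTorch class."""

    TORCH_SAVE = "torch_save"


def write_item_old(stream, data):
    """Hypothetical writer without the serialization_format parameter."""
    return ("old", None)


def write_item_new(stream, data, serialization_format):
    """Hypothetical writer that requires the serialization_format parameter."""
    return ("new", serialization_format)


def call_write_item(write_item, stream, data):
    """Pass serialization_format only if the callee's signature declares it."""
    extra_kwargs = {}
    if "serialization_format" in inspect.signature(write_item).parameters:
        extra_kwargs["serialization_format"] = SerializationFormat.TORCH_SAVE
    return write_item(stream, data, **extra_kwargs)


print(call_write_item(write_item_old, None, b"payload"))
print(call_write_item(write_item_new, None, b"payload"))
```

Checking the signature at call time avoids pinning a version number, at the cost of depending on the name of a private function's parameter.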

@github-actions

Marking as stale. No activity in 60 days.

@github-actions github-actions Bot added the stale label Feb 18, 2026

2 participants