Skip to content

feat(datasets): add datasets SDK and CLI commands for OSS registry#859

Merged
dengwx2009 merged 11 commits into
masterfrom
feature/datasets
Apr 21, 2026
Merged

feat(datasets): add datasets SDK and CLI commands for OSS registry#859
dengwx2009 merged 11 commits into
masterfrom
feature/datasets

Conversation

@dengwx2026

Copy link
Copy Markdown
Collaborator

Summary

  • Add rock.sdk.envhub.datasets package — DatasetSpec, UploadResult, BaseDatasetRegistry ABC, OssDatasetRegistry (list + concurrent upload via oss2), and DatasetClient
  • Extend ConfigManager with DatasetConfig dataclass reading [dataset] section from .rock/config.ini
  • Add DatasetsCommand with rock datasets list and rock datasets upload subcommands; CLI flags override config values

Test Plan

  • 20 dataset unit tests pass (models, OSS registry list + upload, DatasetClient, ConfigManager, DatasetsCommand)
  • Full fast unit suite passes (710 tests, pre-existing failures unaffected)
  • CLI smoke test: rock datasets --help, list --help, upload --help

closes #858

dengwx2009 and others added 11 commits April 20, 2026 19:30
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…dataset

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@dengwx2009 dengwx2009 merged commit a375d7f into master Apr 21, 2026
11 checks passed
Issac-Newton pushed a commit to Issac-Newton/ROCK that referenced this pull request Apr 23, 2026
…libaba#859)

* feat: add docs

* feat: update docs

* docs(datasets): update design doc to reuse OssRegistryInfo from bench

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add models and package skeleton

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add BaseDatasetRegistry ABC

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add OssDatasetRegistry with list_datasets and upload_dataset

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add DatasetClient

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): extend ConfigManager with [dataset] section

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add DatasetsCommand (list + upload)

* feat: refine DatasetSpec

* fix: test fail

---------

Co-authored-by: dengwx <wanxi.dengwx@alibaba-inc.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
zhongwen666 pushed a commit to zhongwen666/ROCK that referenced this pull request May 17, 2026
…libaba#859)

* feat: add docs

* feat: update docs

* docs(datasets): update design doc to reuse OssRegistryInfo from bench

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add models and package skeleton

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add BaseDatasetRegistry ABC

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add OssDatasetRegistry with list_datasets and upload_dataset

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add DatasetClient

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): extend ConfigManager with [dataset] section

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(datasets): add DatasetsCommand (list + upload)

* feat: refine DatasetSpec

* fix: test fail

---------

Co-authored-by: dengwx <wanxi.dengwx@alibaba-inc.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

feat(datasets): add datasets SDK and CLI commands for OSS registry

2 participants