Allow more params when loading dataset DiffuserCamMirflickrHF #176

@Charley-xiao

So in DiffuserCamMirflickrHF.__init__() we get the dataset by this line:

self.dataset = load_dataset(repo_id, split=split)

Sometimes this call is notoriously slow, and sometimes the connection breaks and the download does not retry.

It would be nice if we could pass extra parameters such as num_proc, cache_dir, etc. through to that call, just to make life easier.
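
A minimal sketch of one way this could look. The class below is a hypothetical stand-in, not the actual DiffuserCamMirflickrHF implementation: the `load_dataset_kwargs` and `loader` arguments are assumptions made for illustration, and everything else the real constructor does is omitted.

```python
from typing import Any, Callable, Optional


class DiffuserCamMirflickrHFSketch:
    """Hypothetical stand-in for DiffuserCamMirflickrHF; only the loading step is shown."""

    def __init__(
        self,
        split: str,
        repo_id: str,
        load_dataset_kwargs: Optional[dict] = None,  # hypothetical new argument
        loader: Optional[Callable[..., Any]] = None,  # hypothetical hook, lets the sketch run offline
    ):
        if loader is None:
            # Imported lazily so the sketch can be exercised without the `datasets` package.
            from datasets import load_dataset as loader

        # Copy so the caller's dict is never mutated, then forward everything as-is.
        extra = dict(load_dataset_kwargs or {})
        self.dataset = loader(repo_id, split=split, **extra)
```

With something like this, a caller could write `DiffuserCamMirflickrHFSketch("train", repo_id=..., load_dataset_kwargs={"num_proc": 4, "cache_dir": "/path/to/cache"})`, where the cache path is a placeholder. Passing `download_config=DownloadConfig(max_retries=...)` the same way may also help with dropped connections, assuming the installed version of `datasets` supports it.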
