This repository was archived by the owner on May 16, 2025. It is now read-only.

How to fix the following errors? #6

@missflash

The following error occurs when I run cell 10 of section "6. Tune language model using PPO with our preference model".
I added an __init__.py to /content/trlx/examples/summarize_rlhf/reward_model/, but I still get the same error.
How can I fix it?

     10 import torch
     11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
     13 from tqdm import tqdm
     14 from transformers import AutoTokenizer

ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package
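
A possible way to narrow this down (a diagnostic sketch, not something from the original report): the "'reward_model' is not a package" part of the message generally means Python resolved the name reward_model to a plain module, such as a reward_model.py file, rather than to the reward_model/ directory. Adding __init__.py does not help in that case, because the file is found first. The helper below, whose name describe_module is made up for illustration, uses only the standard library to show where a name resolves and whether it is a package.

```python
import importlib
import importlib.util
import sys


def describe_module(name):
    """Report where `name` resolves on the current sys.path and whether it is a package.

    Hypothetical helper for illustration; it is not part of trlx.
    """
    # Make sure files created after the interpreter started are visible.
    importlib.invalidate_caches()
    spec = importlib.util.find_spec(name)
    if spec is None:
        return {"found": False, "origin": None, "is_package": False}
    # Only packages carry submodule search locations; plain modules have None.
    is_package = spec.submodule_search_locations is not None
    return {"found": True, "origin": spec.origin, "is_package": is_package}


if __name__ == "__main__":
    # In the notebook, check the name from the failing import.
    print(describe_module("reward_model"))
    # The first entries of sys.path show which directory wins the lookup.
    print(sys.path[:3])
```

If is_package comes back False and origin points at a reward_model.py file, the notebook is probably running from inside the reward_model/ directory, or that directory is ahead of its parent on sys.path. Assuming the layout implied by the path above, changing the working directory to /content/trlx/examples/summarize_rlhf (or putting that directory first on sys.path) and restarting the runtime before re-running the cell would be the first thing to try.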
