We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Repo to serve as a baseline/guide for performing post training(SFT/RLHF) of modern LLM models, and evaluating them with baseline datasets.
There was an error while loading. Please reload this page.