Skip to content

4.0#402

Open
jsuarez5341 wants to merge 740 commits into
3.0from
4.0
Open

4.0#402
jsuarez5341 wants to merge 740 commits into
3.0from
4.0

Conversation

@jsuarez5341

Copy link
Copy Markdown
Contributor

This PR will not be merged. We are targeting EoY and 4.0 will just become master. Key goals:

  • Sweeps for all envs, largest ever public dataset of RL experiments
  • Constellation
  • Major perf enhancements

TBD: cpp/barracuda, final constellation features, xlstm, advantage calc tweaks

PLAZMAMA and others added 30 commits May 2, 2026 03:54
Both were declared as const int, silently truncating their float values.
Moved to a const float line so the types match the intent.
Fix float truncation in SPEED and MIN_TICKS constants
Fix memory leak of StaticOMPArg array in vecenv
Native C binding for NetHack via a modified NLE that isolates all
mutable state per-env. Achieves 136K+ training SPS at N=4096 T=4.

- ocean/nethack/ — env binding, reward shaping, prompt handling
- config/nethack.ini — training hyperparameters
- build.sh — auto-clones modified NLE from liujonathan24/NetHack,
  adds EXTRA_LDFLAGS plumbing, makes OMP_LIB/ccache portable
- src/kernels.cu — add cast_dispatch overload for ByteTensor obs

Modified NLE: https://github.com/liujonathan24/NetHack

Setup:
  bash build.sh nethack
  NETHACKDIR=vendor/nle/src/build/dat puffer train nethack
Add NetHack environment (native C vecenv)
[bugfix] continuous action precision mismatch
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.