Add a targeted HFTokenizer encode-latency benchmark by digantdesai · Pull Request #194 · meta-pytorch/tokenizers

digantdesai · 2026-06-12T05:16:08Z

Stack from ghstack (oldest at bottom):

Standalone Release-optimized benchmark for the HFTokenizer long-prompt encode
path. It repeats prose/code/dialogue templates to long inputs (~0.5k-8k tokens)
and prints mean encode latency. This is a targeted latency benchmark for this
code path, not a generic tokenizer harness; correctness is covered by the unit
tests and it exits nonzero if any encode errors.

Baseline latency on the original (pre-fix) code, Gemma-4-31B tokenizer, Release,
mean of 5 reps:

  vector          chars     ids     mean_ms
  prose_0.5k       2240     491       241.4
  code_1k          4514    1891       789.2
  dialogue_1.5k    6720    1345      2212.5
  prose_2k         8946    1450      3905.4
  prose_8k        49416    8005    123504.8

Latency grows with the square of input length (chars x5.5 from prose_2k to
prose_8k -> time x31.6 ~= 5.5^2); an 8k-token prompt takes ~123 s. Gemma's
normalizer turns spaces into the word marker before the space-splitter runs, so
the whole prompt is BPE-merged as a single piece.

Authored with assistance from Claude Code.

[ghstack-poisoned]

Update

e14575a

[ghstack-poisoned]

meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jun 12, 2026

This was referenced Jun 12, 2026

Make HFWord::merge_all O(n log n) #195

Open

Make ReplaceNormalizer::normalize O(N) (single forward pass) #196

Open

mergennachin approved these changes Jun 12, 2026

View reviewed changes

digantdesai marked this pull request as ready for review June 15, 2026 18:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a targeted HFTokenizer encode-latency benchmark#194

Add a targeted HFTokenizer encode-latency benchmark#194
digantdesai wants to merge 1 commit into
gh/digantdesai/1/basefrom
gh/digantdesai/1/head

digantdesai commented Jun 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

digantdesai commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

digantdesai commented Jun 12, 2026 •

edited

Loading