Add runtime warnings when max_rpm and max_tokens are unset by camgrimsec · Pull Request #6371 · crewAIInc/crewAI

camgrimsec · 2026-06-27T22:18:34Z

This PR introduces informative warnings to alert developers when they are running with unbounded resource controls, as identified in a recent security audit.

- max_rpm=None warning: In Crew, a new @model_validator(mode="after") hook logs a warning when max_rpm is None, making operators aware that client-side rate limiting is disabled and upstream API provider limits are the only safeguard against rapid-fire requests.
- max_tokens=None warning: In LLM, a new @model_validator(mode="after") hook logs a warning when both max_tokens and max_completion_tokens are None, surfacing that LLM responses are uncapped and may lead to very large token consumption (e.g. 128k+ output) on modern models.

Why this is a safe, high-merge-likelihood PR

- Zero functional changes: existing workflows continue exactly as before, apart from the new log output.
- No breaking changes: no new validation failures, no raised errors, no defaults altered.
- Low code footprint: two short @model_validator hooks plus an import logging line.
- High impact: helps prevent silent "denial-of-wallet" incidents by surfacing the issue at runtime, exactly when it matters.

Both warnings use logging.warning (consistent with the existing logging patterns in the codebase, including llm.py's existing import logging) and provide actionable advice to set explicit values.
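Because the warnings go through the module-level `logging.warning` call, they are routed to the root logger. The following is a minimal sketch, using only the standard library, of how an application might capture such warnings for inspection or alerting. `ListHandler` and `emit_uncapped_warning` are hypothetical names introduced for illustration, and the message text is an assumption, not the PR's exact wording.

```python
import logging


class ListHandler(logging.Handler):
    """Collects warning-level (and above) messages in memory."""

    def __init__(self) -> None:
        super().__init__(level=logging.WARNING)
        self.messages: list[str] = []

    def emit(self, record: logging.LogRecord) -> None:
        self.messages.append(record.getMessage())


def emit_uncapped_warning() -> None:
    # Hypothetical stand-in for the PR's validators: a module-level
    # logging.warning call, which is handled by the root logger.
    logging.warning("max_rpm is None; client-side rate limiting is disabled.")


handler = ListHandler()
root = logging.getLogger()  # no name -> root logger
root.addHandler(handler)
try:
    emit_uncapped_warning()
finally:
    root.removeHandler(handler)
```

After this runs, `handler.messages` holds the single warning text. The same hook point is where an operator could attach a filter if the warning is intentional in a given deployment.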

Implementation notes

The original audit recommendation suggested overriding __init__, but both Crew and LLM are Pydantic models that already use @model_validator(mode="after") hooks (~13 of them in Crew) for post-construction setup. Overriding __init__ would conflict with Pydantic's validation machinery; using @model_validator(mode="after") is the idiomatic Pydantic v2 pattern and matches the surrounding code.
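The pattern described above can be sketched roughly as follows, assuming Pydantic v2 is installed. `CrewSketch` is a hypothetical stand-in with a single field, not the real `Crew` class, and the warning text is illustrative rather than the PR's exact wording.

```python
import logging
from typing import Optional

from pydantic import BaseModel, Field, model_validator


class CrewSketch(BaseModel):
    """Hypothetical stand-in for Crew with only the relevant field."""

    max_rpm: Optional[int] = Field(
        default=None,
        description=(
            "Maximum number of requests per minute for the crew execution. "
            "Set to None to disable rate limiting (not recommended for production)."
        ),
    )

    @model_validator(mode="after")
    def _warn_rate_limit_disabled(self) -> "CrewSketch":
        # Runs after field validation, so self.max_rpm is already parsed.
        if self.max_rpm is None:
            logging.warning(
                "max_rpm is None; client-side rate limiting is disabled."
            )
        # An "after" validator must return the model instance.
        return self


capped = CrewSketch(max_rpm=10)  # no warning logged
uncapped = CrewSketch()          # warning logged, construction still succeeds
```

The validator only logs and returns `self`, so construction never fails because of it. This is what keeps the change non-breaking.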

Changes

lib/crewai/src/crewai/crew.py

- Adds import logging.
- Refines the max_rpm field description to call out the implication of None.
- Adds _warn_rate_limit_disabled @model_validator(mode="after").

lib/crewai/src/crewai/llm.py

- logging is already imported.
- Adds _warn_tokens_uncapped @model_validator(mode="after").

Stats

- 2 files changed, +34 / −2.
- ast.parse clean on both files.

Summary by CodeRabbit

Bug Fixes
- Added runtime warnings when rate limiting is disabled, helping users catch potentially unsafe configuration.
- Added a warning when token limits are not set, making uncapped responses more visible.
Documentation
- Clarified that disabling rate limiting is possible, but not recommended for production use.

Surfaces silent denial-of-wallet risks by logging warnings at construction time when resource controls are left unbounded:

* Crew.max_rpm=None: warn that client-side rate limiting is disabled and the only safeguard is the upstream provider's rate limit.
* LLM.max_tokens AND max_completion_tokens both None: warn that responses are uncapped and can produce very large outputs (e.g. 128k+ tokens) on modern models.

Both warnings are emitted via @model_validator(mode='after') hooks using logging.warning, consistent with the existing logging patterns. No behavior change, no breaking changes, no new defaults, no raised errors.

Also clarifies the max_rpm field description.

coderabbitai · 2026-06-27T22:18:57Z

Walkthrough

Adds logging import to crew.py and introduces two Pydantic model_validator(mode="after") methods: Crew._warn_rate_limit_disabled warns when max_rpm is None, and LLM._warn_tokens_uncapped warns when neither max_tokens nor max_completion_tokens is set. The max_rpm field description is also updated.

Changes

Runtime Configuration Warnings

| Layer / File(s) | Summary |
| --- | --- |
| Crew max_rpm warning `lib/crewai/src/crewai/crew.py` | Adds `import logging`, updates `max_rpm` field description to note that `None` disables rate limiting, and adds `_warn_rate_limit_disabled` model validator that logs a warning when `max_rpm is None`. |
| LLM token cap warning `lib/crewai/src/crewai/llm.py` | Adds `_warn_tokens_uncapped` model validator that logs a warning when both `max_tokens` and `max_completion_tokens` are unset. |

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 66.67%, which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title clearly matches the main change: adding runtime warnings for unset max_rpm and token limits. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |


coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@lib/crewai/src/crewai/crew.py`:
- Around line 300-301: `max_rpm=0` is still treated as “disabled” in the `Crew`
RPM flow, but the new docs/validation only mention `None`. Update the
`Crew`-related `max_rpm` handling so zero is covered the same way as `None` in
the truthiness checks that control controller hookup and cleanup, and align the
warning/description accordingly. Make sure the validator and any explanatory
text near `Crew`’s `max_rpm` parameter reflect both uncapped values, not just
`None`.

In `@lib/crewai/src/crewai/llm.py`:
- Around line 686-698: The uncapped-token warning in LLM._warn_tokens_uncapped
only checks for None, but runtime logic treats 0 the same as no cap via the
max_tokens/max_completion_tokens fallback. Update the validator and any related
cap-handling in LLM so zero-valued caps are normalized or rejected consistently,
and make the warning trigger for both None and 0 when the effective request will
be uncapped.


ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 99be48f0-e0d5-4407-af2f-be890e29ad76

📥 Commits

Reviewing files that changed from the base of the PR and between 6491f5a and b9b8342.

📒 Files selected for processing (2)

lib/crewai/src/crewai/crew.py
lib/crewai/src/crewai/llm.py

coderabbitai · 2026-06-27T22:21:26Z

+            "Maximum number of requests per minute for the crew execution. "
+            "Set to None to disable rate limiting (not recommended for production)."


🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

max_rpm=0 still disables throttling without triggering this warning.

Line 767 and Line 2102 both gate RPM behavior on truthiness, so max_rpm=0 skips controller hookup/cleanup just like None. As written, the new description and validator only cover None, so this PR still misses one uncapped configuration path.

Also applies to: 617-630
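The gap described in this comment comes down to Python truthiness: both `None` and `0` are falsy, so a gate like `if max_rpm:` skips the limiter for either value, while `max_rpm is None` matches only one of them. A minimal sketch with hypothetical helper names (none of these functions exist in crewAI):

```python
from typing import Optional


def limiter_enabled_by_truthiness(max_rpm: Optional[int]) -> bool:
    # Mirrors a truthiness gate such as `if self.max_rpm:`.
    # Both None and 0 are falsy, so neither enables the limiter.
    return bool(max_rpm)


def warns_on_none_only(max_rpm: Optional[int]) -> bool:
    # Mirrors the validator condition described in the PR.
    return max_rpm is None


def warns_when_effectively_disabled(max_rpm: Optional[int]) -> bool:
    # One possible alignment: warn whenever the truthiness gate
    # would skip the limiter, which covers None and 0 alike.
    return not max_rpm
```

With these helpers, `max_rpm=0` leaves the limiter disabled but is flagged only by the third predicate. Rejecting `0` outright at validation time would be another way to close the gap, at the cost of a behavior change.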


coderabbitai · 2026-06-27T22:21:26Z

+    @model_validator(mode="after")
+    def _warn_tokens_uncapped(self) -> LLM:
+        """Warn when neither ``max_tokens`` nor ``max_completion_tokens`` is set.
+
+        Without an explicit cap, a single prompt can produce very large outputs
+        (e.g. 128k+ tokens on modern models), leading to runaway spend. This
+        warning surfaces the risk at runtime without changing behavior.
+        """
+        if self.max_tokens is None and self.max_completion_tokens is None:
+            logging.warning(
+                "max_tokens/max_completion_tokens not set; LLM responses are uncapped. "
+                "Set a limit (e.g., max_tokens=4096) to control token costs and avoid runaway generation."
+            )


🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

Zero-valued caps bypass this warning but are still treated as uncapped.

Line 755 and Line 2168 both collapse 0 to “no cap” via self.max_tokens or self.max_completion_tokens. That means LLM(max_tokens=0) still sends an uncapped request without hitting this validator, so the warning does not cover the actual runtime behavior.
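The `or` fallback quoted above returns its second operand whenever the first is falsy, so a `max_tokens` of `0` is skipped just like `None`. A small sketch of the difference, using hypothetical free functions in place of the `LLM` attributes:

```python
from typing import Optional


def effective_cap_with_or(
    max_tokens: Optional[int], max_completion_tokens: Optional[int]
) -> Optional[int]:
    # Mirrors `self.max_tokens or self.max_completion_tokens`:
    # a falsy first value (None or 0) falls through to the second.
    return max_tokens or max_completion_tokens


def effective_cap_explicit(
    max_tokens: Optional[int], max_completion_tokens: Optional[int]
) -> Optional[int]:
    # Falls back only when max_tokens is None, so 0 is preserved
    # and can be rejected or warned about explicitly.
    return max_tokens if max_tokens is not None else max_completion_tokens


def is_effectively_uncapped(
    max_tokens: Optional[int], max_completion_tokens: Optional[int]
) -> bool:
    # One possible warning condition that matches the `or` fallback:
    # true when neither value is a positive cap.
    return not (max_tokens or max_completion_tokens)
```

Under the `or` fallback, `(0, None)` yields `None` and `(0, 4096)` silently yields `4096`. The explicit version returns `0` in both cases, which makes the unusual value visible to later checks.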


camgrimsec wants to merge 1 commit into crewAIInc:main from camgrimsec:feat/runtime-warnings-unbounded
