Checklist
Describe the Question
Hello - I'm using CloudSDK + LangGraph. One node in my agent calls a tool hosted by a remote MCP server. The tool is called repeatedly as part of the agent loop and each call adds a lot of context, so my token count becomes very large.
- Does the SDK support compressing the context based on a trigger?
- Secondly, our system prompt is fairly heavy. Does the SDK support prompt caching, as shown in the link below? If yes, please provide details on how to use it.
https://docs.langchain.com/oss/javascript/integrations/middleware/anthropic