Bug Description
When deep-research generates the final report from a large research corpus, it sends all research content as context in a single request to the LLM (e.g., Gemini 2.5 Pro). The free tier enforces a strict token limit (e.g., 250k TPM, tokens per minute, for Gemini 2.5 Pro), so an oversized request frequently fails or returns an empty report.
Environment:
- Deployment Method: Docker
- OS: macOS
Steps to Reproduce
- Proceed through the workflow to the final report generation step.
- Ensure the research content is sufficiently large.
- Attempt to generate the report and observe the result, typically an empty report or a failed request.
Expected Behavior
Ideally, the system would either warn the user when the context is about to exceed the token limit, or automatically split report generation into several API requests that each stay within the limit.
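To illustrate the requested behavior, here is a minimal sketch in Python. It is not taken from the deep-research codebase: the function names, the 4-characters-per-token estimate, and the greedy batching strategy are all assumptions chosen for illustration. A real implementation would likely use the provider's own token counter and leave headroom for the prompt and the model's output.

```python
from typing import List

# Rough heuristic (an assumption): about 4 characters per token.
# Real tokenizers differ, so treat the result as an estimate only.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Estimate the token count of a string, rounding up."""
    return (len(text) + CHARS_PER_TOKEN - 1) // CHARS_PER_TOKEN


def exceeds_budget(sections: List[str], budget: int) -> bool:
    """Return True when the combined estimate is larger than the budget."""
    return sum(estimate_tokens(s) for s in sections) > budget


def split_into_batches(sections: List[str], budget: int) -> List[List[str]]:
    """Greedily pack sections into batches that stay within the budget.

    A single section that is larger than the budget ends up alone in its
    own batch; the caller would still need to truncate or summarize it.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    used = 0
    for section in sections:
        cost = estimate_tokens(section)
        # Start a new batch when adding this section would overflow.
        if current and used + cost > budget:
            batches.append(current)
            current, used = [], 0
        current.append(section)
        used += cost
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    # Hypothetical research corpus: three sections of about 100 tokens each.
    corpus = ["a" * 400, "b" * 400, "c" * 400]
    budget = 250  # hypothetical per-request token budget
    if exceeds_budget(corpus, budget):
        print("Warning: research context exceeds the token budget; splitting.")
    for i, batch in enumerate(split_into_batches(corpus, budget), start=1):
        print(f"request {i}: {len(batch)} section(s)")
```

Each batch could then be sent as its own request, with the partial outputs merged into the final report. Because the 250k figure is a per-minute limit, consecutive requests may also need to be spaced out.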
Screenshots
No response
Deployment Method
Docker
Desktop OS
macOS
Desktop Browser
No response
Desktop Browser Version
No response
Smartphone Device
No response
Smartphone OS
No response
Smartphone Browser
No response
Smartphone Browser Version
No response
Additional Logs
No response