Currently experiencing issues where the application fails to start when Elasticsearch is not immediately available during container startup. This creates problems in docker-compose environments where services start in parallel.
The ingestion module should have proper retry logic with exponential backoff when connecting to Elasticsearch. Right now if ES isn't ready, the whole service can crash or hang.
Proposed changes:
- Add connection retry decorator for ES operations
- Implement exponential backoff (start at 1s, max 30s)
- Add proper logging for connection attempts
- Make retry attempts configurable via environment variable
- Handle connection pool exhaustion gracefully
This would make the deployment more robust and prevent the startup race conditions we're seeing.
Currently experiencing issues where the application fails to start when Elasticsearch is not immediately available during container startup. This creates problems in docker-compose environments where services start in parallel.
The ingestion module should have proper retry logic with exponential backoff when connecting to Elasticsearch. Right now if ES isn't ready, the whole service can crash or hang.
Proposed changes:
This would make the deployment more robust and prevent the startup race conditions we're seeing.