Add retry mechanism and error handling for Elasticsearch connection failures

Currently experiencing issues where the application fails to start when Elasticsearch is not immediately available during container startup. This creates problems in docker-compose environments where services start in parallel.

The ingestion module should have proper retry logic with exponential backoff when connecting to Elasticsearch. Right now if ES isn't ready, the whole service can crash or hang.

Proposed changes:
- Add connection retry decorator for ES operations
- Implement exponential backoff (start at 1s, max 30s)
- Add proper logging for connection attempts
- Make retry attempts configurable via environment variable
- Handle connection pool exhaustion gracefully

This would make the deployment more robust and prevent the startup race conditions we're seeing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add retry mechanism and error handling for Elasticsearch connection failures #545

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add retry mechanism and error handling for Elasticsearch connection failures #545

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions