Add multi-DB assistant with auto-monitor, analyse, session monitor, SQL tuning & snapshot compare by devin-ai-integration[bot] · Pull Request #15 · Cognition-Partner-Workshops/platform-engineering-shared-services

devin-ai-integration · 2026-04-04T14:43:55Z

Summary

Adds a Python tool under tools/pg-assistant/ that converts natural language questions into SQL queries using a local Ollama LLM and executes them against PostgreSQL or Oracle databases via a Streamlit web UI. Includes automated tablespace monitoring with auto-extend, fully programmatic performance analysis, live session/lock monitoring, an AI-powered SQL tuning advisor, and side-by-side snapshot comparison with Plotly visualizations.

Modules:

app.py — Streamlit web UI with sidebar for connection/profile management, tabbed interface (Query, Schema, Auto Monitor, Auto Analyse, Sessions & Locks, SQL Tuning Advisor, Compare Snapshots, History)
db_client.py — Abstract BaseDBClient with PostgreSQLClient (psycopg2) and OracleClient (oracledb thin mode) implementations, plus create_db_client() factory
llm_client.py — Ollama REST API client (/api/generate)
sql_generator.py — Prompt engineering with schema injection, dual-DB system prompts (PostgreSQL/Oracle SQL dialects), SQL extraction, keyword-based safety validation, retry logic
profile_manager.py — Save/load/delete connection profiles as JSON (~/.pg-assistant/profiles.json) with db_type and service_name fields
auto_monitor.py — TablespaceMonitor class: periodic tablespace usage checks (configurable interval, default 1hr), auto-extend Oracle datafiles up to 20 GB per file, PostgreSQL storage size reporting
auto_analyse.py — PerformanceAnalyser class: live V$/pg_stat_* collection, AWR snap-ID range analysis (Oracle), pgProfile sample-ID range analysis (PostgreSQL), latest pg_stat_statements snapshot, uploaded report file parsing (AWR HTML/text, CSV, pgProfile). Analysis is 100% programmatic — Python code extracts real findings from DB data with specific SQL IDs, table names, query text, and exact fix commands. No LLM involved in analysis (codellama was hallucinating generic advice). Includes best-practice checks for row contention, sequence caching, high elapsed time, full table scans, high execution count, temp usage, and 30+ other sections.
session_monitor.py — SessionMonitor class: active sessions, blocking lock tree, lock details, long-running queries, wait events, and kill/cancel session for both Oracle and PostgreSQL
sql_tuning_advisor.py — SQLTuningAdvisor class: EXPLAIN PLAN execution, per-table metadata collection (columns, indexes, statistics), and LLM-powered tuning recommendations with specific index/rewrite/maintenance suggestions
snapshot_compare.py — SnapshotComparator class: compare two AWR (Oracle) or pgProfile (PostgreSQL) snapshot ranges, compute delta metrics, generate Plotly bar/pie charts for visual comparison, and produce programmatic differential analysis (no LLM)
requirements.txt — requests, psycopg2-binary, oracledb, streamlit, pandas, plotly
README.md — Architecture, usage, installation docs

Key behaviors:

On connect, fetches schema metadata (information_schema for PG, ALL_TAB_COLUMNS for Oracle) and injects it into every LLM prompt
Blocks dangerous SQL keywords (DROP, DELETE, UPDATE, etc.) and enforces SELECT/WITH-only queries in the natural language path
Auto Monitor uses a separate internal code path for administrative DDL (ALTER TABLESPACE, ALTER DATABASE DATAFILE)
Retries SQL generation up to 3 times if validation fails; additionally, if a generated query fails at the DB level (e.g. ORA-00933), the error is automatically fed back to the LLM for a corrected re-generation attempt
Query results rendered as interactive DataFrames with CSV download
Connection profiles persist across sessions via JSON file with db_type field
Oracle column names are normalized to lowercase in OracleClient.execute_query() for consistent downstream access
All analysis queries include 500-character SQL text snippets and exclude system schemas/queries
Performance analysis is fully programmatic: _build_findings_report() covers 30+ data sections — all extracted from real DB data with specific SQL IDs, table names, query text, and exact fix commands

Updates since last revision

Restructured analysis output to enterprise DBA / Copilot-quality format:

auto_analyse.py: Complete rewrite of _build_findings_report() to produce structured, severity-grouped output:
- Executive Summary with health rating (CRITICAL / WARNING / ADVISORY / HEALTHY) and key metric headlines
- Database & Workload Overview table (cache hit, backends, commits, rollbacks, WAL, temp usage)
- Top Bottlenecks grouped by severity level (SEV-1 Critical, SEV-2 Important, SEV-3 Advisory) — each bottleneck includes specific SQL IDs, table names, exact metrics, and markdown tables
- Configuration Review from pg_settings / v$parameter with risk flags (e.g. statement_timeout=0, high max_connections)
- Risk Register table (Risk, Likelihood, Impact)
- Prioritised Action Plan with Priority 0 (Immediate), Priority 1 (Structural), Priority 2 (Performance Hygiene) groupings
New data collection queries added:
- PostgreSQL: pg_stat_wal (WAL volume, FPI, sync time — PG 14+), pg_total_relation_size with TOAST breakdown, idle-in-transaction sessions, pg_settings configuration parameters, pg_stat_replication lag
- Oracle: v$parameter configuration, v$session idle sessions (> 5 min)
New bottleneck detectors: rollback explosion, idle-in-transaction sessions, WAL pressure, replication lag, table bloat, checkpoint pressure, backend buffer writes, temp file spilling, high redo log switches
snapshot_compare.py: Removed dead LLM code (_format_comparison_text() and _get_llm_comparison() methods)
app.py: Updated all spinner text and button labels to remove "LLM" references from analysis paths (analysis is fully programmatic, LLM is only used for SQL generation and SQL Tuning Advisor)

Review & Testing Checklist for Human

Suggested test plan: Run streamlit run app.py with Ollama (codellama) running and both a reachable PostgreSQL and Oracle instance. Verify:

Connecting to PostgreSQL via sidebar form, saving and loading a profile
Connecting to Oracle via sidebar form (with service_name), saving and loading a profile
Asking a simple question in Query tab for each DB type (e.g. "show top 10 tables by row count")
Verify Oracle queries use ROWNUM syntax, not FETCH FIRST
Intentionally trigger a failing query and verify the auto-retry regenerates corrected SQL
Schema tab loads correctly for each DB type
A dangerous prompt is blocked ("delete all users")
Auto Monitor tab: run a one-time check for Oracle (verify tablespace data appears), start periodic monitoring
Auto Analyse — Live mode: collect data and run full analysis for each DB type; critically verify output has severity-grouped bottlenecks (SEV-1/2/3) with real SQL IDs, real table names, real metrics — not empty sections or generic placeholders
Auto Analyse — AWR Snap ID mode (Oracle): load snapshots, select a range, run analysis
Auto Analyse — pgProfile Snap ID mode (PostgreSQL): load samples, select a range, run analysis
Auto Analyse — Latest pg_stat_statements (PostgreSQL): verify extension check, run analysis
Auto Analyse — Upload mode: upload an AWR HTML report and a pg_stat_statements CSV, verify parsing and summary
Verify analysis output excludes system queries (no SYS/SYSTEM schema SQL, no SET/RESET/BEGIN queries)
Sessions & Locks tab: verify Active Sessions view shows data for each DB type, cycle through all five views
Sessions & Locks tab: verify Blocking Lock Tree and Lock Details render correctly (may need to simulate a blocking lock)
Sessions & Locks tab: test kill/cancel session on a disposable test session (verify Oracle SID/Serial# and PostgreSQL PID inputs work)
SQL Tuning Advisor tab: paste a simple SELECT, run with EXPLAIN only (PostgreSQL), verify plan output and LLM recommendations appear
SQL Tuning Advisor tab: paste a multi-table JOIN query, verify table metadata (columns, indexes, stats) is collected and shown in the expander
SQL Tuning Advisor tab: test with Oracle — verify EXPLAIN PLAN FOR + DBMS_XPLAN.DISPLAY path works
SQL Tuning Advisor tab: test EXPLAIN ANALYZE checkbox (PostgreSQL) — verify the query actually executes and shows actual vs estimated rows
Compare Snapshots tab (Oracle): select two AWR snapshot ranges, run comparison, verify delta table renders, Plotly charts display, and programmatic analysis references real SQL IDs
Compare Snapshots tab (PostgreSQL): select two pgProfile sample ranges, run comparison, verify same outputs
Compare Snapshots tab: verify charts show grouped bars with Snapshot A vs B and percentage deltas are calculated correctly
CSV download works on query results
Switching between PG and Oracle profiles without stale state
Verify the timeout slider works: set to 60s, trigger a large-schema query, confirm timeout behavior

Notes

Performance analysis is now fully programmatic — LLM is only used for SQL generation (Query tab) and SQL Tuning Advisor. Auto Analyse and Compare Snapshots tabs produce findings entirely from Python code analyzing real DB data.
Analysis output follows enterprise DBA assessment format: Executive Summary → Workload Overview → Severity-grouped Bottlenecks → Configuration Review → Risk Register → Prioritised Action Plan.
This tool auto-executes generated SQL without user confirmation. For production use, consider adding a confirmation step.
Bare module imports (from db_client import ...) require running from the tools/pg-assistant/ directory. Will break if invoked from elsewhere or installed as a package.
oracledb thin mode does not require Oracle Client installation but may not support all Oracle features (e.g. Advanced Queuing, Continuous Query Notification).
psycopg2 connection objects stored in Streamlit session_state may not survive all rerun edge cases; the is_connected property mitigates this with a health-check query but manual reconnection may occasionally be needed.
Default Ollama timeout is 300s. The first request after model load is typically the slowest; subsequent requests should be faster. Users can adjust via the sidebar slider.
Uploaded report files are truncated to 15,000 characters. Very large AWR or pgProfile reports will lose tail content — the most important sections (top SQL, wait events) are typically near the top, but verify critical data isn't being cut.
HTML report parsing uses simple regex-based tag stripping, not a full HTML parser. Complex AWR HTML reports with nested tables may lose formatting context.
The README.md architecture diagram does not yet reflect the Session Monitor, SQL Tuning Advisor, or Compare Snapshots modules.
SQL injection patterns in snap-ID queries use Python string .replace() / .format() for placeholders. Values originate from DB query results (not direct user input) but are passed through Streamlit selectbox → integer → string interpolation. Similarly, session_monitor.py Oracle kill session uses .format(sid=sid, serial=serial) — values come from st.number_input (integer-constrained) but the module itself doesn't validate types.
Session monitor and auto-analyse require elevated privileges — Oracle: DBA/SELECT_CATALOG_ROLE; PostgreSQL: pg_monitor role or superuser. Users with limited grants will get permission errors.
Hardcoded analysis thresholds (cache hit < 95%, rollback rate > 10% = SEV-1, elapsed > 5s per exec, exec count > 1000, seq scans > 100 on tables > 10k rows, etc.) are reasonable defaults but may need tuning for specific environments.
LLMClient is still imported and passed to PerformanceAnalyser and SnapshotComparator constructors for API compatibility, even though neither class uses it for analysis anymore. Minor tech debt — could be cleaned up in a follow-up.

Link to Devin session: https://partner-workshops.devinenterprise.com/sessions/75db244b07ca4a3db4c6563dafd2cafc

- app.py: Main CLI loop with rich terminal output, argument parsing - llm_client.py: Ollama API client for LLM communication - mcp_client.py: MCP PostgreSQL server client for query execution - sql_generator.py: Prompt engineering, SQL extraction, and safety validation - requirements.txt: Python dependencies (requests, rich) - README.md: Architecture docs, usage examples, installation instructions Features: - Natural language to SQL via Ollama (codellama model) - Schema-aware prompt engineering - SQL safety enforcement (SELECT-only, blocks dangerous keywords) - Retry logic for failed SQL generation - Rich formatted output with timing metrics - Interactive CLI commands (help, schema, clear, exit)

devin-ai-integration · 2026-04-04T14:43:58Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

…ofiles - Replace CLI (app.py) with Streamlit web UI - Replace MCP client with direct PostgreSQL connection via psycopg2 (db_client.py) - Add connection profile manager for save/load DB configs (profile_manager.py) - Update requirements.txt with streamlit, psycopg2-binary, pandas - Update README with new architecture and usage docs - Keep llm_client.py and sql_generator.py unchanged

- Refactor db_client.py with abstract BaseDBClient, PostgreSQLClient, OracleClient - Add oracledb driver support (thin mode, no Oracle Client needed) - Add db_type dropdown in profile manager and connection sidebar - Add auto_monitor.py: periodic tablespace monitoring, auto-extend datafiles (max 20GB/file) - Add auto_analyse.py: AWR/pg_stat_statements analysis with LLM summary + action plan - Update sql_generator.py for dual-DB SQL dialects - Update Streamlit UI with Auto Monitor and Auto Analyse tabs - Update requirements.txt with oracledb dependency - Update README.md with new architecture and features

…n UI - Default timeout increased from 120s to 300s (first model load is slow) - Added timeout slider (60-600s) in Ollama Settings sidebar - Improved timeout error message with troubleshooting hint

- Update Oracle system prompt to use ROWNUM instead of FETCH FIRST/OFFSET (compatible with Oracle 11g+, fixes ORA-00933) - Increase MAX_RETRIES from 2 to 3 for SQL generation - Add auto-retry in Query tab: when a query fails with a DB error, the error is fed back to the LLM to regenerate corrected SQL automatically - Explicit Oracle syntax guidance: NVL, DUAL, TO_DATE, subquery for ORDER BY + ROWNUM

- Oracle: AWR snap ID range selector (queries DBA_HIST_SNAPSHOT, collects DBA_HIST_SQLSTAT/SYSTEM_EVENT/SYSSTAT for selected range) - PostgreSQL: pgProfile sample ID range selector (queries profile.samples, collects profile.stmt_list/wait_sampling_total for selected range) - PostgreSQL: latest pg_stat_statements one-click analysis with extension check - Both: file upload for AWR HTML/text, pg_stat_statements CSV, pgProfile reports - Auto Analyse tab now has radio button mode selector per DB type - Parsed report text shown in expander when no raw data available

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 8 additional findings.

Oracle's oracledb driver returns column names in UPPERCASE by default. Normalize to lowercase in OracleClient.execute_query() so all downstream code (AWR snap selector, auto_analyse, etc.) can use lowercase keys consistently.

- Oracle: collect top CPU SQL (v$sql by cpu_time), full table scans (v$sql_plan TABLE ACCESS FULL), existing indexes (all_indexes + all_ind_columns with LISTAGG), stale stats (all_tab_statistics), and execution plans (v$sql_plan detail for top 5 sql_ids) - PostgreSQL: collect top CPU queries (pg_stat_statements with blk_read_time/temp_blks), seq scan tables (pg_stat_user_tables with avg rows per scan), existing indexes (pg_indexes with DDL), stale stats/vacuum (dead tuples, last_analyze), lock waits (pg_stat_activity) - Rewrote LLM system prompt to require SQL-ID-specific analysis: high-CPU SQL with exact sql_id/queryid, full table scan tables with causing sql_id, missing index CREATE statements referencing the queryid that benefits, stale stats with ANALYZE/DBMS_STATS commands, unused index DROP statements, and numbered action plan with exact SQL commands and expected improvement

Session/Lock Monitor (session_monitor.py): - Active sessions view (v$session / pg_stat_activity) - Blocking lock tree with recursive hierarchy (CONNECT BY for Oracle, recursive CTE for PostgreSQL) - Lock details (v$lock / pg_locks with object names) - Long-running queries (>5s threshold) - Wait event chains - Kill/cancel session UI (ALTER SYSTEM KILL SESSION for Oracle, pg_cancel_backend/pg_terminate_backend for PostgreSQL) SQL Tuning Advisor (sql_tuning_advisor.py): - Paste any SQL, runs EXPLAIN PLAN (Oracle) or EXPLAIN (PostgreSQL) - Extracts tables from plan, collects per-table metadata: column stats, existing indexes, table stats, clustering factor - PostgreSQL: optional EXPLAIN ANALYZE with actual execution stats - LLM prompt requires step-by-step plan analysis, root cause, specific CREATE INDEX statements, SQL rewrite suggestions, stats maintenance commands, and numbered action plan Updated app.py with two new tabs in the UI.

…ysis, exclude system queries, 500-char SQL text

…ns list not dict

…ueries (PG 17+ compat)

…olders, add data-grounding instructions

…ead of system prompt (codellama is a completion model, not instruction-following)

… Python code now identifies all issues (high elapsed SQL, full table scans, sequence caching, stale stats, unused indexes, etc.) with real sql_ids, table names, and query text - LLM only provides a brief supplementary summary of pre-identified findings - Same hybrid approach applied to snapshot comparison

…LLM summary - Add top_cpu_queries/top_cpu_sql section (most important - always shows top SQL) - Add top_queries/top_elapsed_sql section (deduped from CPU section) - Add database_stats overview (cache hit ratio, connections, temp usage) - Add connection_stats section (idle connection detection) - Add Oracle system_stats with cache hit ratio, hard parse ratio, disk sorts - Add Oracle SGA configuration, tablespace I/O, redo log switches, temp usage - Add Oracle execution plans display with full scan/hash join detection - Add Oracle parallel queries section - Add pgProfile wait events section - Add table_stats (top tables by activity) section - Add AWR/pgProfile fallback for top SQL sections - Remove LLM summary entirely (codellama keeps hallucinating generic advice) - Update app.py labels: 'Performance Analysis Report' instead of 'AI Analysis' - All analysis is now 100% programmatic from real DB data

…, config review, prioritised actions

…le, AWR, pg_stat_statements)

devin-ai-integration Bot changed the title ~~Add AI-powered PostgreSQL assistant CLI tool (pg-assistant)~~ Add AI-powered PostgreSQL assistant with Streamlit UI (pg-assistant) Apr 4, 2026

devin-ai-integration Bot added 2 commits April 4, 2026 17:07

Remove unused mcp_client.py (replaced by db_client.py)

ac1a5e7

devin-ai-integration Bot changed the title ~~Add AI-powered PostgreSQL assistant with Streamlit UI (pg-assistant)~~ Add multi-DB assistant (PostgreSQL + Oracle) with auto-monitor and auto-analyse Apr 4, 2026

devin-ai-integration Bot added 3 commits April 5, 2026 02:55

Increase Ollama timeout to 300s and add configurable timeout slider i…

94e3973

…n UI - Default timeout increased from 120s to 300s (first model load is slow) - Added timeout slider (60-600s) in Ollama Settings sidebar - Improved timeout error message with troubleshooting hint

devin-ai-integration Bot commented Apr 6, 2026

View reviewed changes

devin-ai-integration Bot added 3 commits April 6, 2026 07:31

devin-ai-integration Bot changed the title ~~Add multi-DB assistant (PostgreSQL + Oracle) with auto-monitor and auto-analyse~~ Add multi-DB assistant with auto-monitor, auto-analyse, session monitor & SQL tuning Apr 6, 2026

Add Compare Snapshots with Plotly charts, enhanced best-practice anal…

47ecc13

…ysis, exclude system queries, 500-char SQL text

devin-ai-integration Bot changed the title ~~Add multi-DB assistant with auto-monitor, auto-analyse, session monitor & SQL tuning~~ Add multi-DB assistant with auto-monitor, analyse, session monitor, SQL tuning & snapshot compare Apr 6, 2026

devin-ai-integration Bot added 8 commits April 6, 2026 09:20

Fix AttributeError in Compare Snapshots tab: list_awr_snapshots retur…

bcd80b1

…ns list not dict

Detect PostgreSQL version and use version-aware bgwriter/checkpoint q…

af33818

…ueries (PG 17+ compat)

Fix LLM hallucination: simplify system prompts, remove example placeh…

7402149

…olders, add data-grounding instructions

Fix LLM hallucination v2: move instructions AFTER data in prompt inst…

fe6674f

…ead of system prompt (codellama is a completion model, not instruction-following)

Copilot-quality analysis: severity-grouped bottlenecks, risk register…

f74637a

…, config review, prioritised actions

Add structured HTML/CSV parsers for uploaded report analysis (pgProfi…

f43ffa4

…le, AWR, pg_stat_statements)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add multi-DB assistant with auto-monitor, analyse, session monitor, SQL tuning & snapshot compare#15

Add multi-DB assistant with auto-monitor, analyse, session monitor, SQL tuning & snapshot compare#15
devin-ai-integration[bot] wants to merge 19 commits into
mainfrom
devin/1775313621-pg-assistant-cli

devin-ai-integration Bot commented Apr 4, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot commented Apr 4, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Uh oh!

Conversation

devin-ai-integration Bot commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Updates since last revision

Review & Testing Checklist for Human

Notes

Uh oh!

devin-ai-integration Bot commented Apr 4, 2026

🤖 Devin AI Engineer

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

devin-ai-integration Bot commented Apr 4, 2026 •

edited

Loading