Smarter, faster, and more intuitive analysis of your datasets using Generative AI + Python magic.
Repository: Insert your GitHub repo link here
Meet Data Analyst Agent 2.0 — an AI-driven assistant that eliminates tedious data crunching.
Upload your dataset + queries, and instantly get:
✅ Visual reports
✅ AI-generated insights
✅ Automated workflows
Perfect for:
- Analysts 🧾
- Researchers 🔬
- Startups & Businesses 📈
- Anyone who loves turning raw data into knowledge
| Feature | Why It’s Awesome 🚀 |
|---|---|
| 🤖 AI-Powered Insights | Uses Google’s Generative AI to “understand” your data |
| 📊 Rich Visualizations | Generates plots with Seaborn & Matplotlib |
| 🌍 Web Scraper Mode | Fetch live data directly from URLs |
| 📂 Multi-Format Friendly | Accepts CSV, Excel, JSON, Parquet, or TXT |
| 🔄 Ask Many at Once | Batch processing for multiple questions |
| 🖥️ Simple-to-Use Interface | Beginner friendly, no steep learning curve |
| ⚡ Super-Fast Execution | Optimized for speed + real-time feedback |
1️⃣ Clone the Repo - git clone https://github.com/your-username/data-analyst-agent.git
cd data-analyst-agent
Create a .env file inside the root folder:
GEMINI_API_KEY=your_google_api_key
LLM_TIMEOUT_SECONDS=240
Now open http://localhost:8000/ in your browser 🌐
-
Write Your Questions
Create a.txtfile with queries like: What’s the revenue growth month-over-month?, Find correlation between Age and Income, Show most profitable products...etc -
Upload Dataset + Questions File
- Dataset (optional) → CSV, Excel, JSON, Parquet, or TXT
- Questions file (required) → Plain text
- Voilà!
- AI processes the queries
- Generates insights + summaries
- Builds neat visualizations
- FastAPI ⚡ → High-performance web server
- LangChain 🧠 → Orchestrates LLM interactions
- Google Generative AI ✨ → Core AI engine
- Pandas + NumPy 📊 → Data wrangling made smooth
- Seaborn + Matplotlib 🎨 → Clean, insightful charts
- HTML5 + CSS + JavaScript
- Bootstrap-inspired modern UI
| Method | Endpoint | Purpose |
|---|---|---|
GET |
/ |
Access web app |
POST |
/api |
Submit dataset + questions |
GET |
/summary |
App diagnostics & summaries |
| Format | Extensions |
|---|---|
| CSV | .csv |
| Excel | .xlsx, .xls |
| JSON | .json |
| Parquet | .parquet |
| Text | .txt |
- 📈 Business Strategy – Sales, KPIs, forecasts
- 🔬 Research – Data exploration, hypothesis validation
- 🤖 Data Science – Quick EDA, anomaly detection
- 📊 Reporting – Automated dashboards
- ✅ No cloud storage → All data stays local
- ✅ API keys kept safe via
.env - ✅ Configurable CORS policy for production use
Licensed under MIT – free for personal & commercial use.