Mood Orchestrator Agent 🎵

A sophisticated AI agent built with Google's Agent Development Kit (ADK) that creates personalized music experiences based on your emotional state. The agent captures your voice, analyzes your mood, generates custom music using Suno AI, and synchronizes smart bulbs with the music's wavelengths.

Features

🎤 Voice Input: Captures audio from your microphone
🧠 Mood Detection: Uses Gemini's multimodal AI to analyze sentiment from speech
🎵 Music Generation: Creates custom music matching your mood via Suno AI
🎧 Audio Playback: Plays generated music automatically
🌈 Smart Lighting: Syncs bulb colors with music frequencies
🤖 Autonomous Orchestration: Agent handles the entire workflow intelligently

Architecture

The system uses Google ADK's agent-based architecture with custom tools:

┌─────────────────────────────────────────────────┐
│         Mood Orchestrator Agent                 │
│         (Gemini 2.0 Flash)                      │
│                                                 │
│  Orchestrates the entire workflow:              │
│  1. Capture voice input from microphone         │
│  2. Transcribe and analyze mood from speech     │
│  3. Generate music based on detected mood       │
│  4. Play music and analyze audio                │
│  5. Control bulbs based on wavelengths          │
└─────────────────────────────────────────────────┘
                      │
                      ▼
        ┌─────────────────────────────┐
        │     Custom Tools            │
        ├─────────────────────────────┤
        │ • record_voice_input        │
        │ • transcribe_and_analyze    │
        │ • generate_music            │
        │ • play_music                │
        │ • analyze_audio_wavelengths │
        │ • control_bulb_colors       │
        └─────────────────────────────┘

Prerequisites

Python 3.10 or higher
Google Gemini API key (Get it here)
Suno AI API key (Get it here)
Microphone for voice input
(Optional) Smart bulb server running on localhost:3000

Installation

Clone or navigate to the project directory:
```
cd moodagent
```
Install dependencies (using uv):
```
uv sync
```
Set up environment variables:
```
cp .env.example .env
```
Edit .env and add your API keys:
```
GEMINI_API_KEY=your_gemini_api_key_here
SUNO_API_KEY=your_suno_api_key_here
BULB_SERVER_URL=http://localhost:3000
```
Getting API Keys:
- Gemini API: Visit Google AI Studio to get your free API key
- Suno AI API: Visit SunoAPI.org to obtain your API key

Usage

Run the agent:

uv run python main.py

Example Session

🎵  MOOD ORCHESTRATOR AGENT  🎵
============================================================
Welcome! I'll capture your voice, detect your mood,
generate personalized music, and sync your smart lights!
============================================================

🎤 Ready to start! The agent will guide you through the process.

Type 'start' to begin, or 'quit' to exit.

You: start

🤖 Agent:
------------------------------------------------------------
I'll help you create a personalized music experience! Let me start by 
recording your voice to understand your mood.

🔧 Executing tool: record_voice_input
🎤 Recording for 5 seconds... Speak now!
✓ Recording complete!

🔧 Executing tool: transcribe_and_analyze_mood
📝 Transcription: I'm feeling really happy and energetic today!
😊 Detected Mood: energetic (confidence: 0.95)
💭 Reasoning: High vocal energy, upbeat tone, positive words

Great! I detected you're feeling energetic! Let me generate some 
upbeat music for you...

🔧 Executing tool: generate_music
🎵 Generating music for mood: energetic
⏳ Task created. Waiting for generation...
✓ Music generated successfully!

🔧 Executing tool: play_music
🎵 Now playing: generated_music_energetic.mp3

🔧 Executing tool: analyze_audio_wavelengths
🎼 Dominant frequencies: 523.3Hz, 1046.5Hz, 2093.0Hz

🔧 Executing tool: control_bulb_colors
💡 Setting bulb color: RGB(128, 64, 255) for 523.3Hz

Perfect! I've created energetic music for you and synced your lights 
with the music's rhythm. Enjoy! 🎵
------------------------------------------------------------

Tools Description

1. `record_voice_input`

Captures audio from microphone
Duration: 5 seconds (configurable)
Output: WAV file

2. `transcribe_and_analyze_mood`

Transcribes speech using Gemini
Analyzes mood from vocal tone and content
Classifies: happy, sad, calm, energetic, angry
Returns confidence score

3. `generate_music`

Uses Suno AI API (api.sunoapi.org)
Non-custom mode for simplicity
Maps mood to music style
Model: V4_5ALL (better song structure, max 8 min)
Polls for completion (up to 4 minutes)
Returns 2 songs per request (uses first one)
Downloads MP3 file
Stream URL available in 30-40 seconds
Full quality download in 2-3 minutes

4. `play_music`

Plays audio using pygame
Non-blocking playback

5. `analyze_audio_wavelengths`

FFT analysis of audio
Extracts dominant frequencies
Returns frequency spectrum

6. `control_bulb_colors`

Maps frequencies to RGB colors
Sends HTTP requests to bulb server
Graceful fallback if server unavailable

Configuration

Edit config.py to customize:

Audio settings: Sample rate, channels, recording duration
Mood mappings: Customize music styles for each mood
Bulb server: Change URL or endpoint
Suno API: Model version (V4_5ALL - better song structure, max 8 min)

Project Structure

moodagent/
├── main.py              # Main application with ADK runner
├── agent.py             # Agent definition and instructions
├── config.py            # Configuration and constants
├── tools/               # Custom ADK tools
│   ├── voice_tool.py    # Voice recording
│   ├── mood_tool.py     # Mood analysis
│   ├── music_tool.py    # Music generation
│   ├── audio_tool.py    # Playback and analysis
│   └── bulb_tool.py     # Smart bulb control
├── utils/               # Utility functions
│   └── helpers.py       # File operations
├── temp/                # Temporary files
├── pyproject.toml       # Dependencies
├── .env                 # Environment variables (create from .env.example)
└── README.md            # This file

Dependencies

google-adk: Agent Development Kit
google-genai: Gemini API for multimodal AI
requests: HTTP client for Suno API
sounddevice: Microphone recording
scipy: Audio file I/O
pygame: Audio playback
numpy: Audio analysis
python-dotenv: Environment variables
pydantic: Type validation

Troubleshooting

Microphone Issues

Ensure microphone permissions are granted
Check default audio device settings
Try adjusting AUDIO_SAMPLE_RATE in config.py

API Errors

Verify API keys are correct in .env
Check API quota/limits
Ensure internet connection

Bulb Server

The agent works without bulb server (graceful fallback)
Adjust endpoint in bulb_tool.py to match your server API

Future Enhancements

Real-time audio streaming for live mood detection
Multiple music generation options
Advanced audio-to-color algorithms
Support for multiple bulb systems
Web UI for easier interaction
Music library and history

License

MIT License

Credits

Built with:

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
mood_orchestrator_agent		mood_orchestrator_agent
tools		tools
utils		utils
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
PROJECT_VISION.md		PROJECT_VISION.md
README.md		README.md
__init__.py		__init__.py
agent.py		agent.py
config.py		config.py
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mood Orchestrator Agent 🎵

Features

Architecture

Prerequisites

Installation

Usage

Example Session

Tools Description

1. `record_voice_input`

2. `transcribe_and_analyze_mood`

3. `generate_music`

4. `play_music`

5. `analyze_audio_wavelengths`

6. `control_bulb_colors`

Configuration

Project Structure

Dependencies

Troubleshooting

Microphone Issues

API Errors

Bulb Server

Future Enhancements

License

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Mood Orchestrator Agent 🎵

Features

Architecture

Prerequisites

Installation

Usage

Example Session

Tools Description

1. record_voice_input

2. transcribe_and_analyze_mood

3. generate_music

4. play_music

5. analyze_audio_wavelengths

6. control_bulb_colors

Configuration

Project Structure

Dependencies

Troubleshooting

Microphone Issues

API Errors

Bulb Server

Future Enhancements

License

Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. `record_voice_input`

2. `transcribe_and_analyze_mood`

3. `generate_music`

4. `play_music`

5. `analyze_audio_wavelengths`

6. `control_bulb_colors`

Packages