Harsh-Belekar/Samsung-Data-Analysis

🏢 Samsung Data Analysis

End-to-End Data Analytics Project | Python · PostgreSQL · Power BI

Python · Pandas · NumPy · Matplotlib · Seaborn · Jupyter


Project Folder Structure

Samsung Data Analysis/
    │
    ├── Data Warehouse/
    │       ├── Data/
    │       │   └── Raw_data.zip
    │       │
    │       ├── Docs/
    │       │   ├── Data_Catalog.md
    │       │   └── Naming_Conventions.md
    │       │
    │       ├── Images/
    │       │   ├── Data_Warehouse_Architecture.png
    │       │   └── Schema.png
    │       │
    │       ├── Logs/
    │       │   ├── Init_database.log
    │       │   ├── DDL_Bronze.log
    │       │   ├── Load_Bronze.log
    │       │   ├── DDL_Silver.log
    │       │   ├── Helper_func.log
    │       │   ├── Load_Silver.log
    │       │   └── DDL_Gold.log
    │       │
    │       ├── Python Scripts/
    │       │   ├── Bronze/
    │       │   │   ├── DDL_Bronze.py
    │       │   │   └── Load_Bronze.py
    │       │   ├── Silver/
    │       │   │   ├── DDL_Silver.py
    │       │   │   ├── Helper_func.py
    │       │   │   └── Load_Silver.py
    │       │   ├── Gold/
    │       │   │   └── DDL_Gold.py
    │       │   ├── Init_database.py
    │       │   └── File_converter.py
    │       │
    │       ├── SQL Scripts/
    │       │   ├── Bronze/
    │       │   │   ├── DDL_Bronze.sql
    │       │   │   └── Proc_Load_Bronze.sql
    │       │   ├── Silver/
    │       │   │   ├── DDL_Silver.sql
    │       │   │   ├── Helper_function.sql
    │       │   │   └── Proc_Load_Silver.sql
    │       │   ├── Gold/
    │       │   │   └── DDL_Gold.sql
    │       │   └── Init_database.sql
    │       │
    │       └── Data_Warehouse.md
    │
    │
    ├── Data Analysis/
    │       ├── Dashboard/
    │       │   ├── Samsung_Data_Analysis_Dashboard.pdf
    │       │   ├── Samsung_Data_Analysis.pbix
    │       │   └── Images/
    │       │
    │       ├── Data/
    │       │   └── Gold_Schema.zip
    │       │
    │       ├── Docs/
    │       │   ├── Business_Problem.md
    │       │   └── Business_Problem.pdf
    │       │
    │       ├── Logs/
    │       │   ├── Create_tables.log
    │       │   └── Insert_data.log
    │       │
    │       ├── Notebooks/
    │       │   ├── Exploratory_Data_Analysis.ipynb
    │       │   └── Samsung_Data_Analysis.ipynb
    │       │
    │       ├── Python Scripts/
    │       │   ├── Create_tables.py
    │       │   └── Insert_data.py
    │       │
    │       ├── SQL Scripts/
    │       │   ├── Create_tables.sql
    │       │   └── Insert_data.sql
    │       │
    │       ├── Visuals/
    │       │
    │       └── Data Analysis.md
    │
    ├── requirements.txt
    ├── LICENSE
    └── README.md
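
The folder tree pairs each warehouse script with a log file of the same name, and the Logs listing suggests a Bronze, Silver, Gold run order. The sketch below builds that plan as (script, log) pairs. The order is inferred from the listing, not confirmed by the repository, and `build_run_plan`, `run_plan`, and `WAREHOUSE_STEPS` are hypothetical names introduced only for illustration. `File_converter.py` is left out because the tree shows no matching log for it.

```python
import subprocess
import sys
from pathlib import Path

# Assumed order, inferred from the Logs/ listing in the tree;
# the repository may run these scripts differently.
WAREHOUSE_STEPS = [
    "Init_database.py",
    "Bronze/DDL_Bronze.py",
    "Bronze/Load_Bronze.py",
    "Silver/DDL_Silver.py",
    "Silver/Helper_func.py",
    "Silver/Load_Silver.py",
    "Gold/DDL_Gold.py",
]


def build_run_plan(warehouse_dir):
    """Pair every script with the log file that shares its stem."""
    root = Path(warehouse_dir)
    plan = []
    for step in WAREHOUSE_STEPS:
        script = root / "Python Scripts" / step
        log = root / "Logs" / (Path(step).stem + ".log")
        plan.append((script, log))
    return plan


def run_plan(plan):
    """Run each script in order and stop at the first failure."""
    for script, _log in plan:
        subprocess.run([sys.executable, str(script)], check=True)


if __name__ == "__main__":
    # Print the plan only; call run_plan(plan) once the database is reachable.
    for script, log in build_run_plan("Data Warehouse"):
        print(f"{script}  ->  {log}")
```

Printing the plan before running it is a cheap way to check paths, since the folder names contain spaces and the scripts presumably need a live PostgreSQL connection.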

⚠️ Dataset Disclaimer

All datasets used in this project are dummy, synthetic, or public, generated programmatically using Python for learning and portfolio demonstration purposes only.

No real customer data or proprietary Samsung information has been used. This project is not affiliated with, endorsed by, or connected to Samsung in any way.
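
As a rough illustration of what "generated programmatically using Python" can look like, here is a minimal sketch that builds a small, reproducible synthetic orders table with the standard library. The column names, product labels, and value ranges are all hypothetical and do not come from the project's actual generator or schema.

```python
import csv
import io
import random
from datetime import date, timedelta

# Hypothetical columns, chosen only for this example.
FIELDS = ["order_id", "order_date", "product", "quantity", "unit_price"]
PRODUCTS = ["Phone A", "Tablet B", "TV C"]  # made-up product labels


def generate_orders(n_rows, seed=42):
    """Return n_rows synthetic order records; the same seed gives the same data."""
    rng = random.Random(seed)
    start = date(2024, 1, 1)
    rows = []
    for order_id in range(1, n_rows + 1):
        rows.append({
            "order_id": order_id,
            "order_date": (start + timedelta(days=rng.randrange(365))).isoformat(),
            "product": rng.choice(PRODUCTS),
            "quantity": rng.randint(1, 5),
            "unit_price": round(rng.uniform(100, 2000), 2),
        })
    return rows


def to_csv(rows):
    """Serialize the records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(generate_orders(5)))
```

Seeding a dedicated `random.Random` instance keeps the output repeatable between runs, which helps when the same synthetic data has to be reloaded into a database.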


👩‍💻 Author

👀 Tanvi Jadhav
Data Analyst | SQL | Power BI | Python | Excel | Data Visualization
📬 LinkedIn | 🔗 GitHub | 📧 tanvijadhav360@gmail.com


🧑‍💻 Contributor

👀 Harsh Belekar
Data Analyst | Python Developer | SQL | Power BI | Excel | Data Visualization
📬 LinkedIn | 🔗 GitHub | 📧 harshbelekar74@gmail.com


⭐ If you found this project helpful, feel free to star the repo and connect with me for collaboration!

About

🏢 End-to-End Data Analysis project for Samsung | Synthetic dataset of 2M+ rows · 14 tables · 6 business domains | Python • PostgreSQL • Power BI | Solving 7 Business Challenges.

Languages

  • Python 60.7%
  • PLpgSQL 39.3%