I'm a current graduate CIS student at the University of Pennsylvania. I am interested in distributed systems and software engineering
CIS 5050 FINAL PROJECT: PennCloud | Email me for report! Fall 2025
- Engineered a highly available distributed storage and messaging backend using C++ in a docker environment, implementing a custom multi-threaded Key-Value store (like bigtable) featuring a select-based event loop, pthread worker pool and gRPC interface.
- Architected a highly fault tolerant replication engine based on Paxos Consensus model to ensure strong consistency between server nodes in C++
- Designed a robust checkpointing and recovery logic to minimize downtime, prevent data loss, network partitions and ensure quickest recovery
- Technologies: C++, gRPC, TCP, Valgrind, bash, cMake
CIS 5450 FINAL PROJECT: Big Data Analytics Fall 2025
- Integrated and cleaned two large-scale datasets (~13GB) from Steam, PlayStation, and Xbox, managing complex many-to-many joins and nested data structures
- Implemented K-Fold Target Encoding for high-cardinality features and applied log transformations to address heavy skew in purchase data
- Developed a 15-layer Deep Neural Network (PyTorch) with Batch Normalization, alongside XGBoost and Random Forest models to predict game success.
- Conducted analysis (EDA) to identify market leaders, discovering that "Action" and "Indie" genres account for over 400,000 total purchases. Technologies: Python (Pandas, PyTorch, XGBoost, Scikit-Learn), SQL, and Matplotlib.

