MY PROJECTS
Real-Time Taxi Cab Analytics & Streaming Pipeline
Built a fully containerized, end-to-end data pipeline that ingests NYC Yellow Taxi trip data, executes spatial queries using SparkSQL, streams filtered records through Apache Kafka, and loads them into a Neo4j graph database for real-time analytics. The solution supports both batch and streaming modes, enabling large-scale spatial processing and live data ingestion. This project highlights proficiency in Big Data frameworks, real-time stream processing, Kubernetes orchestration, and graph-based analytics.
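As a rough illustration of the filter-then-stream step described above, here is a minimal plain-Python sketch. It is a stand-in, not the project's SparkSQL or Kafka code: the `Trip` fields, the `MIDTOWN_BBOX` coordinates, and the sample rows are all hypothetical, and the JSON strings only approximate what a producer might publish to a topic.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class Trip:
    # Hypothetical, trimmed-down trip record (not the real dataset schema).
    trip_id: int
    pickup_lon: float
    pickup_lat: float
    fare: float


def in_bbox(trip, min_lon, min_lat, max_lon, max_lat):
    """Return True when the pickup point lies inside the bounding box."""
    return (min_lon <= trip.pickup_lon <= max_lon
            and min_lat <= trip.pickup_lat <= max_lat)


def filter_and_serialize(trips, bbox):
    """Keep trips picked up inside bbox and encode each one as a JSON message."""
    return [json.dumps(asdict(t), sort_keys=True)
            for t in trips if in_bbox(t, *bbox)]


# Assumed bounding box (min_lon, min_lat, max_lon, max_lat), for illustration only.
MIDTOWN_BBOX = (-74.00, 40.74, -73.96, 40.77)

# Made-up sample rows.
SAMPLE_TRIPS = [
    Trip(1, -73.985, 40.758, 12.5),  # inside the box
    Trip(2, -73.780, 40.640, 52.0),  # far outside the box
    Trip(3, -74.010, 40.750, 9.0),   # just west of the box
]

messages = filter_and_serialize(SAMPLE_TRIPS, MIDTOWN_BBOX)
```

In a setup like the one described, the range predicate would typically be expressed as a SparkSQL query and the serialized records handed to a Kafka producer; the sketch only shows the shape of the data flow.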


Technologies & Tools:
Scala · Apache Spark · SparkSQL · Docker · sbt · Apache Kafka · ZooKeeper · Kafka Connect · PyArrow · Kubernetes (Minikube) · Helm · Neo4j · Cypher · Python · Bash scripting
Financial Services Cloud Implementation
To modernize wealth management at Citibank, this Salesforce Financial Services Cloud solution centralized fragmented client data, automated compliance workflows, and reduced manual advisor tasks by 30%. Custom Apex triggers and Lightning components improved operational efficiency and strengthened collaboration between financial advisors and relationship managers, making day-to-day service delivery faster and better coordinated.


Service Cloud Case Management Enhancement
Developed for Prudential Insurance, this enhanced case management system in Salesforce Service Cloud reimagined the claims journey by streamlining operations and reducing average resolution times by 25%. Through custom objects, Omni-Channel routing, and automated approval flows, the solution enabled real-time updates and smarter queue handling. Unified dashboards and proactive messaging ensured seamless departmental handoffs, significantly boosting operational efficiency and policyholder satisfaction.


Geo-Cane: Empowering Zimbabwe's Visually Impaired
Geo-Cane is a smart mobility solution equipped with GPS navigation, obstacle detection, and haptic feedback, designed to support 1.3 million visually impaired individuals in Zimbabwe. Developed collaboratively with a global team, the project combines hardware innovation with human-centered design to enhance independence, accessibility, and safe navigation in daily movement.


Expedia Strategy Outlook
This initiative analyzed personalized travel trends and emerging market opportunities to support Expedia’s global expansion. By focusing on high-growth regions like Mauritius, we developed data-driven market entry plans and targeted discount strategies that aligned with shifting traveler behavior and long-term growth objectives.


Healthcare Data Mining
To enable predictive modeling in healthcare, this project involved scraping, cleaning, and processing data on 1,100+ diseases using Python. By developing robust ETL pipelines and enhancing data quality, the work laid the groundwork for accurate machine learning applications in healthcare analytics.
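A cleaning stage of this kind might look roughly like the sketch below. It is illustrative only: the `name` and `symptoms` fields, the comma-separated symptom format, and the sample rows are assumptions, not the project's actual schema or data.

```python
import re


def clean_records(rows):
    """Normalize whitespace, drop unnamed rows, and de-duplicate by name."""
    seen = set()
    cleaned = []
    for row in rows:
        # Collapse runs of whitespace and trim the ends of the disease name.
        name = re.sub(r"\s+", " ", row.get("name") or "").strip()
        if not name:
            continue  # skip rows with no usable name
        key = name.casefold()
        if key in seen:
            continue  # keep only the first occurrence of each disease
        seen.add(key)
        # Split the assumed comma-separated symptom string into a sorted,
        # lower-cased list without duplicates or empty entries.
        raw_symptoms = (row.get("symptoms") or "").split(",")
        symptoms = sorted({s.strip().lower() for s in raw_symptoms if s.strip()})
        cleaned.append({"name": name, "symptoms": symptoms})
    return cleaned


# Made-up scraped rows showing typical defects.
RAW_ROWS = [
    {"name": "  Type 2   Diabetes ", "symptoms": "Fatigue, thirst , fatigue"},
    {"name": "type 2 diabetes", "symptoms": "blurred vision"},
    {"name": "", "symptoms": "cough"},
    {"name": "Asthma", "symptoms": None},
]

cleaned = clean_records(RAW_ROWS)
```

Keeping the first occurrence of a duplicate is one possible policy; merging the symptom lists of duplicate rows would be an equally reasonable choice depending on the source data.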


Predicting Traffic Collision Occurrence & Severity
Using logistic regression, XGBoost, and random forest models, this project predicted traffic collision risk and severity levels from urban road data. Optimized feature selection improved prediction accuracy, helping inform public safety strategies and support data-driven policymaking.

