With more than 1.5 years of experience working as a Software Engineer, I was part of a Data Migration and Analytics project. My expertise include SQL queries, Python, Spark, GCP, AWS cloud and Manual Testing. I'm also constantly willing to learn new technologies. Currently a Computer Science Graduate Student, I aspire to become skilled Data Engineer.
I am passionate about exploring new destinations and experiencing different cultures through travel. From hiking in the mountains to strolling through bustling city streets, I love immersing myself in diverse environments and discovering hidden gems around the world.
As a dedicated and motivated individual, I am committed to making meaningful contributions to the field of computer science and leveraging my skills and knowledge to drive positive change
Serving as the Team Lead and Publicity Head at SSVM, a nonprofit sports institute in India, I successfully organized and trained 150 coaches in various physical activities. Additionally, I led two sports camps with over 1800 participants. I also volunteered for clerical tasks such as filing, creating presentations, data entry, and article writing. Fun Fact: I was a coach in the Mallakhamb Training Team which conducted two sports camps in Munich in 2010.
In this project, real time data streaming is simulated using python to understand how data is sent from producer to consumer. The purpose of this project is to understand working of kafka and try hands on exercises.
Tools Used : Python, Kafka, AWS S3, Glue, Athena
Techniques : Data Cleaning, Kafka Concepts, ETL
Check it outBeginner ETL project to understand and explore various AWS technologies
AWS Services and languages used: AWS S3, Crawlers, Glue, Athena, Redshift, Python, SQL
Check it outThis project leverages data analysis and visualization techniques to gain insights into crime trends in New York City. By analyzing crime data from 2010 to 2021, we aim to inform data-driven decision-making processes for enhancing public safety measures.
Tools Used : Python, Pandas, matplotlib, seaborn
Techniques : Exploratory Data Analysis, Statistical Analysis, Visualizations
Check it outThis project leverages data analysis and visualization techniques to gain insights into taxi records for the year 2020 in New York City.
Tools Used : Python, Pandas, TensorFlow Data Valiadation, matplotlib, seaborn
Techniques : Exploratory Data Analysis, Statistical Analysis, Visualizations
Check it out• Confronted with the need for robust data migration and quality assurance processes amidst a large-scale platform transition.
• Designed and developed 15 table structures on Google Cloud Platform, facilitating seamless data migration from Hadoop to BigQuery. Collaborated with a cross-functional team to conduct comprehensive manual testing and data quality checks.
• Provided Production support by monitoring ingestion logs. Ensured data accuracy and completeness for over 70 supply chain and sourcing module objects, enhancing operational efficiency and decision-making capabilities.
• Faced with complex data extraction and integration requirements across diverse data sources.
• Assisted in the development and integration of data pipelines extracting data from Oracle, SQL server, and MS Excel, transforming data as per specifications, and loading it into appropriate storage systems.
• Supported data consistency and integrity efforts, ensuring seamless data flow and reliability for downstream analysis.
• Identified significant data inaccuracies impacting program effectiveness and decision-making processes.
• Implemented rigorous quality control measures, including comprehensive data validation checks, and resolving over 1,000 data errors monthly across databases and spreadsheets.
• Achieved a 20% increase in data accuracy, providing program staff and leadership with reliable data insights for informed decision-making.
• TA for Python-based Introduction to CS. Assist in developing and delivering instructional materials, including lectures, assignments, and coding exercises to a class of 25 high school students.
• Monitor and track student progress, identifying and addressing any recurring challenges or areas of difficulty.