Highlights
- Pro
-
In this project, we utilize PySpark to perform data aggregation tasks on a TMDB dataset. We aim to create pre-aggregated tables for genres and identify the most popular film in each original language.
Jupyter Notebook UpdatedJun 2, 2024 -
AirBnB-Barcelona Public
Airbnb hosts a vast amount of data regarding property listings, reviews, and calendar availability. Analyzing this data can provide valuable insights for both hosts and guests, such as identifying …
Jupyter Notebook UpdatedFeb 29, 2024 -
Welcome to our comprehensive end-to-end data engineering project tailored for e-commerce. In this project, We have designed and implemented a robust data pipeline to handle diverse datasets from va…
-
-
SQL-problem-solving Public
This repository is dedicated to providing straightforward solutions to SQL problems commonly found on coding platforms. Each file in this repository contains a solution to a specific problem, makin…
TSQL UpdatedJan 9, 2024 -
alteryx-mini-projects Public
This repository is dedicated to weekly challenges designed to test and enhance your Alteryx skills. Whether you're a seasoned Alteryx user or just getting started, these challenges provide a platfo…
UpdatedDec 20, 2023 -
-
Credit-Risk-Assessment Public
Welcome to the German Credit Risk Analysis repository! This project is an exploration of credit risk prediction using a German credit dataset. We dive into the world of data analysis, feature engin…
-
8-Week-SQL-Challenge Public
This repository serves as a record of my progress, solutions, and insights gained during this learning experience.
TSQL UpdatedOct 11, 2023 -
Bank-Marketing-Campaign Public
This project explores and analyzes the outcomes of a bank's marketing campaign aimed at selling long-term deposits.
-
Absenteeism-at-Work-Analysis Public
This GitHub repository contains a comprehensive analysis of employee absenteeism at work using a real-world dataset. The project leverages Python and popular data analysis libraries to explore, vis…
-
-
Predict house prices in King County, USA using ML. This project develops models to estimate prices based on property features. Explore, preprocess data, build models, and predict using pipelines. I…
Jupyter Notebook UpdatedAug 29, 2023 -
This Jupyter Notebook explores a telecom customer churn dataset and performs various tasks such as data preprocessing, exploratory data analysis (EDA), predictive modeling using different algorithm…
-
-
Boston-housing- Public
Domain: Real Estate Difficulty: Easy to Medium Challenges: Missing value treatment Outlier treatment Understanding which variables drive the price of homes in Boston Summary: The Boston housing dat…
Jupyter Notebook UpdatedJun 16, 2023 -
Crop-Yield-Prediction Public
The crop yield prediction project aims to develop a predictive model that estimates the yield of crops based on various environmental factors. By analyzing historical data and employing statistical…
-
BigMart-Sales-Prediction Public
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which pla…
Jupyter Notebook UpdatedJun 15, 2023 -
Loan-Prediction Public
Comprehensive loan applicant data with attributes like gender, marital status, education, income, loan details, and approval status. Includes an IPython Notebook file with loan approval prediction …
Jupyter Notebook UpdatedJun 10, 2023 -
Iris-Classifier-Project Public
This project is used to classify the different species of iris floweMachine Learning Project : Iris-flower-classification This program applies basic machine learning (classification) concepts on Ir…
Jupyter Notebook UpdatedJun 9, 2023 -
SparkFoundation Public
internship in Data Science and Business Intelligence
Jupyter Notebook UpdatedSep 17, 2022 -