Skip to content
View GVRQ's full-sized avatar

Block or report GVRQ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
GVRQ/README.md

Hi there 👋 I'm Alexander,

Stockholm-based data scientist and digital marketing expert with a background in E-Commerce and Management. With an entrepreneurial mindset, I focus on innovative data-driven solutions that drive growth.

Connect with me:

Telegram_Alexander_Gavrilov_Data_Scientist Email_Alexander_Gavrilov_Data_Scientist  Linkedin_Alexander_Gavrilov_Data_Scientist

Recent Data Science Projects:

Project Skills & Tools
Used E-Bikes Price Prediction App (private)

- Data Mining: over 15.000 scraped ads from Blocket.se

- Data Mining: over 1800 scraped e-bike models from PriceRunner.se

- Data Mining: over 2000 scraped e-bike listings from Rebike.com

- Conducted comprehensive analysis of the Swedish electric bike market and international players, providing insights for strategic decision-making.

- Developed predictive machine learning models with outstanding results (97-99% accuracy).

- ML: created a separate app for yearly rental price prediction

- Created insightful reports, leveraging data analysis and machine learning model outcomes.

CatboostRegressor, VS Code, Streamlit, Pandas, Seaborn, GridSearchCV,StandardScaler, Cross_val_score, Shuffle

UAE Real Estate Investments: Price Prediction App (public version)

- Data Mining: 130.000 properties in Dubai

- ML: created a machine learning algorithm for property valuation. R2 score is: 0.94

- ML: created a separate app for yearly rental price prediction

- Created a web app that predicts property value based on property's features

RandomForestRegressor, VS Code, Streamlit, Pandas, Matplotlib,Seaborn,LinearRegression, GridSearchCV, StandardScaler, Cross_val_score, Shuffle

Hackathon | Talent Squad | Data Science

Participated in Data Science hackathon by Barcelona Digital Talent. Project "Influences on Academic Achievement".

Results: 🥈 Individual Ranking #2nd Place.

Hackathon | Jump2Digital 2022

Participated in Data Science hackathon by Barcelona Digital Talent. Project "Air Quality Classification".

Results: 🥇 Individual Ranking #1 Place.

E-Commerce

This Data Science project's goal is to conduct experiments with various metrics on e-commerce data.

Metrics:

  • Gamma-Gamma & BG/NBD
  • Heuristic metrics
  • Customer Lifetime Value prediction and segmentation
  • RFM Segmentation (Recency, Frequency, and Monetary)
  • CLV Segments + RFM

Cryptocurrency online app

Created a simple web app with cryptocurrency exchange rates over time.

VS Code, the CoinCap API, Streamlit.

U.S. Inflation rate vs Fed Rate. 1983 - 2022

Analysis & Correlation between inflation (CPI) and the Fed Rate during 1983 - 2022; US national recessions (1990s, 2000s, Great Recession, Covid-19 recession) + pre & post recession periods.

Exploratory data analysis, linear regression, data visualization, data preprocessing, data processing, correlation, pandas, numpy, seaborn, matplotlib, datareader, sklearn, pyplot, lineplot.

U.S. Home Prices vs. CPI in 2022

Overview of 2022: HPI vs CPI in the US (Forecasting and Analysis). 75 & 35 years of data analysed in order to forecast remaining 2022's months data on HPI & CPI. Forecasting is made using Prophet model.

Data forecasting, exploratory data analysis, linear regression, data visualization, data preprocessing, data processing, prediction, pandas, numpy, seaborn, matplotlib, datareader, sklearn, pyplot, statsmodels, pystan, tsarplot, fbprophet, prophet.

Exploratory Data Analysis(EDA) on Residential Properties

Data science project's goal is to determine the market value of real estate objects and to define typical parameters of residential properties. The real estate listing platform will use Exploratory Data Analysis' conclusions in an automated system to prevent fraudulent activities on the platform.

Exploratory data analysis, data visualization, data preprocessing, data processing, histogram, boxplot, scatter matrix, categorization, scatter plot, fraud monitoring, numpy, seaborn, pyplot, python, pandas, matplotlib

[RU] Customer churn forecast for telecom operator

Project's goal is to predict whether the client will leave the telecom operator in the near future or not. If it turns out that the user plans to leave, he will be offered promo codes and special plans.

Machine learning, data analysis, regression, custom metrics, gradient boosting, logisticregression, randomforestclassifier, catboostclassifier, python numpy, scikit-learn, matplotlib.
   

Completed Projects (can be shared upon request):

- Machine Learning for Texts / Toxic Comment Classification: model development; NLTK, lemmatisation
(Python, Pandas, NLTK, WordNetLemmatizer, WordNet, SciKit-learn, CatBoost)
 
- Music Streaming Service: user behavior analytics , comparative analysis and hypothesis testing (Python, Pandas, Data Analysis)
 
- Credit Risk Analysis: data preprocessing, analysis and hypothesis testing
(Python, Pandas, Data Preprocessing & Analysis)
 
- Real Estate Market Analysis: factors affecting price determination; exploratory data analysis (Python, Pandas, Matplotlib, Visualization, Data Preprocessing, Exploratory Data Analysis)
 
- Statistical Data Analysis / Telecom Tariff Analysis: descriptive statistics, data analysis and hypothesis testing (Python, Pandas, Matplotlib, NumPy, SciPy, Data Preprocessing, EDA, Student's t-test)
 
- Computer Vision / Supermarket Chain's Customer Age Verification: building and evaluating a ML model (Python, Keras, TensorFlow, Image Processing, Neural Networks, Machine Learning)
 
- Machine Learning / Telecom Customer Classification: classification, ML model & hyperparameter selection (Python, Pandas, Matplotlib, SciKit-learn)
 
- Supervised Learning / Bank Customer Churn Prediction: forecasting, classification, ML model & hyperparameter selection (Python, Pandas, Matplotlib, SciKit-learn)
 
- Machine Learning / Mining Company's New Well Location Selection: regression, business model development, bootstrap (Python, Pandas, NumPy, Matplotlib, Seaborn, SciKit-learn)
 
- Machine Learning / Gold Concentration Ratio Prediction: data analysis, regression, custom metrics (Python, Pandas, NumPy, Matplotlib, Seaborn, SciKit-learn)
 
- Insurance Company Client's Data Protection: development of a data anonymization model; linear algebra, regression (Python, NumPy, SciKit-learn)
 
- Numerical Methods / Car Price Prediction: development of a car price prediction model; regression, gradient boosting (Python, NumPy, SciKit-learn, CatBoost, LightGBM)
 
- Time Series / Taxi Demand Prediction: development of a taxi demans prediction model; time series, regression, predictions (Python, NumPy, SciKit-learn, Statsmodels)

Popular repositories Loading

  1. GVRQ GVRQ Public

    Config files for my GitHub profile.

  2. Real_Estate_EDA Real_Estate_EDA Public

    Jupyter Notebook

  3. US_HPI_vs_CPI_2022 US_HPI_vs_CPI_2022 Public

    Jupyter Notebook

  4. customer_churn_prediction customer_churn_prediction Public

    Customer churn forecast for telecom operator

    Jupyter Notebook

  5. Mortgage-Rates-Forecast Mortgage-Rates-Forecast Public

    US Mortgage Rates Forecast for 2022.

  6. U.S.-CPI-vs-Fed-Rate U.S.-CPI-vs-Fed-Rate Public

    U.S. Inflation (CPI) vs Fed Reserve interest rate. 1983-2022

    Jupyter Notebook