Skip to content

Where should NSW real estate investors look into? In this project, we gathered data from various resources in order to predict property price growth rates in NSW.

License

Notifications You must be signed in to change notification settings

ckpradip/NSW-house-price

 
 

Repository files navigation

PyPI - Python Version

NSW Housing Data 🏘️ 🇦🇺

ℹ️   Team Members

Name
Felix Rosenberger Felix's
Christopher Rudolph Christopher's
Ken Walther Sy Ken's
Jingran Zhao Jingran's



Project Summary

In this project, we have chosen to analyse the property prices in NSW, Australia.

Given the real estate market activity in NSW, we were curious to see how the relationship between housing data, demographics, and certain economic factors interact with one another. Also, we were curious whether we could predict housing data in each postcode with these variables.

We explored questions such as:

  • Where should one buy a home?
  • Can we predict prices based on socioeconomic data such as income levels, household size, or interest rates?
  • Can we predict which postcodes will grow faster than average?

Data Preparation

Data Prep

We collected datasets from:

These are cleaned under the Data Preparation Notebooks.

Variable Relationships

Variable analysis

In-depth analysis covered under Main Notebook.ipynb

Top Predictors of House Prices in NSW

Which postcodes will grow more than 5% in the next quarter?

Apparently, there are several postcodes outside of Sydney that are projected to grow more than 5% in the following quarter.

Among these high growth areas, which are the cheapest?

These are postcodes have the lowest median house prices that are projected to grow more than 5% in the following quarter. This piece of insight can be very valuable especially for first-time home buyers looking into the real estate market.

Libraries and packages

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.patches as mpatches
import seaborn as sns
import os

from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier

from sklearn.cluster import KMeans
from scipy.cluster.hierarchy import linkage
from scipy.cluster.hierarchy import dendrogram
from scipy.cluster.hierarchy import cut_tree

from sklearn.model_selection import train_test_split, cross_val_score, cross_val_predict
from sklearn.model_selection import GridSearchCV, KFold, StratifiedKFold
from sklearn.feature_selection import RFE
from sklearn.metrics import accuracy_score, r2_score, mean_squared_error, confusion_matrix

import geopandas as gpd
from mpl_toolkits.axes_grid1 import make_axes_locatable
from shapely.geometry import Point, Polygon

Other packages are also required to be installed:

pip install geopandas
pip install -U mapclassify

Notebook Structure

  • Main Notebook - This is where the codes for the main part of the analysis and predictive modelling are
  • Data Preparation Pt1 (NSW Housing data) - This notebook is where we cleaned all the files for the NSW Housing data
  • Data Preparation Pt2 (Feature data) - This notebook is where we cleaned all the features that we used
  • Variable Analysis - This is where we deep-dive into the analysis of all the variables

About

Where should NSW real estate investors look into? In this project, we gathered data from various resources in order to predict property price growth rates in NSW.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%