fraud-ny-property-unsupervised

Unsupervised learning for fraud analytics on NY property data

The data can be found here: https://data.cityofnewyork.us/City-Government/Property-Valuation-and-Assessment-Data/yjxr-fw8i

Based on an analysis of the property valuation and assessment data, we developed an unsupervised model to identify anomalies. Because the data has no fraud labels, we look for unusual records, that is, outliers. We first clean the data, keeping as many records as possible and filling missing fields with reasonable values. After cleaning, we create meaningful variables and scale them before applying PCA for dimensionality reduction. We then build two scores, one from z-scaling and one from an autoencoder. We combine the two scores with rank-order scaling into a final score and sort by this final score to examine the top records. We repeat this step to check whether the algorithm makes sense and adjust it based on feedback from the examination. Our results show that some records with unusual values in certain variables are anomalies because those records contain inaccurate information.
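The scoring steps above can be sketched with NumPy alone. This is a minimal illustration under stated assumptions, not the code from the notebooks: the function names, the number of retained components, the Euclidean distance used for the z-score, and the toy data are all hypothetical. The autoencoder is also swapped for a simpler stand-in, the reconstruction error from the retained principal components, which is what a purely linear autoencoder would learn; a trained nonlinear autoencoder would give different values.

```python
import numpy as np

def zscale(x):
    """Center each column and divide by its standard deviation."""
    std = x.std(axis=0)
    std[std == 0] = 1.0  # guard against constant columns
    return (x - x.mean(axis=0)) / std

def rank_scale(score):
    """Replace each score by its rank divided by n, so values lie in (0, 1]."""
    ranks = np.argsort(np.argsort(score)) + 1  # rank 1 = lowest score
    return ranks / len(score)

def anomaly_scores(features, n_components=3):
    """Return (z-score distance, reconstruction error, combined rank score)."""
    z = zscale(features)
    # PCA via SVD of the z-scaled matrix; rows of vt are the principal axes.
    _, _, vt = np.linalg.svd(z, full_matrices=False)
    axes = vt[:n_components]
    pcs = zscale(z @ axes.T)  # z-scale the retained components again
    # Score 1: Euclidean distance from the origin in z-scaled PC space.
    score_z = np.sqrt((pcs ** 2).sum(axis=1))
    # Score 2 (assumed stand-in for the autoencoder): error when the
    # z-scaled variables are rebuilt from the retained components only.
    reconstruction = (z @ axes.T) @ axes
    score_recon = np.sqrt(((z - reconstruction) ** 2).sum(axis=1))
    # Rank-order scaling puts both scores on the same scale before averaging.
    final = (rank_scale(score_z) + rank_scale(score_recon)) / 2
    return score_z, score_recon, final

# Toy data (hypothetical): 200 ordinary records and one extreme record.
rng = np.random.default_rng(0)
features = rng.normal(size=(200, 5))
features[17] = [30.0, -30.0, 30.0, -30.0, 30.0]

score_z, score_recon, final = anomaly_scores(features)
top_records = np.argsort(final)[::-1][:10]  # indices of the 10 highest scores
```

Averaging ranks rather than raw values keeps either score from dominating just because it has a wider numeric range.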

Once we have the final score, we sort the data by it. By looking at the top records, we can check whether they show anything unusual. With that feedback, we adjust the model by improving the exclusions and the variables, then repeat the step until we find something interesting. To examine the top properties, we look at the z-scaled variables first, which immediately shows which variables have unusual values. We generate a heatmap of the z-scaled variables to see which variables cause the high scores.
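The examination step might look like the sketch below. It is a hedged illustration rather than the notebook code: the helper names, the variable names `r1` to `r3`, and the toy numbers are hypothetical, and the plot assumes matplotlib is installed.

```python
import numpy as np

def top_record_matrix(z_scaled, final_score, n_top=10):
    """Return indices of the n_top highest-scoring records and their z-scaled rows."""
    order = np.argsort(final_score)[::-1][:n_top]
    return order, z_scaled[order]

def plot_heatmap(matrix, row_labels, column_names):
    """Draw records (rows) against variables (columns), coloured by |z|."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for plotting
    fig, ax = plt.subplots()
    image = ax.imshow(np.abs(matrix), aspect="auto", cmap="Reds")
    ax.set_xticks(np.arange(len(column_names)))
    ax.set_xticklabels(column_names, rotation=90)
    ax.set_yticks(np.arange(len(row_labels)))
    ax.set_yticklabels(row_labels)
    fig.colorbar(image, ax=ax, label="|z|")
    return fig

# Toy z-scaled variables and final scores (hypothetical values).
z_scaled = np.array([[0.1, -0.2, 0.3],
                     [9.0, 0.1, -0.4],
                     [0.2, -7.5, 0.0],
                     [0.3, 0.2, 0.1]])
final_score = np.array([0.25, 1.0, 0.75, 0.5])
column_names = ["r1", "r2", "r3"]  # hypothetical variable names

order, top = top_record_matrix(z_scaled, final_score, n_top=2)
# For each top record, the variable with the largest |z| drives the score.
dominant = [column_names[j] for j in np.abs(top).argmax(axis=1)]
# plot_heatmap(top, [str(i) for i in order], column_names)  # optional plot
```

Plotting absolute z values keeps very high and very low values equally visible, so the variable responsible for each high score stands out as a dark cell in its row.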

Files in this repository:

Property_Valuation_and_Assessment_Data_Dictionary(1).xlsx
README.md
ny_property_data_explore.ipynb
ny_property_unsupervised.ipynb

Yogayyj/fraud-ny-property-unsupervised
