Skip to content

Praneet460/Tide-Transaction-Receipt-Matching

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tide Transaction Receipt Matching

The overall flow of the analysis is summerized with the help of a flow chart

flowchart

Where to find

  • /model : XGBoost saved model
  • /data :
    • raw_data/ : original dataset
    • interim_data/ : processed dataset with added variable 'target'
  • /doc :
    • report : a quick report that summarizes the whole analysis

Quick Model Metrics

As, we find our data is highly imbalanced, we focussed on ROC AUC metrics for each model. Individual models metrics are

Model Test data AUC
Random Forest 0.908558
XGBoost 0.964785

Quick Business Metrics

As, data is highly imbalance, our area of focus is high Recall. We build a final contingency table for each model to analyse the buckting of the ranks.

Random Forest XGBoost
rfm xgboost

By compairing the contigency table we can see XGBoost performs well in bucketing majority to the lower rank of 1.

Areas of Improvement

  • Feature Enginerring
  • Feature Selection

Can put more light on Feature Engineering by creating more important features and Feature Selection to find the important subset of features related to different models.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published