The overall flow of the analysis is summarized with the help of a flow chart.
- /model : saved XGBoost model (see the loading sketch below)
- /data :
  - raw_data/ : original dataset
  - interim_data/ : processed dataset with the added variable 'target'
- /doc :
  - report : a quick report that summarizes the whole analysis
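A minimal sketch of how the saved model under /model could be loaded for scoring; the filename used here is an assumption, not necessarily the one in this repository.

```python
import xgboost as xgb

# Load the saved XGBoost classifier; the filename is assumed for illustration.
model = xgb.XGBClassifier()
model.load_model("model/xgboost_model.json")

# Once loaded, the model can score new, pre-processed data:
# probs = model.predict_proba(X_new)[:, 1]
```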
Since the data is highly imbalanced, we focused on the ROC AUC metric for each model. The individual model metrics are:
Model | Test data AUC |
---|---|
Random Forest | 0.908558 |
XGBoost | 0.964785 |
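The AUC values above could be produced with a standard scikit-learn / xgboost workflow along the lines of the sketch below; the synthetic data and hyperparameters are placeholders, not the project's actual setup.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

# Synthetic imbalanced data used only to make the sketch runnable;
# the real analysis uses the dataset under /data.
X, y = make_classification(n_samples=5000, weights=[0.95], random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y, random_state=42)

models = {
    "Random Forest": RandomForestClassifier(n_estimators=300, random_state=42),
    "XGBoost": XGBClassifier(eval_metric="auc", random_state=42),
}

for name, clf in models.items():
    clf.fit(X_train, y_train)
    # ROC AUC uses the predicted probability of the positive class, which is
    # more informative than accuracy on highly imbalanced data.
    auc = roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1])
    print(f"{name}: test AUC = {auc:.6f}")
```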
Since the data is highly imbalanced, our focus is on high recall. We build a final contingency table for each model to analyse the bucketing of the ranks.
Random Forest | XGBoost |
---|---|
![]() | ![]() |
By comparing the contingency tables, we can see that XGBoost performs better, bucketing the majority into the lowest rank of 1.
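One way the rank-bucketing contingency tables could be built is by cutting the predicted probabilities into rank buckets and cross-tabulating them against the true target; the number of buckets and the rank labels below are assumptions, not the repository's exact choice.

```python
import pandas as pd

def contingency_table(y_true, probs, n_buckets=4):
    # Assign each observation a rank bucket from its predicted probability:
    # rank 1 = lowest predicted risk, rank n_buckets = highest.
    ranks = pd.Series(
        pd.cut(probs, bins=n_buckets, labels=range(1, n_buckets + 1)),
        name="rank",
    )
    # Cross-tabulate the rank buckets against the true class.
    return pd.crosstab(ranks, pd.Series(y_true, name="target"))

# Example usage with the fitted models from the AUC sketch above:
# print(contingency_table(y_test, models["XGBoost"].predict_proba(X_test)[:, 1]))
```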
- Feature Engineering
- Feature Selection
More light could be put on Feature Engineering, by creating more informative features, and on Feature Selection, to find the important subset of features for each model.
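As a starting point for the feature-selection item, a model-based importance filter could be used; SelectFromModel and the median threshold below are assumptions, not the project's chosen method.

```python
from sklearn.feature_selection import SelectFromModel
from xgboost import XGBClassifier

# Keep only features whose importance is above the median importance.
selector = SelectFromModel(
    XGBClassifier(eval_metric="auc", random_state=42),
    threshold="median",
)

# X_train, y_train: the training split from the AUC sketch above.
X_train_selected = selector.fit_transform(X_train, y_train)
print(f"Kept {X_train_selected.shape[1]} of {X_train.shape[1]} features")
```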