-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support sample weights when bucketing #29
Comments
@operte , no, currently no bucketing method support weights. How exactly would you envision the weights to work in the skorecard context? I can only imagine it working when fitting the supervised bucketers Or do you mean something different? |
That is what I meant, yes. Another example: for the FixedFrequencyBucketer, you might also have weights on the classes, which also changes the counts on each bucket, and then the bucketer needs to adjust that. Tbh I'm not sure how to fix this in an elegant way, or if skorecard should have a way of doing this. In my particular problem, what I ended up doing was oversample my datasets before feeding them in to skorecard. I think @timvink and @orchardbirds discussed a way of implementing this. |
Revisited this. We would need to do this consistently for all bucketers, which would be a lot of work. Instead, probably we could create 1 or 2 new bucketers that do what you want to do. The easiest solution is indeed to up/downsample but that quickly becomes quite large & slow. I also checked and even If you already know the weights on the classes however, wouldn't using Do you already have a design in mind? |
@operte thoughts? |
No, I don't have different ideas for this. As I mentioned, in our case we went with the oversampling approach and that turned out fine. Since you also can't find this option in other popular bucketers, this might not be a popular problem, so one option is to drop this. We might spend time adding the option of weights to a particular skorecard bucketer, but the user might want to use a different bucketer. Perhaps it's best to leave this issue to the user and just propose the oversampling approach in the documentation. I did not understand the suggestion of using |
Dear skorecard team, are there bucketers that support weights?
E.g. we undersampled our negative target for modelling, and now we need to apply a weight in this class to get proper default rates in each bucket.
I looked at the DecisionTreeBucketer but it doesn’t seem to support that.
The text was updated successfully, but these errors were encountered: