Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

default p_min #64

Open
ianlini opened this issue Sep 12, 2016 · 7 comments
Open

default p_min #64

ianlini opened this issue Sep 12, 2016 · 7 comments
Assignees
Labels

Comments

@ianlini
Copy link
Collaborator

ianlini commented Sep 12, 2016

https://github.com/ntucllab/striatum/blob/master/striatum/bandit/exp4p.py#L64
What's this? Any reference?

@yangarbiter
Copy link
Collaborator

@ianlini
Copy link
Collaborator Author

ianlini commented Sep 12, 2016

How do we define T?

@taweihuang
Copy link
Contributor

The problem here is that we do not know the exact value of N and T for initialization.
T is the total times of recommendation, while N is the number of experts.

@ianlini
Copy link
Collaborator Author

ianlini commented Sep 12, 2016

I think we should fix N.
What happens if we have more than T rounds?

@ianlini
Copy link
Collaborator Author

ianlini commented Sep 12, 2016

I think the actions and experts should both be fixed...
I don't think Exp4.P can handle changes of actions and experts reasonably...
This is a big change, any idea?
@yangarbiter @stegben @SoluMilken

@taweihuang
Copy link
Contributor

Yeah, the original EXP4P cannot handle new actions and experts.
For new actions, if we retrain our experts, I think it's still okay. But for new experts, I think the original algorithm could not handle this case.

@ianlini
Copy link
Collaborator Author

ianlini commented Sep 12, 2016

After retraining the experts, I don't think the weight can still work, and the new weight of a new action is also a problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants