Training decoupling#7
Conversation
|
I've review the code most of it looks good so far. I've run the examples and all of them run but I have a warning with the SVAE, GSM1 and GSM2 notebooks: This is related to the plotting in the very last cell of the notebooks. We probably want to fix it before to merge the code. (It's possible that the warning is already in the master branch... ) Regarding the code I have a couple of suggestions:
|
|
I will deal with the warning here (though it is present in the master as well :-P ). Then, I'll comment on the other issues you raised. So, the issues is not ours to solve. See numpy/numpy#9505 for details. The Bokeh library just spell "np.float" instead of "np.floating". |
Declare exp_llh(), kl_div_posterior_prior() and natural_grad_update() here. This is the right thing to do, because train_conj_exp() expects any ConjugateExponentialModel() to provide these methods.
|
The above commit partially adresses your first bullet -- the I believe that |
|
Ok, thanks to have checked for the warning. I have one more suggestion: you now have the function |
|
The third bullet and your renaming proposition have been adressed above. As for the second bullet --- turning beer.training.mini_batches() into PyTorch implementation -- I think it should not happen now, but in |
|
Sure, let's make this change when the pytorch version of the log-linear models is finished. I think it's almost finished. As a final change I would eventually add a docstring to Otherwise, if you agree, we can merge it now to the master. |
The aim of this branch is mostly to decouple the models from the training.