DoWG accepted at NeurIPS

Our work on an extension of DoG with weighted gradients got accepted for presentation at NeurIPS this year! If you want to try our method, a pytorch implementation is available on github. I hope to see more papers building upon DoG, DoWG, D-Adaptation, and Prodigy, we have barely scratched the surface on what can be done, and some of these methods are already being used in practice.

Konstantin Mishchenko
Konstantin Mishchenko
Research Scientist

I’m a research scientist working on code generation in Paris. I like math, computers, and electronic music.