1. Syed Mumtaz Ali and Samuel D. Silvey. A general class of coefficients of divergence of one distribution from another. Journal of the Royal Statistical Society: Series B (Methodological), 28(1):131–142, 1966.
2. Shun-ichi Amari and Atsumi Ohara. Geometry of $q$-exponential family of probability distributions. Entropy, 13(6):1170–1185, 2011.
3. Francis Bach. On the equivalence between kernel quadrature rules and random feature expansions. Journal of Machine Learning Research, 18(1):714–751, 2017.
4. Francis Bach. Information theory with kernel methods. IEEE Transactions on Information Theory, 2022.
5. Francis Bach, Simon Lacoste-Julien, and Guillaume Obozinski. On the equivalence between herding and conditional gradient algorithms. In International Conference on Machine Learning, pages 1355–1362, 2012.