Professor of Physics & Senior Data Fellow at Belmont University, Nashville TN
Head of Research for Hyperstate Music AI.
Teacher of audio engineers, Opinions my own.
Explainer blog: https://drscotthawley.github.io
gist.github.com/drscotthawle...
gist.github.com/drscotthawle...
github.com/drscotthawle...
github.com/drscotthawle...
Super-minor typo: looks like Colab doesn't render \mathcal{N}: (at least in my browser, Brave.)
Super-minor typo: looks like Colab doesn't render \mathcal{N}: (at least in my browser, Brave.)
VQ (commit) loss is the first to go NaN, then MSE, then cross-entropy.
commitment_weight = 0.6
decay = 0.95,
threshold_ema_dead_code = 2,
rotation_trick = True,
orthogonal_reg_weight=0.2,
2 codebooks, 16 vectors each, 2 dims per vector.
3 of the level-1 vectors look almost unused:
VQ (commit) loss is the first to go NaN, then MSE, then cross-entropy.
commitment_weight = 0.6
decay = 0.95,
threshold_ema_dead_code = 2,
rotation_trick = True,
orthogonal_reg_weight=0.2,
2 codebooks, 16 vectors each, 2 dims per vector.
3 of the level-1 vectors look almost unused:
Re. "What does Vector Quantization do?": If you just look at the locations of vectors, they may look like a gaussian-shaped "blob". But within that blob can be very different probabilities, vs. the smooth shape of a gaussian.
Re. "What does Vector Quantization do?": If you just look at the locations of vectors, they may look like a gaussian-shaped "blob". But within that blob can be very different probabilities, vs. the smooth shape of a gaussian.
Deadline: December 1, 2025.
Planned publication: Summer 2026.
Link to CFP: users.spa.aalto.fi/vpv/JAES_V73...
Deadline: December 1, 2025.
Planned publication: Summer 2026.
Link to CFP: users.spa.aalto.fi/vpv/JAES_V73...