Bubbles
0 points · 51 days ago · 0 comments

At today's Paper Talk session cohosted by IDI, I'm giving a talk today on the paper "The Free Transformer" by Fleuret (preprint available here). This paper describes a modification of the standard decoder-only transformer that learns to condition token generation on random latent variables learned without supervision. We describe some of the intuition and background for the new architecture. My talk is a RevealJS presentation, and is available here. This architecture is very straightforward t...

No comments yet. Log in to discuss on the Fediverse