123 days ago · Tech · 0 comments

At today's Paper Talk session, cohosted by IDI, I'm giving a talk on the paper "The Free Transformer" by Fleuret (preprint available here). The paper describes a modification of the standard decoder-only transformer that learns to condition token generation on random latent variables, which are learned without supervision. We describe some of the intuition and background for the new architecture. My talk is a RevealJS presentation and is available here. This architecture is very straightforward t...
