Image Patch Transformation
Yannic and Hugo discuss whether transforming image patches into tokens can be considered a form of convolution. They explore the idea of treating different parts of an image differently and the role positional embeddings play in that process.
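
The convolution question can be made concrete with a small sketch. Assuming a standard ViT-style patch embedding (non-overlapping P x P patches, each flattened and passed through one shared linear layer), the same tokens come out of a convolution whose kernel size and stride both equal P. The function names, shapes, and random weights below are illustrative assumptions, not code from the episode or the DeiT release.

```python
import numpy as np

def patchify_linear(img, W, b, P):
    """Cut img (H, W, C) into non-overlapping PxP patches, flatten, project."""
    H, Wd, C = img.shape
    gh, gw = H // P, Wd // P
    # (gh, P, gw, P, C) -> (gh, gw, P, P, C) -> (num_patches, P*P*C)
    patches = img.reshape(gh, P, gw, P, C).transpose(0, 2, 1, 3, 4)
    patches = patches.reshape(gh * gw, P * P * C)
    return patches @ W + b

def patchify_conv(img, W, b, P):
    """Same projection written as a convolution with kernel size P, stride P."""
    H, Wd, C = img.shape
    D = W.shape[1]
    kernel = W.reshape(P, P, C, D)  # same weights, viewed as a conv kernel
    gh, gw = H // P, Wd // P
    out = np.empty((gh, gw, D))
    for i in range(gh):
        for j in range(gw):
            window = img[i * P:(i + 1) * P, j * P:(j + 1) * P, :]
            out[i, j] = np.tensordot(window, kernel, axes=([0, 1, 2], [0, 1, 2])) + b
    return out.reshape(gh * gw, D)

rng = np.random.default_rng(0)
P, C, D = 4, 3, 5                       # hypothetical patch size, channels, token dim
W = rng.normal(size=(P * P * C, D))
b = rng.normal(size=D)

img = rng.normal(size=(8, 8, C))
tokens_linear = patchify_linear(img, W, b, P)
tokens_conv = patchify_conv(img, W, b, P)
same_result = np.allclose(tokens_linear, tokens_conv)

# An image built from one repeated patch: shared weights give identical tokens.
tiled = np.tile(rng.normal(size=(P, P, C)), (2, 2, 1))
tiled_tokens = patchify_linear(tiled, W, b, P)
identical_without_pos = np.allclose(tiled_tokens, tiled_tokens[0])

# Adding a per-position embedding (learned in practice, random here) breaks the tie.
pos = rng.normal(size=tiled_tokens.shape)
identical_with_pos = np.allclose(tiled_tokens + pos, (tiled_tokens + pos)[0])
```

In this sketch the weight sharing across patches is what makes the step convolution-like, while the positional embedding is the only thing that lets the model tell two identical patches at different locations apart.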
From this podcast

Machine Learning Street Talk (MLST)
#044 - Data-efficient Image Transformers (Hugo Touvron)