Image Patch Transformation

Yannic and Hugo discuss whether transforming image patches into tokens can be considered a form of convolution. They explore the idea of treating different parts of an image differently and the role positional embeddings play in doing so.
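
One way to make the question concrete is the sketch below. It is not taken from the conversation, and it assumes a ViT-style patch embedding: non-overlapping square patches, each flattened and passed through a single shared linear map. The function names, the toy image size, and the random weights are all hypothetical and chosen only for illustration. Under those assumptions, the linear map over flattened patches gives the same numbers as a convolution whose kernel size and stride both equal the patch size. Because the weights are shared, every patch is treated identically, and in this sketch the only position-dependent term is the positional embedding added afterwards.

```python
import numpy as np


def patch_tokens_linear(image, weight, patch):
    """Split an (H, W, C) image into non-overlapping patches, flatten each,
    and apply one shared linear map. Assumes H and W are divisible by patch."""
    h, w, c = image.shape
    gh, gw = h // patch, w // patch
    # (gh, patch, gw, patch, c) -> (gh, gw, patch, patch, c)
    patches = image.reshape(gh, patch, gw, patch, c).transpose(0, 2, 1, 3, 4)
    flat = patches.reshape(gh * gw, patch * patch * c)
    return flat @ weight  # shape: (num_patches, dim)


def patch_tokens_conv(image, weight, patch):
    """The same computation written as a convolution with
    kernel size == stride == patch (explicit loops for clarity)."""
    h, w, c = image.shape
    gh, gw = h // patch, w // patch
    dim = weight.shape[1]
    # Reinterpret the linear weights as a (patch, patch, c, dim) kernel.
    kernel = weight.reshape(patch, patch, c, dim)
    out = np.empty((gh, gw, dim))
    for i in range(gh):
        for j in range(gw):
            window = image[i * patch:(i + 1) * patch,
                           j * patch:(j + 1) * patch, :]
            # Sum over the window's height, width, and channel axes.
            out[i, j] = np.tensordot(window, kernel, axes=([0, 1, 2], [0, 1, 2]))
    return out.reshape(gh * gw, dim)


rng = np.random.default_rng(0)
image = rng.normal(size=(8, 8, 3))        # toy image, hypothetical size
patch, dim = 4, 5                          # hypothetical patch size and token width
weight = rng.normal(size=(patch * patch * 3, dim))

tokens_linear = patch_tokens_linear(image, weight, patch)
tokens_conv = patch_tokens_conv(image, weight, patch)
same = np.allclose(tokens_linear, tokens_conv)  # both views give the same tokens

# The shared weights cannot tell patches apart by location;
# a per-position embedding is what adds that information here.
pos_embedding = rng.normal(size=tokens_linear.shape)
tokens_with_position = tokens_linear + pos_embedding
```

The loops in `patch_tokens_conv` are written for readability rather than speed. A deep learning framework would typically express the same step as a single strided convolution layer.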