keras-io
/

conv_Mixer

Image Classification

Model card Files Files and versions Metrics Training metrics Community

harsha163 commited on Jul 11, 2022

Commit

bb701de

·

1 Parent(s): aca7595

updated the description

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -5,17 +5,17 @@ tags:
 - Architecture
 ---
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 - Architecture
 ---
+# Tensorflow Keras implementation of : [Image classification with ConvMixer](https://keras.io/examples/vision/convmixer/)
+The full credit goes to: [Sayak Paul](https://twitter.com/RisingSayak)
+## Short description:
+ConvMixer is a simple model based on the ideas of representing an image as patches( used in ViT) and separating the mixing of Spatial and channel dimensions (used in MLP-Mixer). Unlike ViT and MLP-Mixer, they use only standard Convolution operations. The full paper is a submission to ICLR 22 and can be found [here](https://openreview.net/pdf?id=TVHS5Y4dNvM)
+## Model and Dataset used
+The Dataset used here is CIFAR-10. The model is called ConvMixer-256/8 where 256 is the hidden dimension (the dimension of patches) and 8 is the depth(number of repetitions of ConvMix layers)
 ## Training procedure