SegFormer also uses a Transformer encoder to build hierarchical feature maps, but it adds a simple multilayer perceptron (MLP) decoder on top that combines all the feature maps and makes the final prediction.
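
To make the idea of an MLP decoder over hierarchical feature maps more concrete, here is a minimal NumPy sketch. It is a toy illustration under stated assumptions, not the actual SegFormer implementation: the function name `mlp_decode`, the random weights, the channel counts, and the spatial sizes are all hypothetical, and nearest-neighbour upsampling is used purely for simplicity. Each feature map is projected to a common channel width with a per-pixel linear layer, upsampled to the finest resolution, concatenated, fused, and classified per pixel.

```python
import numpy as np


def mlp_decode(feature_maps, proj_weights, fuse_weight, cls_weight):
    """Fuse multi-scale feature maps with per-pixel linear layers (toy sketch)."""
    # The first feature map is assumed to have the finest resolution.
    target_h, target_w = feature_maps[0].shape[1:]
    unified = []
    for fmap, weight in zip(feature_maps, proj_weights):
        # fmap: (C_i, H_i, W_i), weight: (D, C_i). Linear layer applied at every pixel.
        projected = np.einsum("dc,chw->dhw", weight, fmap)
        # Nearest-neighbour upsampling to the finest resolution (a simplification).
        scale_h = target_h // projected.shape[1]
        scale_w = target_w // projected.shape[2]
        unified.append(projected.repeat(scale_h, axis=1).repeat(scale_w, axis=2))
    # Concatenate along the channel axis: (num_maps * D, H, W).
    stacked = np.concatenate(unified, axis=0)
    # Fuse with another per-pixel linear layer followed by ReLU.
    fused = np.maximum(np.einsum("dc,chw->dhw", fuse_weight, stacked), 0.0)
    # Per-pixel class scores, then the highest-scoring class at each pixel.
    logits = np.einsum("kc,chw->khw", cls_weight, fused)
    return logits.argmax(axis=0)


# Hypothetical shapes: four encoder stages with decreasing resolution.
rng = np.random.default_rng(0)
channels = [32, 64, 160, 256]
sizes = [16, 8, 4, 2]
embed_dim, num_classes = 64, 5

feature_maps = [rng.standard_normal((c, s, s)) for c, s in zip(channels, sizes)]
proj_weights = [rng.standard_normal((embed_dim, c)) * 0.1 for c in channels]
fuse_weight = rng.standard_normal((embed_dim, embed_dim * len(channels))) * 0.1
cls_weight = rng.standard_normal((num_classes, embed_dim)) * 0.1

segmentation = mlp_decode(feature_maps, proj_weights, fuse_weight, cls_weight)
print(segmentation.shape)  # (16, 16)
```

The output is one class index per pixel at the resolution of the finest feature map. In a trained model the weights would be learned, and the prediction would typically be upsampled further to the input image size.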