Levi-Heath
/

Towards-Errorless-Training-ImageNet-1k

Image Classification

image classification

Model card Files Files and versions

Towards-Errorless-Training-ImageNet-1k / README.md

Levi-Heath's picture

Update README.md

a32eb7c verified 4 months ago

|

1.87 kB

	---
	license: cc-by-nc-nd-4.0
	language:
	- en
	pipeline_tag: image-classification
	tags:
	- ImageNet-1k
	- image classification
	datasets:
	- ILSVRC/imagenet-1k
	metrics:
	- accuracy
	authors:
	- Bo Deng, University of Nebraska-Lincoln
	- Levi Heath, University of Colorado Colorado Springs
	---

	# Towards Errorless Training ImageNet-1k
	This repository host MATLAB code and models for the manuscript, Towards Errorless Training ImageNet-1k, which is available at https://arxiv.org/abs/2508.04941.
	We give 6 models trained on the ImageNet-1k dataset, which we list in the table below.
	Each model is featured model of archtecture 17x40x2.
	That is, each model is made up of 17x40x2=1360 FNNs, all with homogeneous architecture (900-256-25 or 900-256-77-25),
	working in parallel to produce 1360 predictions which determine a final prediction using the majority voting protocol.

	We trained the 6 models using the following transformation of the 64x64 downsampled ImageNet-1k dataset:
	- downsampled images to 32x32, using the mean values of non-overlapping 2x2 grid cells and
	- trimmed off top row, bottom row, left-most column, and right-most column.

	This transformed data results in 30x30 images, hence 900-dimensional input vectors.

	For a thorough description of our models trained on the ImageNet-1k dataset, please read our preprint linked above.


	\| Model \| Training Method \| FNN Architecture \| Accuracy (%) \|
	\| ------------- \| ------------- \| ------------- \| ------------- \|
	\| Model_S_h1_m1 \| SGD \| 900-256-25 \| 98.247 \|
	\| Model_S_h1_m2 \| SGD \| 900-256-25 \| 98.299 \|
	\| Model_S_h2_m1 \| SGD \| 900-256-77-25 \| 96.990 \|
	\| Model_T_h1_m1 \| SGD followed by GDT \| 900-256-25 \| 98.289 \|
	\| Model_T_h1_m2 \| SGD followed by GDT \| 900-256-25 \| 98.300 \|
	\| Model_T_h2_m1 \| SGD followed by GDT \| 900-256-77-25 \| 97.770 \|
	*SGD = stochastic gradient descent
	**GDT = gradient descent tunneling