=========================================== The Caltech-UCSD Birds-200-2011 Dataset =========================================== For more information about the dataset, visit the project website: http://www.vision.caltech.edu/visipedia If you use the dataset in a publication, please cite the dataset in the style described on the dataset website (see url above). Directory Information --------------------- - images/ The images organized in subdirectories based on species. See IMAGES AND CLASS LABELS section below for more info. - parts/ 15 part locations per image. See PART LOCATIONS section below for more info. - attributes/ 322 binary attribute labels from MTurk workers. See ATTRIBUTE LABELS section below for more info. ========================= IMAGES AND CLASS LABELS: ========================= Images are contained in the directory images/, with 200 subdirectories (one for each bird species) ------- List of image files (images.txt) ------ The list of image file names is contained in the file images.txt, with each line corresponding to one image: ------------------------------------------ ------- Train/test split (train_test_split.txt) ------ The suggested train/test split is contained in the file train_test_split.txt, with each line corresponding to one image: where corresponds to the ID in images.txt, and a value of 1 or 0 for denotes that the file is in the training or test set, respectively. ------------------------------------------------------ ------- List of class names (classes.txt) ------ The list of class names (bird species) is contained in the file classes.txt, with each line corresponding to one class: -------------------------------------------- ------- Image class labels (image_class_labels.txt) ------ The ground truth class labels (bird species labels) for each image are contained in the file image_class_labels.txt, with each line corresponding to one image: where and correspond to the IDs in images.txt and classes.txt, respectively. --------------------------------------------------------- ========================= BOUNDING BOXES: ========================= Each image contains a single bounding box label. Bounding box labels are contained in the file bounding_boxes.txt, with each line corresponding to one image: where corresponds to the ID in images.txt, and , , , and are all measured in pixels ========================= PART LOCATIONS: ========================= ------- List of part names (parts/parts.txt) ------ The list of all part names is contained in the file parts/parts.txt, with each line corresponding to one part: ------------------------------------------ ------- Part locations (parts/part_locs.txt) ------ The set of all ground truth part locations is contained in the file parts/part_locs.txt, with each line corresponding to the annotation of a particular part in a particular image: where and correspond to the IDs in images.txt and parts/parts.txt, respectively. and denote the pixel location of the center of the part. is 0 if the part is not visible in the image and 1 otherwise. ---------------------------------------------------------- ------- MTurk part locations (parts/part_click_locs.txt) ------ A set of multiple part locations for each image and part, as perceived by multiple MTurk users is contained in parts/part_click_locs.txt, with each line corresponding to the annotation of a particular part in a particular image by a different MTurk worker: