- Notifications
You must be signed in to change notification settings - Fork5
Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.
License
steggie3/goose-dataset
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This is a dataset of 1,000 Canada goose images. They are allmugshots, with the head being the main feature of the image. Each image comes with an annotation XML in the PASCAL VOC format with a bounding box of the head and the object classgoose-head.
Canada geese have a distinct and high-contrast facial pattern that is highly recognizable. The images in this dataset are all somewhat visually similar. The head in the image is either the left side or the right side, not front or back. The background is usually blurry. Only one goose head is present in each image. These properties makes the dataset rathersimple for various computer vision tasks, and can serve as a starting point for computer vision experiments such as object recognition, localization, detection, etc.
All images are photographed by me.
Sample images from the dataset.Theimages directory contains 1,000 JPG images of the size 800 x 533. Theannotations directory contains the XML annotations. These can be directly used in thedarkflow implementation of object detection by placing them under darkflow/test/training.
If you are only using the images and not the annotations, I also implemented aload_data() API that has compatible usage as those in theKeras built-in datasets.
(x_train,y_train), (x_test,y_test)=GooseDataset.load_data(test_ratio=0.2,grayscale=False,cropped=False,resize_shape=None,save=False)
Returns:
- 2 tuples:
- (x_train, x_test): Arrays of integers with shape (n_samples, n_rows, n_columns, 3) in the range [0, 255] for RGB, or arrays of floats with shape (n_samples, n_rows, n_columns, 1) in the range [0, 1] for grayscale.
- (y_train, y_test): Arrays of integers with shape (n_samples, 1). All values are 1, which stands for presence of goose in the picture.
- 2 tuples:
Arguments:
- test_ratio: Float. Ratio of (number of testing data) / (number of all data).
- grayscale: Boolean. Whether to load the images in grayscale.
- cropped: Boolean. Whether to load only the cropped bounding boxes of the images. IfTrue, a new directorycropped_images will be created in addition to loading the images.
- resize_shape: [n_rows, n_columns], target shape of the loaded images. IfNone, the default shape [533, 800] will be used.
- save: Boolean. Whether to save the images as how they are loaded into a new directoryprocessed_images in addition to loading. This can be used to visualize the actual images being loaded after grayscaling, cropping, and/or resizing.
To see an actual example, please checkthe Example Jupyter notebook.
Please do not use this dataset to derive any work that harms animals.