Skip to content

Latest commit

 

History

History
182 lines (146 loc) · 8.36 KB

README.md

File metadata and controls

182 lines (146 loc) · 8.36 KB

AODRaw: Towards RAW Object Detection in Diverse Conditions

Paper link

RAW and corresponding sRGB images from different conditions
space-1.jpg

Table of Contents

Introduction

Existing object detection methods often consider sRGB input, which was compressed from RAW data using ISP originally designed for visualization. However, such compression might lose crucial information for detection, especially under complex light and weather conditions. We introduce the AODRaw dataset, which offers 7,785 high-resolution real RAW images with 135,601 annotated instances spanning 62 categories, capturing a broad range of indoor and outdoor scenes under 9 distinct light and weather conditions. Based on AODRaw that supports RAW and sRGB object detection, we provide a comprehensive benchmark for evaluating current detection methods. We find that sRGB pre-training constrains the potential of RAW object detection due to the domain gap between sRGB and RAW, prompting us to directly pre-train on the RAW domain. However, it is harder for RAW pre-training to learn rich representations than sRGB pre-training due to the camera noise. To assist RAW pre-training, we distill the knowledge from an off-the-shelf model pre-trained on the sRGB domain. As a result, we achieve substantial improvements under diverse and adverse conditions without relying on extra pre-processing modules.

Apps and Sourcecode

Dataset and Downloading

Organize the dataset into the following directory structure. These directories are not always needed. We can selectively store a directory according to the needs of our tasks.

├── AODRaw
    ├── images (435G)
        ├── 00000000.ARW # RAW images
        ├── 00000000.JPG # sRGB images
        ├── ...
        ├── 00007784.ARW
        ├── 00007784.JPG
    ├── images_downsampled_raw (223G) # uses train/test_annotations_downsample_scale3_bbox_min_size32.json
    ├── images_downsampled_srgb (4.3G) # uses train/test_annotations_downsample_scale3_bbox_min_size32.json
    ├── images_slice_raw (439G) # uses train/test_annotations_slice_1280_300.json
    ├── images_slice_srgb (23G) # uses train/test_annotations_slice_1280_300.json
    ├── annotations        
        ├── train_annotations.json
        ├── test_annotations.json
        ├── train_annotations_downsample_scale3_bbox_min_size32.json
        ├── test_annotations_downsample_scale3_bbox_min_size32.json
        ├── train_annotations_slice_1280_300.json
        ├── test_annotations_slice_1280_300.json

These directories support RAW-based object detection and sRGB-based object detection.

Directory Usage Resolution Images Downloads Baidu Downloads Google Downloads TeraBox
images original RAW and sRGB images $6000\times 4000$ 7,785 Images Images
images_downsampled_raw down-sampled RAW images $2000\times 1333$ 7,785 Images
images_downsampled_srgb down-sampled sRGB images $2000\times 1333$ 7,785 Images Images
images_slice_raw sliced RAW images $1280\times 1280$ 71,782
images_slice_srgb sliced sRGB images $1280\times 1280$ 71,782 Images Images
annotations annotations - - Anno Anno Anno

Among these directories, the images directory contains the original images in AODRaw, where most of the images are captured in a resolution of $6000\times 4000$. To feed these images into detectors, we preprocess the images in two ways. The images_slice_raw and images_slice_srgb are generated by Slicing. The images_downsampled_raw and images_downsampled_srgb are generated by Down-sampling.

You can download these directories using the links above or generate them from the images directory as follows. If you want to preprocess images using different settings from our default setting, you should also generate corresponding annotation files.

Down-sampling preprocessing

Down-sampling images for RAW-based object detection:

python downsample/downsample_raw_images.py \
--root [path to your AODRaw images, e.g., /AODRaw/images] \
--target [the path to save downsampled images, e.g., /AODRaw/images_downsample_raw] \
--cuda [optional, processing RAW images in a GPU]

Down-sampling images for sRGB-based object detection:

python downsample/downsample_srgb_images.py \
--root [path to your AODRaw images, e.g., /AODRaw/images] \
--target [the path to save downsampled images, e.g., /AODRaw/images_downsample_srgb]

Slicing preprocessing

Slicing RAW images into $1280\times 1280$ patches for RAW-based object detection and generating the annotation file:

python slice/slice_raw_images.py --base-json slice/configs/slice_raw_train_images.json
python slice/slice_raw_images.py --base-json slice/configs/slice_raw_test_images.json

Slicing sRGB images into $1280\times 1280$ patches for sRGB-based object detection and generating the annotation file:

python slice/slice_srgb_images.py --base-json slice/configs/slice_srgb_train_images.json
python slice/slice_srgb_images.py --base-json slice/configs/slice_srgb_test_images.json

We can modify the config files to specify the path. save_ann_file and save_dir mean the save path for images and annotation files. The annotation files will be generated at save_ann_file if save_ann_file is not empty. We set save_ann_file as empty by default because we have provided the downloading links of the annotation files as above.

{
  "img_root": "/AODRaw/images/", 
  "save_dir": "/AODRaw/images_slice_raw",
  "save_ann_file": ""
}

Dataset Information

Images

Split Category Images Instances
Train 62 5,445 94,949
Test 62 2,340 40,652

Annotations

We provide annotations following the COCO format, as follows. The tag item shows the light and weather conditions in an image.

{
  "images": [
    {
      "id": 0,
      "file_name": "00001336.ARW",
      "height": 6000, 
      "width": 4000, 
      "tag": [
          "low_light" 
      ]
    },
  ], 
  "type": "instance",
  "categories": [
    {
      "supercategory": "none",
      "id": 0,
      "name": "person"
    },
  ], 
  "annotations": [
    {
      "segmentation": [],
      "area": 4686,
      "ignore": 0,
      "iscrowd": 0,
      "image_id": 0,
      "bbox": [],
      "category_id": 0,
      "id": 0
    },
  ]
}

Citation

@article{li2024aodraw,
  title={Towards RAW Object Detection in Diverse Conditions}, 
  author={Zhong-Yu Li and Xin Jin and Boyuan Sun and Chun-Le Guo and Ming-Ming Cheng},
  journal={arXiv preprint arXiv:2411.15678},
  year={2024},
}

License

The code is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License for NonCommercial use only.