OpenPicPal

OpenPicPal is an open-source tool for image training and automatic classification.

OpenPicPal, being a Python-based project, aims to train and obtain a required classification model by leveraging the InceptionV3 base model and a pre-prepared image dataset. And then, it automates image classification using the resulting model.

This image classification project involves the following key steps:

Choose a Base Model: For this project, InceptionV3 serves as the base model (considering its balance between accuracy and runtime performance, with not too many parameters).
Prepare Image Data: This includes the preparation of image datasets for training, validation, testing, etc.
Train the Model: Generate the required result model.
Image Classification/Prediction: Utilize the resulting model for image classification/prediction.

Data Preparation

The source of image data can be obtained by searching for open-source datasets on the internet or by writing custom web crawlers. Prepare the image data and organize it into the following directory structure:

data/
    |__saved_model/     # Directory for saving trained model files
    |__train/           # Directory for training image datasets categorized into subdirectories
        |__book
        |__cat
        |__digit
        |__movie
        ……
    |__validation/      # Validation image datasets, ensuring the same subdirectory structure as 'train'

Image Dataset Configuration

Each subdirectory under 'train' should contain image files corresponding to the respective categories (e.g., 'book/' directory contains all images related to books).
Prepare as many subdirectories as there are categories.
Select approximately 1/4 to 1/3 of the images from 'train' subdirectories and place them in corresponding 'validation' subdirectories. For example, if there are 100 images in 'train/book/', consider moving 25 images to 'validation/book/' to create the validation dataset.
Ensure that the number of subdirectories in 'validation' matches the number in 'train', and maintain a certain proportion (e.g., validation dataset size is 1/4 to 1/3 of the training dataset).
Generally, more images lead to better classification performance, but consider training time and performance when deciding on the dataset size.
If you have new categories, add new subdirectories in 'train' and 'validation'.

Development Environment

Required python libraries:

    python==3.9.2
    keras==2.11.0
    tensorflow==2.11.0

Clone code:

git clone [email protected]:dlooto/open-picpal.git

Navigate to the project root directory: cd open-picpal
Create the data directory structure as described in "Data Preparation." The data/saved_model/ directory is used to store trained model file.
Create and modify configuration files:

cp picpal/config.py.example  picpal/config.py     # Copy the example file
vi picpal/config.py         # Modify the relevant parameters

Install Python libraries using pip:

pip install -r requirements.txt

Training and Classification

Set Class Labels:

# picpal/config.py
CLASS_LABELS = {
    0: "book",
    1: "cat",
    2: "digit",
    3: "movie",
}

Ensure that the CLASS_LABELS configuration in config.py matches the subdirectory names in 'train' and 'validation.' You can modify the label "book" to "books," for example, but make sure to update the subdirectory names in both 'train' and 'validation' accordingly.

Modify Training Parameters:

MODEL_FILE_NAME = 'new_model.h5'    # Model file name used for image classification

IMG_WIDTH, IMG_HEIGHT = 256, 256     
BATCH_SIZE = 32                      
EPOCHS = 20

The (IMG_WIDTH, IMG_HEIGHT) parameters set the required input image size for the InceptionV3 model. Adjust these dimensions based on your specific business requirements. For example, if your business primarily deals with portrait images (where the height is much larger than the width), set IMG_HEIGHT to be greater than IMG_WIDTH (or a multiple of IMG_WIDTH).
The EPOCHS represents the number of iterations over the dataset during training. For instance, if you set EPOCHS to 10, training will iterate over the entire dataset 10 times.
During each epoch, the dataset is typically divided into batches for processing. The BATCH_SIZE determines the number of samples in each batch.
Different settings for EPOCHS and BATCH_SIZE will affect training duration.

Train the Model:

python -m picpal.train

Image Classification:

python -m picpal.predict

Using as a Library

You can also use OpenPicPal as a Python library in other business code.

Build and publish:

python setup.py sdist bdist_wheel

Find the generated package (e.g., open-picpal-0.1.0.tar.gz) in the 'dist' directory and install it:

pip install dist/open-picpal-0.1.0.tar.gz

Using in business code:

from picpal.train import Trainer

# Train the model
trainer = Trainer(
    epochs=20,
    batch_size=32, 
    img_width=256, 
    img_height=256
)
trainer.train()


# Image Classification
from picpal.config import *
from picpal.predict import Predictor

predictor = Predictor(
    model_path=get_model_path(),
    class_labels=CLASS_LABELS,
    img_width=IMG_WIDTH,
    img_height=IMG_HEIGHT
)

img_path_list = [
    "cat/14694fdd.png",
    "movie/e08b23f.png",
    "book/33h07bu31.jpg",
]

for i in img_path_list:
    result = predictor.predict_with_image_path(
        get_validation_img_path(i)
    )
    print(result, i)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
picpal		picpal
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenPicPal

Data Preparation

Image Dataset Configuration

Development Environment

Training and Classification

Using as a Library

About

Releases

Packages

Languages

License

dlooto/open-picpal

Folders and files

Latest commit

History

Repository files navigation

OpenPicPal

Data Preparation

Image Dataset Configuration

Development Environment

Training and Classification

Using as a Library

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages