PyTorch Image Segmentation Project

This repository contains an implementation of image segmentation using PyTorch and the U-Net architecture. The project focuses on human segmentation using the EfficientNet-B0 encoder and custom training pipeline.

Overview

The project implements a complete image segmentation pipeline including:

Custom dataset handling
Data augmentation
U-Net architecture with EfficientNet-B0 encoder
Combined loss function (Dice + BCE)
Training and validation loops
Model checkpointing

Requirements

torch
opencv-python
numpy
pandas
matplotlib
scikit-learn
tqdm
albumentations
segmentation-models-pytorch

Project Structure

├── Dataset(folder)               # Training data information
├── Deep_Learning_with_PyTorch_ImageSegmentation.ipynb  # Main implementation file
└── helper.py                     # Helper functions for visualization

Setup and Installation

Clone the repository:

git clone https://github.com/amangupta143/PyTorch-Image-Segmentation.git
cd PyTorch-Image-Segmentation

Install required packages:

pip install segmentation-models-pytorch
pip install -U albumentations
pip install opencv-contrib-python

Download the dataset:

git clone https://github.com/parth1620/Human-Segmentation-Dataset-master.git

Model Architecture

Base Architecture: U-Net
Encoder: EfficientNet-B0 (pretrained on ImageNet)
Input Channels: 3 (RGB)
Output Classes: 1 (Binary Segmentation)
Loss Function: Combination of Dice Loss and Binary Cross-Entropy Loss

Training Configuration

EPOCHS = 35
LEARNING_RATE = 0.003
IMAGE_SIZE = 320
BATCH_SIZE = 16
ENCODER = 'timm-efficientnet-b0'
WEIGHTS = 'imagenet'

Data Augmentation

Training augmentations include:

Resize to 320x320
Horizontal Flip (50% probability)
Vertical Flip (50% probability)

Model Training

The training process includes:

Custom training and validation functions
Model checkpointing for best validation loss
Adam optimizer
GPU acceleration support

optimizer = torch.optim.Adam(model.parameters(), lr=LEARNING_RATE)

Usage

Prepare your dataset and update the CSV_FILE path in the configuration
Run the training script:

python deep_learning_with_pytorch_imagesegmentation.py

Inference

The model can be used for inference as follows:

model.load_state_dict(torch.load('bestModel.pt'))
image, mask = validset[idx]
logits_mask = model(image.to(DEVICE).unsqueeze(0))
pred_mask = torch.sigmoid(logits_mask)
pred_mask = (pred_mask > 0.5) * 1.0

Acknowledgments

Dataset originally from: Human-Segmentation-Dataset

License

MIT License

Feel free to use this implementation and modify it according to your needs. Contributions are welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Dataset		Dataset
Jupyter Notebook File		Jupyter Notebook File
Trained Model		Trained Model
LICENSE		LICENSE
README.md		README.md
helper.py		helper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch Image Segmentation Project

Overview

Requirements

Project Structure

Setup and Installation

Model Architecture

Training Configuration

Data Augmentation

Model Training

Usage

Inference

Acknowledgments

License

About

Releases

Packages

Languages

License

amangupta143/PyTorch-Image-Segmentation

Folders and files

Latest commit

History

Repository files navigation

PyTorch Image Segmentation Project

Overview

Requirements

Project Structure

Setup and Installation

Model Architecture

Training Configuration

Data Augmentation

Model Training

Usage

Inference

Acknowledgments

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages