LD-Net is a deep learning-based algorithm designed to enhance visibility in hazy images and videos, improving visual quality and clarity. This project implements a lightweight dehazing network based on a convolutional autoencoder (CAE) architecture.
- Achieved a 40% improvement in structural similarity index (SSIM) compared to baselines
- Reduced model size by 70% while maintaining performance
- Integrated PyTorch model with a React.js frontend via a Flask API
- Enables real-time image processing with under 2-second latency
The project consists of two main components:
- Frontend (React.js)
- Backend (Python Flask API)
cd frontend
npm install
npm run dev
cd Backend
pip install -r requirements.txt
python main.py
python -m venv .venv
source .venv/bin/activate
python -m venv .venv
source .venv/Scripts/activate
Install the following modules in your virtual environment:
pip install torch opencv-python numpy tqdm matplotlib
Note: Make sure to compile and run the project using the virtual environment kernel.
LD-Net utilizes the encoder-decoder structure of a Convolutional Autoencoder (CAE) to learn a latent representation that captures the haze-free content of the image. The decoder then reconstructs the dehazed image from this latent representation.
Key advantages of LD-Net include:
- Lightweight architecture suitable for real-time applications
- Computational efficiency
- Improved image quality metrics (PSNR and SSIM) compared to hazy input images
The network has been evaluated on a benchmark hazy image dataset, demonstrating its effectiveness in improving image quality metrics like PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index).
- Further optimization of the model for even faster processing
- Expansion of the dataset to improve generalization
- Integration with mobile platforms for on-device dehazing
Ashutosh Jha Ayush Sehgal