Skip to content

A CNN that classifies images as 'Real' or 'Fake' with 96% accuracy.

Notifications You must be signed in to change notification settings

stefsyrsiri/synthetic-image-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Generated Synthetic Images Detection | CNN Classifier

download (1)

Overview

The boom of AI-powered content generation and increasing interest in the research field of Deep Learning has led to widely accessible (and trending) tools that can produce content of any kind: text, image, audio, and video. While AI isn't new, the the availability of powerful low-code generative AI applications to the public is. AI-generated content can often be indistinguishable from its authentic counterparts, posing a threat to the credibility of digital media. The underlying dangers of the misuse of GenAI have already come to surface with deepfakes, voice cloning, fakes news, disinformation, identity theft and various types of scams. In fact, a survey conducted by Microsoft in 2023 shows that 71% of respondents are worried about AI scams.

In this project, we focus on image generation, which can have multiple societal effects, especially on people not familiar with this kind of technology. Our task is to train a neural network to identify whether an image is real or AI-generated.

Table of Contents

Files

  • functions.py : All the preprocessing, model training, evaluation functions and CNN class used in the report.
  • report.ipynb : The full machine learning pipeline. EDA, preprocessing, model training, transfer learning, evaluation, gradio deployment.

Dataset

CIFAKE: Real and AI-Generated Synthetic Images is a comprehensive collection of 60,000 synthetically-generated images and 60,000 real images (collected from CIFAR-10). The dataset contains two classes, labelled as "REAL" and "FAKE". There are 100,000 images for training (50k per class) and 20,000 for testing (10k per class). Since the training an test sets have 50% of each class, there is no class imbalance that needs to be taken care of for our binary classification task.

Results

Screenshot 2024-10-07 132401

Authors

License

Distributed under the MIT License

About

A CNN that classifies images as 'Real' or 'Fake' with 96% accuracy.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages