GANOrigin: Deepfake Detection & GAN Attribution for Faces using Latent Space

This is a AI/ML project that aims to detect and attribute GAN generated images for faces. It is a two stage pipeline that first detects whether an image of a face is fake or not. If the image is found to be fake, it will attribute to which among the GAN models generated it. Our current implementation is able to attribute images to StyleGAN2, StyleGAN3 and ProGAN.

Technologies Used

ProGAN: Generative model used to geerate images for latent feature extracction for model training.
StyleGAN2: Generative model used to geerate images for latent feature extracction for model training.
StyleGAN3: Generative model used to geerate images for latent feature extracction for model training.
Hugging Face Hub: For model hosting and deployment.
Universal Fake Detect: Pretraned model used for deepfake detection
e4e Encoder: Encoder used to project images to latent space.
Netlify: Used to deploy frontend.
Render: Used to deploy backend.
React: Used to create the website for frontend.
Express: Used as middleware for backend
Flask: Backend framework
Python: Overall language ussed for this project
Kaggle: Used to train the model using its compute resources.
Vite: Deployment of frontend.
TailwindCSS: Used for styling the frontend with React.
Gardio: Used to create the interface on Hugging Face Hub.

Modules

Our proposed model in it's current stage is implemented using two modules; 1) Deepfake Detection Module 2) Deepfake Attribution Module.

Deepfake Detection Module:
This module is responsible for detecting whether the inputted image is deepfake or not. Our main body of work is not focused on this module. So we have just implemented a pretrained model, UniversalFakeDetect which is pretty good for detecting deepfakes especially those generated by models from GAN family.
Deepfake Attribution Module:
This module is responsible for attributing the image to the GAN model that generated the image. This is our main body of work. How this module works is that once a deepfake image of a face is detected by the deepfake detection module, it will be fed into this module. This module will pass it through e4eEncoder, which will project the image to latent space which will give us latent features. The latent features are then fed into a Linear SVM model that predicts which GAN model generated that particular deepfake.

Model Training

The model training that was mainly done was for the Linear SVM. The Linear SVM was trained of latent features that were obtained from 5000 images of each class among StylGAN2, StyleGAN3 and ProGAN. Each of those images were projected into W+ latent space. The latent features were then used to train the Linear SVM model after subjecting to Resolution Augmentation as to avoid model convergence due to resolution artifactation.

Methodology

It uses latent space to achieve this.

Image --(CLIP ViT-L/14)--> Deepfake Detection --(if real)--> Output real
Image --(CLIP ViT-L/14)--> Deepfake Detection --(if fake)--> Deepfake Image Detected --(e4e Encoder)--> latent features --(Linear SVM)--> Predicted Architecture

Results

We obtained a Linear SVM that demonstrated an accuracy of 99%.

Below given is the classification report for the model:

	precision	recall	f1-score	support
progan	1.00	0.98	0.99	1000
stylegan2	0.99	1.00	0.99	1000
stylegan3	0.99	0.99	0.99	1000
-------------	------------	----------	-----------	-----------
accuracy			0.99	3000
macro avg	0.99	0.99	0.99	3000
weighted avg	0.99	0.99	0.99	3000

Setup

Clone Repo

git clone https://github.com/HamiYasir/Deepfake-Detection-GAN-Attribution-for-Faces-using-Latent-Space.git   
cd Deepfake-Detection-GAN-Attribution-for-Faces-using-Latent-Space

Backend Setup

cd backend
python -m venv venv  
venv\Scripts\activate   # Windows  
pip install -r requirements.txt  
Run: python app.py

Frontend Setup

cd frontend
npm install
npm run dev

API Endpoint

POST /predict

Input: Image file Output:

{
"prediction": "StyleGAN2",
"confidence": 0.94,
"probabilities": {...},
"deepfake_score": 0.97
}

Future Works

Can extend the scope from just images of faces to objects/people.
Can extend the scope from just image inputs to video, audio, etc...
Current implementation only works on GAN models. It would be amazing to figure out a way to include Diffusion models and especially account for images generated by ChatGPT, NanoBanana Pro, etc.
I saw a cool site for Deepfake Detection. Although it only does deepfake detection, it does this cool explainability where it analyzes each part of the detected image and tells what feature's were detected that contributed that image to being a deepfake or not. While judging by my limited knowledge, I think that explainabilityh is achieved by pixel artefact detction and considereing the fact that we are not doing that, it would still be pretty cool to see that explainability in here.

Deployment

This has been deployed at https://huggingface.co/spaces/HAMIwithHugs/GANOrigin.

It also has been deployed at https://ganorigin.netlify.app/. But due to free tier restrictions on render, this deployment wont work.

Enjoy :)

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GANOrigin: Deepfake Detection & GAN Attribution for Faces using Latent Space

Technologies Used

Modules

Model Training

Methodology

Results

Setup

API Endpoint

Future Works

Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GANOrigin: Deepfake Detection & GAN Attribution for Faces using Latent Space

Technologies Used

Modules

Model Training

Methodology

Results

Setup

API Endpoint

Future Works

Deployment

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages