Refract
In the past few years, articles have revealed many incidents of malicious, retaliatory deepfake abuse targeting celebrities and everyday people alike. To help protect against this form of identity theft, we created Refract, which cloaks selfies with imperceptible pixel changes that disrupt deepfake models.
We first tried introducing noise (random pixel-level distortion), and we also tried distorting the face directly. Neither was effective: random pixel-level noise is easily learned and ignored by deepfake models during training, and distorting the face directly did not maintain visual similarity to the original photo.
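For context, here is a minimal sketch of that naive baseline: adding unstructured Gaussian noise to every pixel. The noise level and file handling here are illustrative, not our actual code.

```python
import numpy as np
from PIL import Image

def add_random_noise(path: str, sigma: float = 8.0) -> Image.Image:
    # Load the photo as floats so the noise can be added before re-clipping.
    img = np.asarray(Image.open(path).convert("RGB"), dtype=np.float32)
    noise = np.random.normal(0.0, sigma, img.shape)  # unstructured, per-pixel
    noisy = np.clip(img + noise, 0, 255).astype(np.uint8)
    return Image.fromarray(noisy)
```

Because this noise carries no structure tied to facial features, a model trained on many such images simply averages it away.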
Our approach focuses on attacking the training process of deepfake models. Deepfake models typically operate by extracting facial feature embeddings: the unique characteristics that define a person's visual identity, such as face shape or eye size. They use these embeddings to replicate a person's likeness and generate convincing imitations. By targeting the feature embeddings, we developed a strategy to shield an individual's visual identity from being captured.
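To make "feature embedding" concrete, here is a short sketch that extracts one with the facenet-pytorch library. This model and the file name are stand-ins for illustration, not necessarily what we used.

```python
import torch
from facenet_pytorch import MTCNN, InceptionResnetV1
from PIL import Image

mtcnn = MTCNN(image_size=160)                         # detects and crops the face
resnet = InceptionResnetV1(pretrained="vggface2").eval()

face = mtcnn(Image.open("selfie.jpg"))                # aligned face tensor (or None)
with torch.no_grad():
    embedding = resnet(face.unsqueeze(0))             # 512-dim identity vector
```

Two photos of the same person produce nearby vectors, which is exactly what a deepfake model exploits, and what we want to disrupt.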
We began by studying Glaze's research paper for inspiration on how to build our own model. The Glaze software protects artists' works by adding a few pixels encoded with the embeddings of a completely different art style. Our approach is similar: our model selects a different face from our database and "cloaks" the original picture with a few pixels of the new face's feature embeddings. We fine-tuned the model to make the source photo look like the target photo in embedding space, without introducing changes noticeable to humans. After multiple iterations, we optimized our model's parameters to achieve the desired results.
We generated 500 faces from thispersondoesnotexist.com to train our model instead of scraping people's faces online without proper consent, which would've contradicted the purpose of this project!
Together, we wrote a Python script to scrape faces from thispersondoesnotexist.com into our database, and another Python script that uses a Hugging Face model to extract feature embeddings from a picture of a face. Then we wrote the algorithm that searches our dataset for the target face with the feature embeddings most similar to the original face's, as sketched below.
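Roughly, the pieces fit together like this. The embed() output arrays and the file paths are placeholders; thispersondoesnotexist.com really does serve a newly generated face on each request.

```python
import numpy as np
import requests

def scrape_faces(n: int = 500, out_dir: str = "faces") -> None:
    # Each GET returns a brand-new generated face image.
    for i in range(n):
        resp = requests.get("https://thispersondoesnotexist.com")
        with open(f"{out_dir}/{i:03d}.jpg", "wb") as f:
            f.write(resp.content)

def nearest_face(original_emb: np.ndarray, db_embs: np.ndarray) -> int:
    """Index of the database face whose embedding is closest (Euclidean)
    to the original face's embedding."""
    dists = np.linalg.norm(db_embs - original_emb, axis=1)
    return int(np.argmin(dists))
```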
The last part of our model produces the new, cloaked image: it minimizes the visual difference from the original photo, measured with the perceptual LPIPS metric, while maximizing the feature-embedding difference by pulling the embeddings toward the target face, measured with the L2 norm.
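Here is a compressed sketch of that optimization, assuming a differentiable embed() model (like the one sketched earlier) and the off-the-shelf lpips package; the hyperparameters are illustrative, not the values we shipped.

```python
import lpips
import torch

perceptual = lpips.LPIPS(net="alex")  # expects image tensors scaled to [-1, 1]

def cloak(original, target_emb, embed, steps=200, lam=0.05, eps=0.03):
    # `original` is a (1, 3, H, W) tensor in [-1, 1].
    delta = torch.zeros_like(original, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=0.01)
    for _ in range(steps):
        cloaked = (original + delta).clamp(-1, 1)
        visual_loss = perceptual(cloaked, original).mean()    # stay visually close
        embed_loss = torch.norm(embed(cloaked) - target_emb)  # move identity to target
        loss = visual_loss + lam * embed_loss
        opt.zero_grad()
        loss.backward()
        opt.step()
        delta.data.clamp_(-eps, eps)  # keep the perturbation imperceptible
    return (original + delta).detach().clamp(-1, 1)
```

The clamp on delta bounds how far any pixel can move, while the two loss terms trade off visual fidelity against the identity shift.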
After working on the core implementation together, we delegated the rest of the work: model fine-tuning, and frontend & design.
I worked with another member on branding: colors, typography, personality, etc. We designed a simple, consistent style and prototyped it in Figma. Then we implemented the frontend, working with the team member in charge of DevOps to connect it to the backend. This was the first time I coded in TypeScript, and I learnt a lot of React.js as well!
Cloaked's similarity to Target: 75.67%
Cloaked's similarity to Original: 29.57%
Cloaked images are 2.56 times more effective at fooling deepfakes!
(Side-by-side image comparison: Original vs. Cloaked)
To the human eye, the output image is almost completely identical to the source image, but to a machine the calculated facial features are very different. We verified this by running the 500 generated images through our model and measuring the Euclidean distance between the embeddings of the original and cloaked images: cloaked images are approx. 2.56x more similar to the target than to the original in feature-embedding space!
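The measurement loop looks roughly like this; how the distances were converted into the similarity percentages reported above is simplified away here.

```python
import numpy as np

def evaluate(orig_embs, cloak_embs, target_embs):
    # Each argument: embeddings for the 500 test faces, shape (500, d).
    d_target = np.linalg.norm(cloak_embs - target_embs, axis=1)
    d_original = np.linalg.norm(cloak_embs - orig_embs, axis=1)
    # Smaller distance = more similar identity.
    print(f"mean distance to target:   {d_target.mean():.3f}")
    print(f"mean distance to original: {d_original.mean():.3f}")
    print(f"cloaked sits {d_original.mean() / d_target.mean():.2f}x closer to the target")
```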
Most of us have posted on social media before, so we are all exposed to scammers' web-scraping: unbeknownst to you, they steal your photos for deepfakes and other forms of identity theft. With Refract, you can cloak your photos before posting and disrupt these attacks.
Hit me up!