Page 792 - Emerging Trends and Innovations in Web-Based Applications and Technologies
P. 792

International Journal of Trend in Scientific Research and Development (IJTSRD)
               Special Issue on Emerging Trends and Innovations in Web-Based Applications and Technologies
                                       Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470

                               Deepfake Detection with Generalization
                                    via Domain-Aware Meta-Learning

                                                       2
                     Kartik. S. Raut. , Harsh. S. Pendke , Prof. Anupam Chaube , Prof. Usha Kosarkar
                                                                                                      4
                                    1
                                                                                3
                                           1,2,3,4 Department of Science and Technology,
                          1,2 G H Raisoni Institute of Engineering and Technology, Nagpur, Maharashtra, India
                          3,4 G H Raisoni College of Engineering and Management, Nagpur, Maharashtra, India

             ABSTRACT                                           performance against the latest deepfake techniques, which
             Deepfakes,  media  manipulated  using  deep  learning   can evade current detection systems (Le et al., 2024).
             techniques, pose a growing threat to the integrity of digital   One  promising  approach  to  address  the  limitations  in
             content.  These  AI-generated  forgeries  are  becoming
             increasingly sophisticated, making them difficult to detect.   generalizing deepfake detection involves utilizing a meta-
             Traditional detection methods often lag behind the rapid   learning framework for domain generalization, combined
             evolution of deepfake techniques and are hampered by the   with  data  augmentation.  Meta-learning  enhances
             limited variety of training data, making it hard for them to   adaptability  by  exposing  the  model  to  a  variety  of
                                                                manipulations during training, enabling it to learn features
             generalize effectively to new types of deepfakes. This thesis
             introduces  a  novel  deepfake  detection  approach  that   that remain consistent across different deepfake generation
             combines meta-learning for domain generalization (MLDG)   techniques.  Meanwhile,  data  augmentation  increases  the
                                                                diversity of training data, further strengthening the model’s
             with self blended images (SBI) to address this challenge.
             MLDG,  inspired  by  meta-learning  principles,  aims  to   ability to detect unseen deepfakes.
             improve  the  model’s  adaptability  to  new  manipulation   This  study  evaluates  the  effectiveness  of  an  integrated
             techniques by simulating domain shifts during training. The   deepfake  detection  framework  that  combines  Meta-
             model learns from various source domains representing   Learning for Domain Generalization (MLDG) (D. Li et al.,
             different deepfake generation methods. Additionally, SBIs,   2017b)  with  Self-Blended  Images  (SBIs)  (Shiohara  &
             synthetic images created by blending real and manipulated   Yamasaki,  2022)  as  a  data  augmentation  strategy.  The
             faces, are incorporated to further diversify the training data   primary  objective  is  to  assess  how  well  this  approach
             and promote the learning of features that generalize across   enhances  detection  performance  on  previously  unseen
             domains.  This  thesis  focuses  on  detecting  image-based   deepfake generation techniques compared to a conventional
             deepfakes using the Face Forensics++ dataset, a benchmark   baseline.
             collection  of  real  and  manipulated  videos,  specifically
                                                                The evaluation will be conducted using the FaceForensics++
             designed for deepfake detection research. The proposed
                                                                dataset  (Rössler  et  al.,  2019)  through  a  leave-one-out
             method is evaluated with a leave-one-out cross-validation
                                                                cross-validation methodology. The model will be trained on
             scheme on this dataset, where each deepfake generation
                                                                multiple  deepfake  generation  techniques  and  tested  on
             technique is used as a test case while the others are used for
                                                                unseen methods. The performance of the proposed approach
             training. The results consistently show that MLDG, when
                                                                will be compared against a baseline model trained using
             enhanced with SBIs, outperforms the standard Empirical   Empirical Risk Minimization (ERM) (Vapnik, 1999) to
             Risk  Minimization  (ERM)  method,  demonstrating  its
                                                                determine its effectiveness.
             effectiveness  in  generalizing  to  unseen  manipulation
             techniques.  The  research  offers  a  practical  solution  for   This study focuses exclusively on video-based deepfake
             deepfake  detection,  highlighting  how  MLDG  and  SBI   detection, specifically targeting facial manipulations. While
             augmentation  can  create  more  effective  and  adaptable   audio-based deepfake detection is an emerging concern, it is
             detection  systems.  The  findings  emphasize  the  need  for   beyond the scope of this research. Additionally, the detection
             models that can adapt to evolving deepfake techniques to   strategy  used  in  this  study  is  image-based,  analyzing
             protect the integrity of digital media.            individual video frames instead of incorporating temporal

                                                                information. This allows for a focused evaluation of domain
             I.     INTRODUCTION                                generalization  techniques  applied  to  facial  deepfake
             The rapid advancements in deepfake generation technology   detection.
             have  introduced  significant  challenges  for  detection   II.   RELATED WORK
             methods,  highlighting  the  need  for  more  robust  and
                                                                Challenges  in  Traditional  Machine  Learning  for
             adaptable approaches. Traditional deepfake detection relies
                                                                Deepfake Detection
             on machine learning models trained to distinguish real and
                                                                Conventional  machine  learning  models  face  several
             manipulated media based on specific deepfake techniques.
                                                                limitations, particularly their dependence on large labeled
             However, these models often struggle when encountering
                                                                datasets  for  effective  learning.  In  real-world  deepfake
             previously  unseen  manipulation  methods  (Malik  et  al.,
                                                                detection,  obtaining  labeled  datasets  for  every  possible
             2022). Recent research efforts have aimed to improve the
                                                                manipulation technique is challenging, time-consuming, and
             adaptability of detection models across different types of
                                                                often  impractical  (Prince,  2023).  Additionally,  machine
             deepfake  manipulations  (Sun  et  al.,  2021;  Z.  Wang  et  al.,
                                                                learning  models  frequently  struggle  with  adapting  to
             2023),  yet  there  remains  a  notable  gap  in  model
                                                                evolving  or  unseen  tasks.  For  example,  facial  recognition
             IJTSRD | Special Issue on Emerging Trends and Innovations in Web-Based Applications and Technologies   Page 782
   787   788   789   790   791   792   793   794   795   796   797