Page 507 - Emerging Trends and Innovations in Web-Based Applications and Technologies
P. 507

International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
















































                                          Fig 3. Methodology of plagiarism detection tool
             VI.    PERFORMANCE EVALUATION                      2.  Training: Originality Guard was trained on the training
             To comprehensively evaluate the performance of Originality   set using a 5-fold cross-validation approach
             Guard, a multi-faceted evaluation framework was designed,
             incorporating various metrics, baselines, and experimental   3.  Testing: the trained model was evaluated on the testing
             setups.                                                set
                                                                4.  Baselines:  two  baseline  methods  were  used  for
             Evaluation Metrics                                     comparison:
             The following metrics were employed to assess Originality     Turnitin: a commercial plagiarism detection tool
             Guard's performance:                                 PLAGUE:  a  state-of-the-art  machine  learning-based
             1.  Precision:  measures  the  proportion  of  true  positives   approach
                among all detected plagiarism instances
                                                                VII.   CONCLUSION
             2.  Recall: measures the proportion of true positives among
                                                                This  review  paper  presented  Originality  Guard,  a  novel
                all actual plagiarism instances
                                                                plagiarism detection tool that leverages advanced natural
             3.  F1-score: measures the harmonic mean of precision and   language processing and machine learning techniques. The
                recall                                          main contributions of this research includes:
             4.  Accuracy:  measures  the  proportion  of  correctly   1.  Development  of  Originality  Guard:  A  robust  and
                classified instances                                accurate  plagiarism  detection  tool  that  outperforms
                                                                    existing baselines.
             5.  Mean Average Precision (MAP): measures the average
                precision at different recall levels            2.  Comprehensive  evaluation  framework:  A  thorough
                                                                    evaluation methodology that assesses the performance
             6.  Receiver Operating Characteristic (ROC) Curve: plots the
                                                                    of Originality Guard using various metrics.
                true  positive  rate  against  the  false  positive  rate  at
                different thresholds                            3.  Insights into plagiarism detection: A detailed analysis of
                                                                    the results, highlighting the strengths and weaknesses of
             Experimental Setup
                                                                    Originality Guard.
             The experimental setup consisted of:
             1.  Dataset:  a  large-scale  dataset  of  50,000  academic   It  has  significant  implications  for  advancing  plagiarism
                documents,  divided  into  training  (80%)  and  testing   detection and accuracy, promoting academic integrity, and
                (20%) sets


             IJTSRD | Special Issue on Emerging Trends and Innovations in Web-Based Applications and Technologies   Page 497
   502   503   504   505   506   507   508   509   510   511   512