The Use of Information Retrieval in Student Academic Document Plagiarism Detection System

Authors

    Arief Rahman Yusuf( 1 ) Angga Prasetyo( 2 )

    (1) Universitas Muhammadiyah Ponorogo
    (2) Universitas Muhammadiyah Ponorogo

DOI:


https://doi.org/10.32877/bt.v6i2.1063

Keywords:


Academic Documents, Detection System, Information Retrieval , Plagiarism, Turnitin

Abstract

The practice of plagiarism can threaten the credibility of academic documents such as final assignments, theses, theses, and dissertations. Universitas Muhammadiyah Ponorogo (UMPO) is one of the best Muhammadiyah Universities in Indonesia. All academic documents at UMPO are well stored. However, plagiarism is only found in the Turnitin database and in the title of academic documents. This research aims to pre-screen academic documents in the internal scope by creating a plagiarism application that is easy to use and cheaper in terms of cost than using Turnitin. The steps of making a plagiarism detection system in academic documents using information retrieval (IR) with waterfall development. The design process starts from system requirements analysis, design, implementation, and testing. This plagiarism detection system goes through several stages, namely tokenisation, stopword removal, stemming, and termweigthing. Tokenisation involves converting uppercase letters into lowercase letters and removing punctuation marks. Stopword removal removes noise, stemming converts words into basic forms, and termweigthing uses local and global weighting to calculate similarity with the Vector Space Model. Document similarity is calculated using cosine similarity, rejected documents are considered free of plagiarism. If there is no similarity, the document is accepted as a source document, while the rejected document is not stored in the database. Further research on the document plagiarism detection system that previously only used the txt file extension, in the future it can use the .doc or HTML extension so as to increase the effectiveness of the performance of academic document examination time.

Downloads

Download data is not yet available.

References

E. Bensal, E. Miraflores, and N. C. Tan, “Plagiarism: Shall we turn to Turnitin?,” CALL-EJ, vol. 14, pp. 2–22, Jan. 2013.

A. R. Fadilla, H. Haryadi, and M. Rapik, “Plagiarisme Karya Ilmiah Dalam Kacamata Hukum Pidana,” PAMPAS J. Crim. Law, vol. 4, no. 1, Art. no. 1, Feb. 2023, doi: 10.22437/pampas.v4i1.24074.

G. C. Adiyati and A. Supriyanto, “PENYEBAB DAN DAMPAK BAGI SESEORANG YANG MELAKUKAN TINDAKAN PLAGIARISME DALAM PENULISAN KARYA ILMIAH,” Semin. Nas. Arah Manaj. Sekol. Pada Masa Dan Pasca Pandemi Covid-19, no. 0, Art. no. 0, 2020, Accessed: Dec. 14, 2023. [Online]. Available: http://conference.um.ac.id/index.php/apfip/article/view/375

S. Wachidah, “PLAGIARISME DALAM KATA-KATA MAHASISWA: ANALISIS TEKS DENGAN PENDEKATAN FUNGSIONAL,” Linguist. Indones., vol. 31, no. 2, Art. no. 2, Aug. 2013, doi: 10.26499/li.v31i2.8.

T. Foltýnek, N. Meuschke, and B. Gipp, “Academic Plagiarism Detection: A Systematic Literature Review,” ACM Comput. Surv., vol. 52, no. 6, p. 112:1-112:42, Oct. 2019, doi: 10.1145/3345317.

L. Wulandari, “PENERAPAN TEXT MINING PADA SEARCH ENGINE (STUDI KASUS E-COMMERCE SHOPEE),” J. Teknol. Inf. Manaj. Dan Bisnis Digit., pp. 21–27, Sep. 2023.

N. Ahmad, A. A. Prasetyo, and A. Masruri, “PENERAPAN INFORMATION RETRIEVAL PADA SEARCH ENGINE,” Knowl. J. Inov. Has. Penelit. Dan Pengemb., vol. 1, no. 1, Art. no. 1, Dec. 2021.

F. Wiranto and I. M. Tirta, “Information Retrieval Using Matrix Methods,” presented at the International Conference on Mathematics, Geometry, Statistics, and Computation (IC-MaGeStiC 2021), Atlantis Press, Feb. 2022, pp. 167–172. doi: 10.2991/acsr.k.220202.032.

E. Fitriani, R. E. Indrajit, and R. Aryanti, “Penerapan Model Information Retrieval Untuk Pencarian Konten Pada Perpustakaan Digital,” J. Perspekt., vol. 15, no. 2, Art. no. 2, Sep. 2017, doi: 10.31294/jp.v15i2.2350.

A. Sutanti, M. K. Mz, M. Mustika, and P. Damayanti, “RANCANG BANGUN APLIKASI PERPUSTAKAAN KELILING MENGGUNAKAN PENDEKATAN TERSTRUKTUR,” Komputa J. Ilm. Komput. Dan Inform., vol. 9, no. 1, pp. 1–8, Mar. 2020, doi: 10.34010/komputa.v9i1.3718.

R. Nurmasari, S. Pinem, and U. Nurkhalifah, “Perancangan Pengelolaan Data Pelabuhan Perikanan Nusantara (PPN) Pelabuhan Ratu Menggunakan Entity Relationship Diagram (ERD),” J. Ilm. Rekayasa Dan Manaj. Sist. Inf., vol. 9, no. 1, Art. no. 1, Mar. 2023, doi: 10.24014/rmsi.v9i1.22024.

Z. Tuasamu et al., “Analisis Sistem Informasi Akuntansi Siklus Pendapatan Menggunakan DFD dan Flowchart Pada Bisnis Porobico,” J. Bisnis Dan Manaj. JURBISMAN, vol. 1, no. 2, Art. no. 2, May 2023, doi: 10.61930/jurbisman.v1i2.181.

N. Normah and F. Sihaloho, “Perancangan User Interface (UI) dan User Experince (UX) Aplikasi pendistribution alat-alat kesehatan pada perusahaan PT. Rekamileniumindo Selaras Jakarta Barat,” Indones. J. Softw. Eng. IJSE, vol. 9, no. 1, Art. no. 1, Jun. 2023, doi: 10.31294/ijse.v9i1.15467.

L. D. Krisnawati, J. F. Lim, and G. Virginia, “Penggunaan Pemodelan Topik dalam Sistem Temu Kembali Dokumen Termirip,” J. Linguist. Komputasional, vol. 6, no. 1, Art. no. 1, Apr. 2023, doi: 10.26418/jlk.v6i1.78.

R. Kaban, P. Sihombing, M. Pandia, and P. Simamora, “Pemrosesan Query dan Pemeringkatan Hasil dalam Information Retrieval: Sebuah Kajian Literatur,” J. Inf. Syst. Res. JOSH, vol. 4, no. 3, Art. no. 3, Apr. 2023, doi: 10.47065/josh.v4i3.2867.

Downloads

Published

2023-12-28

How to Cite

[1]
A. Rahman Yusuf and A. Prasetyo, “The Use of Information Retrieval in Student Academic Document Plagiarism Detection System”, bit-Tech, vol. 6, no. 2, pp. 235–240, Dec. 2023.

Issue

Section

Articles
DOI : https://doi.org/10.32877/bt.v6i2.1063
Abstract views: 55 / PDF downloads: 49