DETEKSI TWEET UJARAN KEBENCIAN DAN UCAPAN KASAR BERBAHASA INDONESIA MENGGUNAKAN RANDOM FOREST DAN SUPPORT VECTOR MACHINE DENGAN TEKNIK VOTING CLASSIFIER


Dandi Indra Wijaya, 4611418013 (2022) DETEKSI TWEET UJARAN KEBENCIAN DAN UCAPAN KASAR BERBAHASA INDONESIA MENGGUNAKAN RANDOM FOREST DAN SUPPORT VECTOR MACHINE DENGAN TEKNIK VOTING CLASSIFIER. Under Graduates thesis, Universitas Negeri Semarang.

[thumbnail of 4611418013 - Dandi Indra Wijaya.pdf] PDF - Published Version
Restricted to Repository staff only

Download (2MB) | Request a copy

Abstract

The use of social media has become one of the main things in everyday life. This happens because the features provided make it easy for people to communicate and disseminate information. One of the social media used by many people is Twitter. the main feature of twitter is that its users can post posts that are termed tweets. There is a negative thing about the freedom to write a tweet, namely a tweet that does not contain things that harm other people or community. The problem that arises from this negative thing is to distinguish between hatespeech tweets and abusive tweets. Hate speech and abusive speech are often the same thing. These differences need to be considered because they can have a negative impact on social life. Sentiment analysis is used to distinguish the two things. Sentiment analysis is an implementation of natural language processing which is part of machine learning. The algorithms used in this research are Support Vector Machine, Random Forest, and Voting Classifier with soft voting type. The estimator for the Voting Classifier is the Support Vector Machine and Random Forest. TF-IDF and N-gram were used as feature extraction. The data used is a tweet dataset that has been labeled neutral, hate speech, and rude speech. Measurement of model accuracy is done by using confusion matrix. The highest accuracy was produced by a combination of Voting Classifier technique with TF-IDF feature extraction and the amount of N-gram was 1 gram, which was 82.57% accuracy.

Item Type: Thesis (Under Graduates)
Uncontrolled Keywords: Analysis Sentiment, Support Vector Machine, Random Forest, Voting Classifier.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Fakultas: Fakultas Matematika dan Ilmu Pengetahuan Alam > Ilmu Komputer, S1
Depositing User: TUKP unnes
Date Deposited: 29 Mar 2023 04:00
Last Modified: 29 Mar 2023 04:00
URI: http://lib.unnes.ac.id/id/eprint/56790

Actions (login required)

View Item View Item