Open Access Open Access  Restricted Access Subscription or Fee Access

Fake News and Hate Speech Detection Using Deep Learning

Akanksha Balaji Mate, Ashar Hasan, Pallavi Santosh Murkute, Shubham Kumar, Rahul M. Samant

Abstract


Due to the potential for such content to hurt both people and society as a whole, the issue of identifying fake news and hate speech on internet platforms has grown in importance in recent years. This problem has shown great potential for resolution through deep learning, a specific area of machine learning. In this research work, we examine recent studies on the application of deep learning methods for the identification of fake news and hate speech. We begin by outlining the difficulties in identifying such information, such as the difficulty in defining fake news and hate speech, the sheer volume of online text, and the requirement to take context and other contextual elements into account. The use of deep learning for detection is then covered in detail. In order to enhance the performance of models, we also investigate the use of transfer learning and data augmentation strategies. An active study field in NLP and machine learning is the detection of fake news and hate speech using deep learning algorithms. The detection of fake news and hate speech has showed promise when using deep learning algorithms like CNNs, RNNs, and transformers. Using methods like sentiment analysis, topic modeling, and text classification, one can identify false news by examining the news article's wording, citations, and tone. Lack of big, annotated datasets is a barrier in detecting fake news and hate speech, particularly in languages other than English. For the development of precise deep learning models, high-quality annotated datasets are crucial.


Full Text:

PDF

References


Samant RM, Sameer Kulkarni, Atharva Bhusari. Sherlock: An Ensemble based Deep Learning Framework for Fake News Detection. Int J Adv Comput Sci Appl (0975-8887). 2020 May; 176(22): 37–42.

Helmstetter S, Paulheim H. Weakly Supervised Learning for Fake News Detection on Twitter. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). 2018; 274–277.

Sen A, Rudra K, Ghosh S. Extracting Situational Awareness from Microblogs during Disaster Events. Social Networking Workshop, COMSNETS 2015. 2015; 1–6.

Cho K, Merrienboer BV, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning Phrase Representations using RNN Encoder–Decoder. arXiv:1406.1078. 2014.

Wang WY. Liar, liar pants on fire: A new benchmark dataset for fake news detection. arXiv:1705.00648v1 [cs.CL]. 2017; 422–426.

Saad S, Nicholas W, Mei-Ling S, Daniel F. High Dimensional Latent Space Variational Auto Encoders for Fake News Detection. IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). 2019; 437–442.

Reis J, Correia A, Murai F, Veloso A, Benevenuto F. Supervised Learning for Fake News Detection. IEEE Intell Syst. 2019 Mar–Apr; 34(2): 76–81.

Sudarshan S, Seth S, Chebrolu K, Chakrabarti S, Agarwal M, Pale A, Bagade A. The Kauwa-Kaate Fake News Detection System: Demo. ACM India Joint International Conference on Data Science and Management of Data (CoDS-COMAD). 2020; 302–306.

Fakenewschallenge.org. (2016). Fake news challenge. [Online] Available at: http://www.fakenewschallenge.org/.

Risdal M. (2016). Getting real about fake news. [Online] Kaggle.com. Available at: https://www.kaggle.com/datasets/mrisdal/fake-news.

Socher R, Perelygin A, Wu JY, Chuang J, Manning CD, Ng AY, Potts C. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank. In Proceedings of EMNLP. 2013; 1631–1642.

Bengio Y, Simrad P, Franconi P. Learning Long-Term Dependencies with Gradient Descent is Difficult. IEEE Trans Neural Netw. 1994; 5(2): 157–166.

Hochreiter S, Schmidhuber J¨. Long Short-Term Memory. Neural Comput. 1997; 9(8): 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.

Kim Y. Convolutional Neural Networks for Sentence Classification. Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014; 1746–1751.

Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed Representations of Words and Phrases and their Compositionality. NIPS'13: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013 Dec; 2: 3111–3119.

Socher R, Pennington J, Huang EH, Ng AY, Manning CD. Semi-Supervised Recursive Autoencoders. EMNLP'11: Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011; 151–161.

Kalchbrenner N, Grefenstette E, Blunsom P. A Convolutional Neural Network for Modelling Sentences. arXiv:1404.2188v1 [cs.CL]. 2014.

Traylor T, Straub J, Gurmeet, Snell N. Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator. IEEE 13th International Conference on Semantic Computing (ICSC). 2019; 445–449.


Refbacks

  • There are currently no refbacks.