• Fri. May 24th, 2024

AI Neural Network Trained to Decode Darknet Slang and Terminology

Avatar photo

ByHarper Stewart

Aug 2, 2023
AI decodes Darknet slang
Harper Stewart
Latest posts by Harper Stewart (see all)

South Korean researchers have successfully trained DarkBERT, a neural network designed to comprehend the slang used by participants on the Darknet. Tom’s Hardware reports on this groundbreaking achievement, citing the technical neural documentation. The researchers from the Institute of Advanced Technology in South Korea trained DarkBERT using a vast array of Darknet text data, consisting of approximately 6 million web pages, and leveraging the RoBERTa language model.

Developing DarkBERT: Advancing Understanding of Darknet Slang and Limitations of Neural Networks

One notable aspect of this development is the distinct nature of Darknet slang, which possesses specific characteristics enabling attackers to evade detection. For instance, Russian-speaking hackers often employ Anglicisms and even modify syntax to camouflage their public exchanges on forums, thereby concealing their activities.

Through the training of the neural network, scientists have made significant strides in detecting confidential data leaks on the internet. DarkBERT is capable of discerning whether published data represents a new leak or a duplication of previously disclosed information. Moreover, the neural network has acquired the ability to identify advertisements for the illicit sale of psychotropic substances by analyzing the presence of specific keywords. This innovative solution has the potential to streamline the efforts of law enforcement agencies in apprehending offenders involved in such activities.

It is important to note that DarkBERT, despite its accomplishments, does have limitations. Currently, the technology only supports the English language, limiting its applicability in multilingual Darknet contexts. Additionally, the learning process is still a manual endeavor due to variations in site layouts, necessitating the compilation of data arrays for neural network training.

Despite the widespread enthusiasm surrounding neural networks, it is crucial to recognize that the technology is still a work in progress. A case in point is the performance of ChatGPT and Sage neural networks, which received a “C” grade on a history exam at the Ural Federal University. Mikhail Kiselyov, an associate professor of history in UrFU, asserts that the chatbots’ algorithms simulate independent reasoning, resulting in the fabrication of facts and references to non-existent scholarly works. These challenges underscore the ongoing journey to refine and enhance the capabilities of neural networks.

 
Avatar photo

Harper Stewart

With a deep understanding of the complexities of the Dark Web, Harper curates informative and thought-provoking content for our readers. Her knowledge of the hidden corners of the internet and cybersecurity helps shed light on the often mysterious and illicit activities that take place in this realm.