Use of data mining techniques and tools to extract information on Internet resource usage behavior at UTFPR - Câmpus Medianeira

Main author: Valiati, Gustavo Rafael
Format: Undergraduate final project (Trabalho de Conclusão de Curso)
Language: Portuguese
Published: Universidade Tecnológica Federal do Paraná, 2020
Online access: http://repositorio.utfpr.edu.br/jspui/handle/1/13445
Abstract: The sharp increase in the capacity to generate, transmit, and store data in digital format has outpaced the human capacity to extract knowledge from those data. Data Mining is a process that emerged in recent decades precisely to address this problem. This paper presents a case study applying Data Mining to a large volume of logs generated by Squid on Internet sharing servers, in order to extract knowledge required by the network administrator. The paper discusses in detail how each Data Mining step was performed, as well as some obstacles that hindered part of the project: the infeasibility of building an automated tool to carry out the Data Mining process; hardware unable to process the required data; and the need for new strategies for creating large ARFF files so that the Weka tool could apply mining tasks. The paper also presents a tool for preprocessing and data transformation, designed specifically for the environment encountered. The mining results are presented as patterns found in the logs, along with sample interpretations. Finally, a list of opportunities for future work is presented.