Otimização de desempenho do Hadoop MapReduce: um caso prático
With the popularization of the Internet, massive amounts of data have been generated on a daily basis, especially in the social media. The growing demand for managing large volumes of data meant that new solutions were developed. Currently Hadoop is one of the solutions used. Settings can be applied...
Autor principal: | Kuss, Elder Lucas |
---|---|
Formato: | Trabalho de Conclusão de Curso (Graduação) |
Idioma: | Português |
Publicado em: |
Universidade Tecnológica Federal do Paraná
2020
|
Assuntos: | |
Acesso em linha: |
http://repositorio.utfpr.edu.br/jspui/handle/1/15938 |
Tags: |
Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!
|
Resumo: |
With the popularization of the Internet, massive amounts of data have been generated on a daily basis, especially in the social media. The growing demand for managing large volumes of data meant that new solutions were developed. Currently Hadoop is one of the solutions used. Settings can be applied in Hadoop to extract better performance. This paper carries out a study about the influence of configuration parameters on the performance of Hadoop MapReduce, and for reach that goal, uses a virtualized cluster Docker environment for testing development. The results obtained in this paper demonstrate that it is possible to achieve performance improvements in Hadoop by tuning the values of its configuration parameters. |
---|