Course Outline | With the development of computer technology, data is being digitized and generated and stored in various fields. In this age of knowledge and information, it is important to extract knowledge from these data and make it informatized in order to increase the competitiveness of the business being operated and further create a new business. However, these data are very large or complex in shape, making it difficult to process them with existing technologies. Big data technologies such as Hadoop have been developed to deal with this. Big data technologies are based on distributed processing, and various types of distributed processing platforms have been released as open source. In this course, students will learn various open-source big data platforms for analyzing and managing large amounts of data. First, you will learn about the distributed processing model used for distributed processing platforms and the distributed processing method used. In addition, students will learn open-source solutions for various tasks that occur in big data applications, such as data collection, event processing, and big data analysis. In this course, practical skills are cultivated by implementing big data processing programs using open-source software. |