On January 20, 2020, the College of Management, National Sun Yat-sen University (NSYSU) hosted the "2020 Business Intelligence Day” to allow high school students to understand the process of basic data analysis and data visualization using the R language, cultivating their interest in business data analysis. A significant number of registrations were received for this event, and 90 students from Kaohsiung Municipal Kaohsiung Senior High School, National Feng-Shan Senior High School, Sanmin High School, Yunlin Hu-wei Senior High School, Taoyuan Pingjen Senior High School, and Pingtung Girls’ High School, etc. were admitted. Assistant Professor Tony Cho pointed out that today's high school students were quite active in exploring abilities, and the College was also happy to share the knowledge with them and cultivate future talents in business management.
The full-day experience camp was divided into two sessions. The first part included background learning, introducing the R language environment and the Dplyr package. The session allowed the students to perform basic calculations and use Table (), tapply () and other functions to obtain descriptive statistics. The second part included practicing visualizing data, using the ggplot2 & d3heatmap package to draw bar graphs, line graphs, heat maps, scatter plots, etc. to present statistical results. Finally, students will be grouped to analyze and present 120 years of Olympic contestants' data (including gender, height, sports and medals, etc.) for them to learn data analysis through practice.
The teaching assistants of the camp were all students taking Business Big Data Micro Course at the College of Management. In a short and brief manner, high school students who were exposed to the R language for the first time were able to write a few small programs within one day. 林士弘, a second year student of the master in the business administration program, who was one of the lecturers for the camp, said that each group of students was asked to present the main findings from the Olympic data using graphs, and their presentations were graded by the teaching assistants in terms of creativity, annotation detailedness, stage manners and technicality. The camp offered a total prize of NT$42,000.
The team that won the first place consisted of 黃猷珵and李柏寯 from Sanmin High School, 高瑀and曾冠旗 from Guoguang Laboratory School, National Sun Yat-sen University, 陳姿諭 from National Feng-Shan Senior High School, and林佳宜 from Hu-wei Senior High School. They explored the relationship between the participating countries’ GDP and democracy index with the number of medals. They concluded that countries with a large number of medals were not necessarily economically prosperous. They pointed out that, "Hard work can produce better results than financial resources". 陳韻卉, the teaching assistant assigned to lead the team who is a second-year graduate student in the computer science program, said that students who had never learned about programming languages were frustrated at first, but she was feeling especially proud and pleased when they ended up presenting their analysis result on the stage with confidence. "It is our most important task to make sure the students are not overwhelmed by the large amount of data and programming”, she said.
The team that won the second place was diverse and consisted of吳坤霖from National Feng-Shan Senior High School, 邵彥哲from Kaohsiung Saint Paul's High School, 陳伃玟 from Lichih Senior High School, 鍾順穎 from Guoguang Laboratory School, National Sun Yat-sen University, and 楊舒晴 and陳宏諭 from Sanmin High School. They analyzed the correlation between the democracy index and sports performance of various countries and found that the presence of a democratic system in a country did not affect the number of gold medals the country won. Nevertheless, the majority of the countries that won gold medals had a democracy index of more than 8. Therefore, it was inferred that the presence of a democratic system in a country could have certain advantages for sports development. 許紘齊, the teaching assistant assigned to lead the team who is a second-year graduate student in the computer science program, pointed out that the reason for the team to win the award was each student was delegated with the task they were good at. For instance, the students who were not good at programming were assigned to prepare and do the presentation, while the students who were more interested in programming were allowed to focus on programming. The award was the result of a collective effort.
In light of the upcoming 2020 Olympics in Tokyo, the team consisting of陳禹璇 from Deguang Catholic High School, 范名媛 and蕭尹翔 from Guoguang Laboratory School, National Sun Yat-sen University, 黃廉凱 from St. Dominic Catholic High School, 陳以恩 from National Feng-Shan Senior High School, and王皓平 from Sanmin High School was curious about the relationship between the host country and the number of medals won. Studying China as the subject, they analyzed the medal data of the past four Olympic Games (China was the host country in 2008) and found that when China was the host country, the total number of medals the Chinese team won in each sport did indeed outperform the team in other years. Therefore, they also inferred that because the home team would have more people cheering for them, the visiting team would be relatively pressured, which would affect the sports performance indirectly.
After a day of learning and practices, it is believed that the students have gained a preliminary understanding of the concept of data analysis. The most important thing is that they would no longer be afraid of dealing with data and could enjoy the sense of accomplishment of turning primary data into useful information.