TABLE OF CONTENTS
- History/Overview
- Data Wrangling and Data Cleaning
- Exploratory Data Analysis/Data Storytelling and Visualization
- Conclusion
- References
HISTORY/OVERVIEW
The FIFA World Cup was first held in 1930, when FIFA (Fédération Internationale de Football Association), the world's football governing body, decided to stage an international men's football tournament at the initiative of its president, Jules Rimet. That inaugural edition was contested as a final tournament of only thirteen teams invited by the organization, and it was won by Uruguay. Since then, the World Cup has undergone successive expansions and format changes. It is now a quadrennial tournament whose 32-team final tournament is preceded by a two-year qualifying process involving over 200 teams from around the world, and it determines the sport's men's world champion. It is likely the most popular sporting event in the world, drawing billions of television viewers every tournament.
Evolution of the format
The number of teams and the format of each final tournament have varied considerably over the years. Most tournaments have consisted of a round-robin group stage followed by a single-elimination knockout stage.
The 2022 World Cup, hosted by Qatar, will be the first tournament not held in the summer, when it usually takes place. It is scheduled to run from 20 November to 18 December 2022.
This FIFA World Cup analysis covers the tournaments from 1930 to the present. The datasets were obtained from en.m.wikipedia.org and kaggle.com.
Data Wrangling and Data Cleaning
We used two datasets, "Fifa World Cup" and "New World cup Data", both of which we cleaned and transformed using the Power Query Editor in Power BI. First, we scraped additional data covering 2014-2018 to make our data complete, accurate, and up to date. Using Excel, we merged the scraped data with one of the datasets, titled "fifa world cup data", and renamed the merged result "New World cup Data". We then moved to the Power Query Editor, where we changed data types, removed and renamed some columns, created relationships, and removed duplicates.
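The cleaning steps described above (appending scraped rows, removing and renaming columns, changing data types, and removing duplicates) can be sketched in Python for readers who prefer code to the Power Query interface. This is only an illustrative sketch, not the workflow actually used: the column names, the `clean_rows` helper, and the sample rows are all hypothetical placeholders.

```python
# Illustrative sketch only: the analysis itself used Excel and Power Query.
# Column names and sample values are hypothetical placeholders, not real data.


def clean_rows(rows, rename=None, drop=(), int_columns=()):
    """Drop and rename columns, cast text to integers, and remove duplicate rows."""
    rename = rename or {}
    seen = set()
    cleaned = []
    for row in rows:
        # Remove unwanted columns and rename the rest.
        new_row = {rename.get(k, k): v for k, v in row.items() if k not in drop}
        # Change data types: strip thousands separators and cast to int.
        for col in int_columns:
            new_row[col] = int(str(new_row[col]).replace(",", "").strip())
        # Remove duplicates: keep only the first occurrence of each row.
        key = tuple(sorted(new_row.items()))
        if key not in seen:
            seen.add(key)
            cleaned.append(new_row)
    return cleaned


# Rows from the existing dataset (placeholder values).
existing = [
    {"Year": "2010", "Home Team Name": "Team A", "Away Team Name": "Team B",
     "Attendance": "50,000", "Notes": ""},
]
# Newly scraped rows, including one accidental duplicate.
scraped = [
    {"Year": "2014", "Home Team Name": "Team C", "Away Team Name": "Team D",
     "Attendance": "60,000", "Notes": ""},
    {"Year": "2014", "Home Team Name": "Team C", "Away Team Name": "Team D",
     "Attendance": "60,000", "Notes": ""},
]

merged = existing + scraped  # append the scraped rows to the existing data
result = clean_rows(
    merged,
    rename={"Home Team Name": "Home Team", "Away Team Name": "Away Team"},
    drop=("Notes",),
    int_columns=("Year", "Attendance"),
)
for row in result:
    print(row)
```

Removing duplicates after the type conversion means that rows differing only in formatting (for example "60,000" versus "60000") are treated as the same record.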
Exploratory Data Analysis/Data Storytelling and Visualization
Analysis 1 (Countries with World Cup Winning Titles)
This shows that Brazil has won the most World Cup titles, with 5 trophies.
Analysis 2 (Goals per Team)
This shows the goals scored by each team that played in the World Cup each year, along with each team's total goals. In this analysis, Brazil is the team with the highest total, at 47 goals.
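As a rough sketch of how a goals-per-team measure like this can be computed, the Python snippet below sums goals for each team per year and overall, counting a team's goals whether it played as the home or the away side. The field names (`home_team`, `home_goals`, and so on) and the match rows are hypothetical placeholders, not the columns or figures from the datasets used here.

```python
from collections import defaultdict

# Illustrative sketch: field names and scores are hypothetical placeholders.


def goals_per_team(matches):
    """Return (goals per (year, team), total goals per team)."""
    per_year = defaultdict(int)
    totals = defaultdict(int)
    for m in matches:
        # Each match contributes goals for both the home and the away side.
        for team, goals in ((m["home_team"], m["home_goals"]),
                            (m["away_team"], m["away_goals"])):
            per_year[(m["year"], team)] += goals
            totals[team] += goals
    return dict(per_year), dict(totals)


matches = [
    {"year": 2001, "home_team": "Team A", "home_goals": 2,
     "away_team": "Team B", "away_goals": 1},
    {"year": 2001, "home_team": "Team B", "home_goals": 0,
     "away_team": "Team A", "away_goals": 3},
    {"year": 2005, "home_team": "Team A", "home_goals": 1,
     "away_team": "Team C", "away_goals": 1},
]

per_year, totals = goals_per_team(matches)
top_team = max(totals, key=totals.get)  # team with the highest total goals
print(top_team, totals[top_team])
```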
Analysis 3 (Matches with the Highest Attendance)
This shows each country's summed attendance across the World Cup years and highlights the country or countries with the highest total attendance.
Analysis 4 (Matches per Cup)
This shows the total number of matches played per cup (each tournament from 1930 to 2018). The 2018 tournament, the most recent World Cup played, had a total of 64 matches.
Analysis 5 (Match Outcome; Home and Away)
This shows the goal statistics for the home and away teams in the World Cup. The year 1954 has the highest Home Team goals, and 2014 has the highest Away Team goals.
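One way to arrive at a home-versus-away comparison like this is to total home and away goals separately for each year and then pick the year with the largest total in each series. The Python sketch below assumes hypothetical field names (`year`, `home_goals`, `away_goals`) and uses made-up rows with placeholder years that are not real tournaments.

```python
from collections import defaultdict

# Illustrative sketch: field names, years, and scores are placeholders.


def goals_by_year(matches):
    """Return (home goals per year, away goals per year)."""
    home = defaultdict(int)
    away = defaultdict(int)
    for m in matches:
        home[m["year"]] += m["home_goals"]
        away[m["year"]] += m["away_goals"]
    return dict(home), dict(away)


matches = [
    {"year": 2001, "home_goals": 3, "away_goals": 0},
    {"year": 2001, "home_goals": 2, "away_goals": 1},
    {"year": 2005, "home_goals": 1, "away_goals": 2},
    {"year": 2005, "home_goals": 0, "away_goals": 2},
]

home, away = goals_by_year(matches)
peak_home_year = max(home, key=home.get)  # year with the most home-team goals
peak_away_year = max(away, key=away.get)  # year with the most away-team goals
print(peak_home_year, peak_away_year)
```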
Analysis 6 (Number of Teams)
A total of 85 teams have played in the World Cup, counting both past teams that no longer exist and teams that still exist today.
Analysis 7 (Total Attendance)
This shows the total attendance across the years in the FIFA World Cup.
Analysis 8 (Total Goals)
This shows the total goals scored for each cup across the various years.
Analysis 9 (Stadium with the Highest Attendance)
This shows the stadiums with the highest attendance across the years. Estadio Azteca has the highest total attendance overall, at 1,917,550, while Luzhniki Stadium had the highest total attendance in 2018, at 546,077.
FIFA World Cup Analysis Dashboard
Conclusion
This analysis shows the wins and losses from the first FIFA World Cup in 1930 to the most recent one in 2018. It provides data-driven insights that can help predict future outcomes and support conclusions based on facts and findings. It shows trends across the years, statistics, and each team's performance at the World Cup between 1930 and 2018. We ran a descriptive analysis on the datasets obtained about the FIFA World Cup, aiming to identify what has happened in the past (historical events).
REFERENCES
- en.wikipedia.org
- google.com
- fifa.com/the-best-fifa-football-awards/hist..
- footballhistory.org/world-cup/index.html
- en.m.wikipedia.org/wiki/FIFA_World_Cup
- kaggle.com/code/shivan118/fifa-world-cup-da..