Many Nobel laureates are affiliated with a university, a laboratory, or a research organisation (apart from Literature and Peace prize winners as we've seen). But the world is a big place. Which research institutions had the most Nobel laureates working there at the time of making the discovery?
Create a bar chart showing the organisations affiliated with the Nobel laureates. It should looks something like this:

Which organisations make up the top 20?
How many Nobel prize winners are affiliated with the University of Chicago and Harvard University?
Each research organisation is located in a particular city. Are some cities hot spots for scientific discoveries? Where do major discoveries tend to take place?
Create another plotly bar chart graphing the top 20 organisation cities of the research institutions associated with a Nobel laureate.
Where is the number one hotspot for discoveries in the world?
Which city in Europe has had the most discoveries?
Contrast the above chart with the birth city of the Nobel laureates. Would you expect to see a similar ranking for where the laureates are born versus where most discoveries are made? Would you expect to see the most populous cities producing the highest number of Nobel laureates?
Create a plotly bar chart graphing the top 20 birth cities of Nobel laureates.
Use a named colour scale called Plasma for the chart.
What percentage of the United States prizes came from Nobel laureates born in New York?
How many Nobel laureates were born in London, Paris and Vienna?
Out of the top 5 cities, how many are in the United States?
Create a DataFrame that groups the number of prizes by organisation.
Then use the plotly documentation to create a sunburst chart
Click around in your chart, what do you notice about Germany and France?
Here's what you're aiming for:

.
.
..
...
..
.
.
Solution 1: The Top Research Organisations
This one should be pretty simple:
top20_orgs = df_data.organization_name.value_counts()[:20] top20_orgs.sort_values(ascending=True, inplace=True)
Our chart includes many of the usual suspects:
org_bar = px.bar(x = top20_orgs.values,
y = top20_orgs.index,
orientation='h',
color=top20_orgs.values,
color_continuous_scale=px.colors.sequential.haline,
title='Top 20 Research Institutions by Number of Prizes')
org_bar.update_layout(xaxis_title='Number of Prizes',
yaxis_title='Institution',
coloraxis_showscale=False)
org_bar.show()
Solution 2: Research Cities
top20_org_cities = df_data.organization_city.value_counts()[:20]
top20_org_cities.sort_values(ascending=True, inplace=True)
city_bar2 = px.bar(x = top20_org_cities.values,
y = top20_org_cities.index,
orientation='h',
color=top20_org_cities.values,
color_continuous_scale=px.colors.sequential.Plasma,
title='Which Cities Do the Most Research?')
city_bar2.update_layout(xaxis_title='Number of Prizes',
yaxis_title='City',
coloraxis_showscale=False)
city_bar2.show()Cambridge Massachusets and New York in the United States lead the pack:

Solution 3: Laureate Birth Cities
top20_cities = df_data.birth_city.value_counts()[:20]
top20_cities.sort_values(ascending=True, inplace=True)
city_bar = px.bar(x=top20_cities.values,
y=top20_cities.index,
orientation='h',
color=top20_cities.values,
color_continuous_scale=px.colors.sequential.Plasma,
title='Where were the Nobel Laureates Born?')
city_bar.update_layout(xaxis_title='Number of Prizes',
yaxis_title='City of Birth',
coloraxis_showscale=False)
city_bar.show()A higher population definitely means that there's a higher chance of a Nobel laureate to be born there. New York, Paris, and London are all very populous. However, Vienna and Budapest are not and still produced many prize winners. That said, much of the ground-breaking research does not take place in big population centres, so the list of birth cities is quite different from the list above. Cambridge Massachusets, Stanford, Berkely and Cambridge (UK) are all the places where many discoveries are made, but they are not the birthplaces of laureates.

Solution 4: The Sunburst Chart
Each country has a number of cities, which contain a number of cities, which in turn contain the research organisations. The sunburst chart is perfect for representing this relationship. It will give us an idea of how geographically concentrated scientific discoveries are!
country_city_org = df_data.groupby(by=['organization_country',
'organization_city',
'organization_name'], as_index=False).agg({'prize': pd.Series.count})
country_city_org = country_city_org.sort_values('prize', ascending=False)
burst = px.sunburst(country_city_org,
path=['organization_country', 'organization_city', 'organization_name'],
values='prize',
title='Where do Discoveries Take Place?',
)
burst.update_layout(xaxis_title='Number of Prizes',
yaxis_title='City',
coloraxis_showscale=False)
burst.show()France is a great example of concentration. Practically all the organisations affiliated with Nobel prize winners are in Paris. In contrast, scientific discoveries are much more spread out across Germany. Meanwhile, the UK is dominated by Cambridge and London.
