Pandas - 3. The Land of Poets Challenge
Download exercises
For a digital humanities project you need to display Italian poets by filtering a csv table according to various criteria. This challenge will be only about querying with pandas, which is something you might find convenient to do during exams for quickly understanding datasets content (using pandas will always be optional, you will never be asked to perform complex modifications with it)
You are given a dataset taken from Wikidata, a project by the Wikimedia foundation which aims to store only machine-readable data, like numbers, strings, and so on interlinked with many references. Each entity in Wikidata has an identifier, for example Dante Alighieri is the entity Q1067 and Florence is Q2044
Wikidata can be queried using the SPARQL language: the data was obtained with this query and downloaded in CSV format (among the many which can be chosen). Even if not necessary for the purposes of the exercise, you are invited to play a bit with the interface, like trying different visualizations (i.e. try select map in the middle-left corner) - or see other examples
What to do
If you haven’t already, install Pandas:
Anaconda:
conda install pandas
Without Anaconda (
--user
installs in your home):python3 -m pip install --user pandas
unzip exercises in a folder, you should get something like this:
pandas
pandas1-sol.ipynb
pandas1.ipynb
pandas2-sol.ipynb
pandas2.ipynb
pandas3-chal.ipynb
jupman.py
WARNING 1: to correctly visualize the notebook, it MUST be in an unzipped folder !
open Jupyter Notebook from that folder. Two things should open, first a console and then browser.
The browser should show a file list: navigate the list and open the notebook
pandas3-chal.ipynb
WARNING 2: DO NOT use the Upload button in Jupyter, instead navigate in Jupyter browser to the unzipped folder !
Go on reading that notebook, and follow instuctions inside.
Shortcut keys:
to execute Python code inside a Jupyter cell, press
Control + Enter
to execute Python code inside a Jupyter cell AND select next cell, press
Shift + Enter
to execute Python code inside a Jupyter cell AND a create a new cell aftwerwards, press
Alt + Enter
If the notebooks look stuck, try to select
Kernel -> Restart
Load the dataset
First load the dataset italian-poets.csv in pandas dataframe df
USE
UTF-8
asencoding
[1]:
# write here
Tell me more
Show some info about the dataset
[2]:
# write here
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3198 entries, 0 to 3197
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 subj 3198 non-null object
1 subjLabel 3198 non-null object
2 place 3198 non-null object
3 placeLabel 3198 non-null object
4 coord 3198 non-null object
5 birthyear 3070 non-null float64
dtypes: float64(1), object(5)
memory usage: 150.0+ KB
Getting in shape
Show the rows and the columns counts:
[3]:
# write here
rows: 3198
columns: 6
10 rows
Display first 10 rows
[4]:
# write here
[4]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
0 | http://www.wikidata.org/entity/Q8797 | Aemilius Macer | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | NaN |
1 | http://www.wikidata.org/entity/Q8833 | Gaius Maecenas | http://www.wikidata.org/entity/Q13378 | Arezzo | Point(11.878055555 43.463055555) | NaN |
2 | http://www.wikidata.org/entity/Q5592 | Michelangelo | http://www.wikidata.org/entity/Q52069 | Caprese Michelangelo | Point(11.985833333 43.640833333) | 1475.0 |
3 | http://www.wikidata.org/entity/Q6197 | Horace | http://www.wikidata.org/entity/Q52691 | Venosa | Point(15.816666666 40.966666666) | -64.0 |
4 | http://www.wikidata.org/entity/Q7170 | Sallust | http://www.wikidata.org/entity/Q177061 | Amiternum | Point(13.305769 42.400776) | -85.0 |
5 | http://www.wikidata.org/entity/Q7198 | Ovid | http://www.wikidata.org/entity/Q50157 | Sulmona | Point(13.926198 42.048025) | -42.0 |
6 | http://www.wikidata.org/entity/Q7728 | Grazia Deledda | http://www.wikidata.org/entity/Q13649 | Nuoro | Point(9.3280792 40.3200621) | 1871.0 |
7 | http://www.wikidata.org/entity/Q7803 | Bronzino | http://www.wikidata.org/entity/Q2044 | Florence | Point(11.254166666 43.771388888) | 1503.0 |
8 | http://www.wikidata.org/entity/Q8796 | Sandra Lombardi | http://www.wikidata.org/entity/Q220 | Rome | Point(12.482777777 41.893055555) | 1946.0 |
9 | http://www.wikidata.org/entity/Q8800 | Gaius Maecenas Melissus | http://www.wikidata.org/entity/Q20571 | Spoleto | Point(12.733333333 42.733333333) | -100.0 |
Born in Verona
Display all people born in Verona
[5]:
# write here
[5]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
0 | http://www.wikidata.org/entity/Q8797 | Aemilius Macer | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | NaN |
135 | http://www.wikidata.org/entity/Q163079 | Catullus | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | -83.0 |
232 | http://www.wikidata.org/entity/Q318593 | Girolamo Fracastoro | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1478.0 |
256 | http://www.wikidata.org/entity/Q539577 | Guarino da Verona | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1374.0 |
375 | http://www.wikidata.org/entity/Q1236766 | Ippolito Pindemonte | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1753.0 |
436 | http://www.wikidata.org/entity/Q620193 | Aleardo Aleardi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1812.0 |
755 | http://www.wikidata.org/entity/Q2293943 | Cristina Ali Farah | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1973.0 |
764 | http://www.wikidata.org/entity/Q1587432 | Francesco Scipione, marchese di Maffei | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1675.0 |
858 | http://www.wikidata.org/entity/Q3290043 | Marco Antonio Zucchi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1750.0 |
891 | http://www.wikidata.org/entity/Q3611735 | Alida Airaghi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1953.0 |
1035 | http://www.wikidata.org/entity/Q3638918 | Berto Barbarani | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1872.0 |
1090 | http://www.wikidata.org/entity/Q3663490 | Caterina Bon Brenzoni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1813.0 |
1098 | http://www.wikidata.org/entity/Q3665350 | Cesare Betteloni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1808.0 |
1143 | http://www.wikidata.org/entity/Q3741666 | Federico Ceruti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1532.0 |
1169 | http://www.wikidata.org/entity/Q3746475 | Flavio Ermini | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1947.0 |
1393 | http://www.wikidata.org/entity/Q3762433 | Giambattista Spolverini | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1695.0 |
1459 | http://www.wikidata.org/entity/Q3766734 | Giovanni Battista Pighi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1898.0 |
1489 | http://www.wikidata.org/entity/Q3767945 | Giovanni Pindemonte | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1751.0 |
1566 | http://www.wikidata.org/entity/Q3768974 | Girolamo Pompei | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1731.0 |
1694 | http://www.wikidata.org/entity/Q4015300 | Vittorio Betteloni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1840.0 |
1771 | http://www.wikidata.org/entity/Q3081061 | Francesco Pona | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1595.0 |
1869 | http://www.wikidata.org/entity/Q3837018 | Lorenzo Montano | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1893.0 |
1935 | http://www.wikidata.org/entity/Q3846323 | Marco Ongaro | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1956.0 |
2211 | http://www.wikidata.org/entity/Q6999870 | Rudy De Cadaval | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1933.0 |
2322 | http://www.wikidata.org/entity/Q15432608 | Ortensio Mauro | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1634.0 |
2361 | http://www.wikidata.org/entity/Q15726796 | Teresa Albarelli | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1788.0 |
2390 | http://www.wikidata.org/entity/Q16574305 | Luigi Nogarola | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1669.0 |
2460 | http://www.wikidata.org/entity/Q17341090 | Giovanni Ceriotto | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1883.0 |
2530 | http://www.wikidata.org/entity/Q18945280 | Francesco degli Allegri | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1495.0 |
2531 | http://www.wikidata.org/entity/Q18945373 | Giorgio Summaripa | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1435.0 |
2617 | http://www.wikidata.org/entity/Q19597229 | Giambattista Mutinelli | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1747.0 |
2634 | http://www.wikidata.org/entity/Q20671732 | Pietro Caliari | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1841.0 |
2651 | http://www.wikidata.org/entity/Q23014868 | Ilario Casarotti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1772.0 |
2841 | http://www.wikidata.org/entity/Q30126093 | Girolamo Orti Manara | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1769.0 |
2850 | http://www.wikidata.org/entity/Q30303339 | Paolo Zazzaroni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | NaN |
2872 | http://www.wikidata.org/entity/Q28778065 | Angela Nogarola | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1380.0 |
2885 | http://www.wikidata.org/entity/Q30308589 | Bartolomeo Tortoletti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1560.0 |
How many people in Verona
Display how many people were born in Verona
[6]:
# write here
[6]:
37
Python is everywhere
Show poets born in Catania in the year -500
mind the minus
I swear we did not altered the dataset in any way :-)
[7]:
# write here
[7]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
2231 | http://www.wikidata.org/entity/Q7263938 | Python of Catana | http://www.wikidata.org/entity/Q1903 | Catania | Point(15.087269444 37.502669444) | -500.0 |
Verona after 1500
Display all people born in Verona after the year 1500
[8]:
# write here
[8]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
375 | http://www.wikidata.org/entity/Q1236766 | Ippolito Pindemonte | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1753.0 |
436 | http://www.wikidata.org/entity/Q620193 | Aleardo Aleardi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1812.0 |
755 | http://www.wikidata.org/entity/Q2293943 | Cristina Ali Farah | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1973.0 |
764 | http://www.wikidata.org/entity/Q1587432 | Francesco Scipione, marchese di Maffei | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1675.0 |
858 | http://www.wikidata.org/entity/Q3290043 | Marco Antonio Zucchi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1750.0 |
891 | http://www.wikidata.org/entity/Q3611735 | Alida Airaghi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1953.0 |
1035 | http://www.wikidata.org/entity/Q3638918 | Berto Barbarani | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1872.0 |
1090 | http://www.wikidata.org/entity/Q3663490 | Caterina Bon Brenzoni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1813.0 |
1098 | http://www.wikidata.org/entity/Q3665350 | Cesare Betteloni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1808.0 |
1143 | http://www.wikidata.org/entity/Q3741666 | Federico Ceruti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1532.0 |
1169 | http://www.wikidata.org/entity/Q3746475 | Flavio Ermini | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1947.0 |
1393 | http://www.wikidata.org/entity/Q3762433 | Giambattista Spolverini | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1695.0 |
1459 | http://www.wikidata.org/entity/Q3766734 | Giovanni Battista Pighi | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1898.0 |
1489 | http://www.wikidata.org/entity/Q3767945 | Giovanni Pindemonte | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1751.0 |
1566 | http://www.wikidata.org/entity/Q3768974 | Girolamo Pompei | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1731.0 |
1694 | http://www.wikidata.org/entity/Q4015300 | Vittorio Betteloni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1840.0 |
1771 | http://www.wikidata.org/entity/Q3081061 | Francesco Pona | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1595.0 |
1869 | http://www.wikidata.org/entity/Q3837018 | Lorenzo Montano | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1893.0 |
1935 | http://www.wikidata.org/entity/Q3846323 | Marco Ongaro | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1956.0 |
2211 | http://www.wikidata.org/entity/Q6999870 | Rudy De Cadaval | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1933.0 |
2322 | http://www.wikidata.org/entity/Q15432608 | Ortensio Mauro | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1634.0 |
2361 | http://www.wikidata.org/entity/Q15726796 | Teresa Albarelli | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1788.0 |
2390 | http://www.wikidata.org/entity/Q16574305 | Luigi Nogarola | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1669.0 |
2460 | http://www.wikidata.org/entity/Q17341090 | Giovanni Ceriotto | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1883.0 |
2617 | http://www.wikidata.org/entity/Q19597229 | Giambattista Mutinelli | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1747.0 |
2634 | http://www.wikidata.org/entity/Q20671732 | Pietro Caliari | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1841.0 |
2651 | http://www.wikidata.org/entity/Q23014868 | Ilario Casarotti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1772.0 |
2841 | http://www.wikidata.org/entity/Q30126093 | Girolamo Orti Manara | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1769.0 |
2885 | http://www.wikidata.org/entity/Q30308589 | Bartolomeo Tortoletti | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1560.0 |
First Antonio
Display all people with Antonio as first name
[9]:
# write here
[9]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
47 | http://www.wikidata.org/entity/Q266482 | Antonio Bonfini | http://www.wikidata.org/entity/Q3415 | Ancona | Point(13.516666666 43.616666666) | 1427.0 |
48 | http://www.wikidata.org/entity/Q266482 | Antonio Bonfini | http://www.wikidata.org/entity/Q3897778 | Patrignone | Point(13.60926 42.98027) | 1427.0 |
77 | http://www.wikidata.org/entity/Q348311 | Antonio Tebaldeo | http://www.wikidata.org/entity/Q13362 | Ferrara | Point(11.619865 44.835297) | 1463.0 |
120 | http://www.wikidata.org/entity/Q470067 | Antonio Fogazzaro | http://www.wikidata.org/entity/Q6537 | Vicenza | Point(11.55 45.55) | 1842.0 |
203 | http://www.wikidata.org/entity/Q524960 | Antonio Ghislanzoni | http://www.wikidata.org/entity/Q6237 | Lecco | Point(9.4 45.85) | 1824.0 |
... | ... | ... | ... | ... | ... | ... |
2881 | http://www.wikidata.org/entity/Q30250615 | Antonio Bruni | http://www.wikidata.org/entity/Q52019 | Manduria | Point(17.634166666 40.402777777) | 1593.0 |
2917 | http://www.wikidata.org/entity/Q42941837 | Antonio Decio | http://www.wikidata.org/entity/Q176180 | Orte | Point(12.386111111 42.460277777) | 1560.0 |
2979 | http://www.wikidata.org/entity/Q56166956 | Antonio Rossetti | http://www.wikidata.org/entity/Q51313 | Vasto | Point(14.708219444 42.111588888) | 1770.0 |
3060 | http://www.wikidata.org/entity/Q54860414 | Antonio Ricci | http://www.wikidata.org/entity/Q51240 | Guardiagrele | Point(14.221591666 42.189222222) | 1952.0 |
3135 | http://www.wikidata.org/entity/Q94075340 | Antonio Gasparinetti | http://www.wikidata.org/entity/Q46503 | Ponte di Piave | Point(12.466666666 45.716666666) | 1777.0 |
85 rows × 6 columns
Some Antonio
Display all people with Antonio as one of the names (so also include 'Paolo Antonio Rolli'
)
[10]:
# write here
[10]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
47 | http://www.wikidata.org/entity/Q266482 | Antonio Bonfini | http://www.wikidata.org/entity/Q3415 | Ancona | Point(13.516666666 43.616666666) | 1427.0 |
48 | http://www.wikidata.org/entity/Q266482 | Antonio Bonfini | http://www.wikidata.org/entity/Q3897778 | Patrignone | Point(13.60926 42.98027) | 1427.0 |
53 | http://www.wikidata.org/entity/Q55433 | Michelangelo Antonioni | http://www.wikidata.org/entity/Q13362 | Ferrara | Point(11.619865 44.835297) | 1912.0 |
77 | http://www.wikidata.org/entity/Q348311 | Antonio Tebaldeo | http://www.wikidata.org/entity/Q13362 | Ferrara | Point(11.619865 44.835297) | 1463.0 |
120 | http://www.wikidata.org/entity/Q470067 | Antonio Fogazzaro | http://www.wikidata.org/entity/Q6537 | Vicenza | Point(11.55 45.55) | 1842.0 |
... | ... | ... | ... | ... | ... | ... |
2906 | http://www.wikidata.org/entity/Q41566775 | Carlo Antonio Bertelli | http://www.wikidata.org/entity/Q111705 | Salò | Point(10.533333333 45.6) | 1637.0 |
2917 | http://www.wikidata.org/entity/Q42941837 | Antonio Decio | http://www.wikidata.org/entity/Q176180 | Orte | Point(12.386111111 42.460277777) | 1560.0 |
2979 | http://www.wikidata.org/entity/Q56166956 | Antonio Rossetti | http://www.wikidata.org/entity/Q51313 | Vasto | Point(14.708219444 42.111588888) | 1770.0 |
3060 | http://www.wikidata.org/entity/Q54860414 | Antonio Ricci | http://www.wikidata.org/entity/Q51240 | Guardiagrele | Point(14.221591666 42.189222222) | 1952.0 |
3135 | http://www.wikidata.org/entity/Q94075340 | Antonio Gasparinetti | http://www.wikidata.org/entity/Q46503 | Ponte di Piave | Point(12.466666666 45.716666666) | 1777.0 |
110 rows × 6 columns
Cesares during 1800
Display all people named Cesare who were born in 1800 century
[11]:
# write here
[11]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
389 | http://www.wikidata.org/entity/Q1056872 | Cesare Meano | http://www.wikidata.org/entity/Q495 | Turin | Point(7.7 45.066666666) | 1899.0 |
1098 | http://www.wikidata.org/entity/Q3665350 | Cesare Betteloni | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | 1808.0 |
1101 | http://www.wikidata.org/entity/Q3665409 | Cesare De Titta | http://www.wikidata.org/entity/Q51292 | Sant'Eusanio del Sangro | Point(14.333333333 42.166666666) | 1862.0 |
1105 | http://www.wikidata.org/entity/Q3665495 | Cesare Pascarella | http://www.wikidata.org/entity/Q220 | Rome | Point(12.482777777 41.893055555) | 1858.0 |
The old ones
Show poets in year of birth order
DO NOT include in the result NaN values
HINT: see pd.notnull
[12]:
# write here
[12]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
292 | http://www.wikidata.org/entity/Q332797 | Stesichorus | http://www.wikidata.org/entity/Q54614 | Gioia Tauro | Point(15.9 38.433333333) | -629.0 |
293 | http://www.wikidata.org/entity/Q332802 | Ibycus | http://www.wikidata.org/entity/Q8471 | Reggio Calabria | Point(15.65 38.114438888) | -600.0 |
327 | http://www.wikidata.org/entity/Q336115 | Theognis of Megara | http://www.wikidata.org/entity/Q1457477 | Megara Hyblaea | Point(15.18194444 37.20388889) | -569.0 |
84 | http://www.wikidata.org/entity/Q125551 | Parmenides | http://www.wikidata.org/entity/Q272968 | Velia | Point(15.154444444 40.159444444) | -514.0 |
2575 | http://www.wikidata.org/entity/Q20002641 | Glaucus of Rhegion | http://www.wikidata.org/entity/Q8471 | Reggio Calabria | Point(15.65 38.114438888) | -500.0 |
... | ... | ... | ... | ... | ... | ... |
3104 | http://www.wikidata.org/entity/Q78162153 | Q78162153 | http://www.wikidata.org/entity/Q13678 | Agrigento | Point(13.576547222 37.311075) | 1985.0 |
2989 | http://www.wikidata.org/entity/Q58995193 | Giovanni Bertoglio | http://www.wikidata.org/entity/Q495 | Turin | Point(7.7 45.066666666) | 1986.0 |
2986 | http://www.wikidata.org/entity/Q58308029 | Gio Evan | http://www.wikidata.org/entity/Q19300 | Molfetta | Point(16.6 41.2) | 1988.0 |
2374 | http://www.wikidata.org/entity/Q14922292 | Q14922292 | http://www.wikidata.org/entity/Q80652 | Battipaglia | Point(14.983333333 40.616666666) | 1991.0 |
552 | http://www.wikidata.org/entity/Q1151356 | D.B.P.I.T. | http://www.wikidata.org/entity/Q220 | Rome | Point(12.482777777 41.893055555) | 2000.0 |
3070 rows × 6 columns
Cities of poets
Find the 5 cities with most poets, sorted from most to least.
use
groupby
andsort_values
methods
[13]:
# write here
[13]:
placeLabel
Rome 198
Florence 165
Milan 121
Naples 113
Venice 94
Name: subj, dtype: int64
Most duplicated poets
Find first 8 duplicated poets
[14]:
# write here
[14]:
subjLabel
Sosiphanes 4
Alojz Rebula 4
Eliseo Calenzio 4
Giambattista Andreini 4
Tommaso Grossi 3
Giovanni della Casa 3
Giuseppe Carpani 3
Aulus Gellius 3
Name: subj, dtype: int64
All duplicated poets
Print the number of all duplicated poets
NOTE: a Series object has only one column, even if they look two (the apparent other is the index) - so if you have a Series object you don’t need to specify a column
[15]:
# write here
There are 118 duplicated poets
Northern poets
Find all the poets born north of a given town
. In other words, look for town latitude (the second coordinate in coords
), print it, and then filter the table.
DO NOT put constants like
46.5
in your code!DO NOT add new columns for longitude and latitude
NOTE:
coord
column holds just simple strings!HINT: to get an element at a given numerical index
i
of a filtered Series (regardless of the original dataframe row index), you need to use.iloc[i]
property - note the square brackets!
[ ]:
[16]:
town = 'Bolzano'
#town = 'Trento'
# write here
Latitude of Bolzano : 46.5
[16]:
subj | subjLabel | place | placeLabel | coord | birthyear | |
---|---|---|---|---|---|---|
27 | http://www.wikidata.org/entity/Q45105 | Oswald von Wolkenstein | http://www.wikidata.org/entity/Q1013962 | Schöneck Castle | Point(11.847977 46.820264) | 1377.0 |
41 | http://www.wikidata.org/entity/Q122070 | Simon Lemnius | http://www.wikidata.org/entity/Q257965 | Val Müstair | Point(10.39009 46.60566) | 1511.0 |
42 | http://www.wikidata.org/entity/Q122070 | Simon Lemnius | http://www.wikidata.org/entity/Q257965 | Val Müstair | Point(10.42 46.6) | 1511.0 |
88 | http://www.wikidata.org/entity/Q137683 | Mary de Rachewiltz | http://www.wikidata.org/entity/Q185541 | Brixen | Point(11.65 46.716666666) | 1925.0 |
583 | http://www.wikidata.org/entity/Q873784 | Ignaz Vincenz Zingerle | http://www.wikidata.org/entity/Q131605 | Merano | Point(11.163888888 46.668888888) | 1825.0 |
636 | http://www.wikidata.org/entity/Q1705031 | Josef Kostner | http://www.wikidata.org/entity/Q255568 | Urtijëi | Point(11.666666666 46.566666666) | 1933.0 |
637 | http://www.wikidata.org/entity/Q1705031 | Josef Kostner | http://www.wikidata.org/entity/Q255568 | Urtijëi | Point(11.66748 46.57432) | 1933.0 |
770 | http://www.wikidata.org/entity/Q1996716 | Norbert Conrad Kaser | http://www.wikidata.org/entity/Q185541 | Brixen | Point(11.65 46.716666666) | 1947.0 |
1905 | http://www.wikidata.org/entity/Q3839806 | Luigi Maieron | http://www.wikidata.org/entity/Q53240 | Cercivento | Point(12.983333333 46.533333333) | 1954.0 |
2152 | http://www.wikidata.org/entity/Q4505559 | Karl Ziegler | http://www.wikidata.org/entity/Q504216 | St. Martin in Passeier | Point(11.22727 46.78392) | 1812.0 |
2741 | http://www.wikidata.org/entity/Q24073666 | Fedele Demetz | http://www.wikidata.org/entity/Q499129 | Sëlva | Point(11.76038 46.55472) | 1850.0 |
2939 | http://www.wikidata.org/entity/Q55471982 | Roberta Dapunt | http://www.wikidata.org/entity/Q644159 | Val Badia | Point(11.89917 46.68361) | 1970.0 |
2940 | http://www.wikidata.org/entity/Q55471982 | Roberta Dapunt | http://www.wikidata.org/entity/Q644159 | Val Badia | Point(11.9 46.683333333) | 1970.0 |
3128 | http://www.wikidata.org/entity/Q95185585 | Anna Katharina Mair | http://www.wikidata.org/entity/Q185541 | Brixen | Point(11.65 46.716666666) | 1967.0 |
Papers please
Extract subject id (i.e. Q8797
) and place id (i.e. Q2028
) and MODIFY df
by putting them into two new columns subj_id
and place_id
[17]:
# write here
[17]:
subj | subjLabel | place | placeLabel | coord | birthyear | subj_id | place_id | |
---|---|---|---|---|---|---|---|---|
0 | http://www.wikidata.org/entity/Q8797 | Aemilius Macer | http://www.wikidata.org/entity/Q2028 | Verona | Point(10.992777777 45.438611111) | NaN | Q8797 | Q2028 |
1 | http://www.wikidata.org/entity/Q8833 | Gaius Maecenas | http://www.wikidata.org/entity/Q13378 | Arezzo | Point(11.878055555 43.463055555) | NaN | Q8833 | Q13378 |
2 | http://www.wikidata.org/entity/Q5592 | Michelangelo | http://www.wikidata.org/entity/Q52069 | Caprese Michelangelo | Point(11.985833333 43.640833333) | 1475.0 | Q5592 | Q52069 |
3 | http://www.wikidata.org/entity/Q6197 | Horace | http://www.wikidata.org/entity/Q52691 | Venosa | Point(15.816666666 40.966666666) | -64.0 | Q6197 | Q52691 |
4 | http://www.wikidata.org/entity/Q7170 | Sallust | http://www.wikidata.org/entity/Q177061 | Amiternum | Point(13.305769 42.400776) | -85.0 | Q7170 | Q177061 |
... | ... | ... | ... | ... | ... | ... | ... | ... |
3193 | http://www.wikidata.org/entity/Q99308713 | Giovanni Marrasio | http://www.wikidata.org/entity/Q487174 | Noto | Point(15.083333333 36.883333333) | 1405.0 | Q99308713 | Q487174 |
3194 | http://www.wikidata.org/entity/Q100775377 | Annemarie Innerebner | http://www.wikidata.org/entity/Q6526 | Bolzano | Point(11.35 46.5) | 1924.0 | Q100775377 | Q6526 |
3195 | http://www.wikidata.org/entity/Q100775410 | Nesti Lyro Wollek | http://www.wikidata.org/entity/Q2044 | Florence | Point(11.254166666 43.771388888) | 1875.0 | Q100775410 | Q2044 |
3196 | http://www.wikidata.org/entity/Q99655533 | Lorenza Meletti | http://www.wikidata.org/entity/Q95093 | Bondeno | Point(11.41542 44.88944) | 1940.0 | Q99655533 | Q95093 |
3197 | http://www.wikidata.org/entity/Q99912619 | Nanni Falconi | http://www.wikidata.org/entity/Q391218 | Pattada | Point(9.11 40.582222222) | 1950.0 | Q99912619 | Q391218 |
3198 rows × 8 columns
Unknown poets
Find all the ids of nameless poets and put them in a python list.
DO NOT use loops
NOTE a Series object from the point of view of Python is just a sequence
[18]:
# write here
[18]:
['Q4360247',
'Q14922292',
'Q19130448',
'Q21207901',
'Q19984452',
'Q21209119',
'Q21282215',
'Q23673492',
'Q29049430',
'Q29052339',
'Q31763467',
'Q28465822',
'Q48809843',
'Q27553577',
'Q48811051',
'Q48861610',
'Q55441810',
'Q47468550',
'Q50327630',
'Q50330028',
'Q55897192',
'Q65019765',
'Q51845316',
'Q60838260',
'Q64433131',
'Q71684946',
'Q93338246',
'Q59187521',
'Q61136330',
'Q61450547',
'Q52107491',
'Q61790603',
'Q61791394',
'Q61827513',
'Q61895377',
'Q59851133',
'Q59851150',
'Q62066746',
'Q66736238',
'Q66921487',
'Q85421610',
'Q61080035',
'Q87068357',
'Q64031897',
'Q64167386',
'Q64364409',
'Q69818426',
'Q64512266',
'Q78162153',
'Q78499894',
'Q88264630',
'Q89674973',
'Q94998318',
'Q94325725',
'Q84138681',
'Q98102965',
'Q80705985',
'Q81100287',
'Q81738068',
'Q83643534',
'Q83808244',
'Q96097742',
'Q96245247',
'Q96248786',
'Q95485499',
'Q99196947']
Better unknown poets
Find all the ids, the birthplace and birthdate of nameless poets born after year 0, and put them in a python list of tuples.
birthplaces must be integers - if not specified, put
-1
print also how many results were found
DO NOT use loops nor list comprehensions
[19]:
# write here
Found 66 results
[19]:
[('Q4360247', 'Rome', 1907),
('Q14922292', 'Battipaglia', 1991),
('Q19130448', 'Vicenza', 1492),
('Q21207901', 'Aradeo', -1),
('Q19984452', 'Anghiari', -1),
('Q21209119', 'Giuliano Teatino', 1711),
('Q21282215', 'Palermo', 1590),
('Q23673492', 'Butera', -1),
('Q29049430', 'Fondi', -1),
('Q29052339', 'Taranto', 1733),
('Q31763467', 'Caltanissetta', 1755),
('Q28465822', 'Orvieto', 1700),
('Q48809843', 'Palermo', -1),
('Q27553577', 'Cavriana', 1250),
('Q48811051', 'Roccabernarda', 1550),
('Q48861610', 'Vittorio Veneto', 1452),
('Q55441810', 'Vernio', 1844),
('Q47468550', 'Florence', 1607),
('Q50327630', 'Rome', 1700),
('Q50330028', 'Rome', 1680),
('Q55897192', 'Narni', 1872),
('Q65019765', 'Afragola', 1853),
('Q51845316', 'Genoa', -1),
('Q60838260', 'Tuscany', -1),
('Q64433131', 'Castel Goffredo', -1),
('Q71684946', "Trezzo sull'Adda", -1),
('Q93338246', 'Viterbo', -1),
('Q59187521', 'Padua', 1850),
('Q61136330', 'Osilo', 1865),
('Q61450547', 'Siderno', 1847),
('Q52107491', 'Rome', 1680),
('Q61790603', 'Veneto', 1857),
('Q61791394', 'Piedimonte del Calvario', 1905),
('Q61827513', 'Grimacco', 1847),
('Q61895377', 'Como', 1731),
('Q59851133', 'Gorizia', 1883),
('Q59851150', 'Trieste', 1929),
('Q62066746', 'Bologna', 1926),
('Q66736238', 'Lucca', 1635),
('Q66921487', 'Alghero', 1914),
('Q85421610', 'Alghero', 1869),
('Q61080035', 'Greci', 1830),
('Q87068357', 'Polla', 1970),
('Q64031897', 'Castel Goffredo', 1445),
('Q64167386', 'Sambiase', 1930),
('Q64364409', 'Corleone', 1752),
('Q69818426', 'Galati Mamertino', 1923),
('Q64512266', 'Rieti', 1890),
('Q78162153', 'Agrigento', 1985),
('Q78499894', 'San Giorgio a Cremano', 1932),
('Q88264630', 'Bari', 1930),
('Q89674973', 'Genoa', 1939),
('Q94998318', 'Trieste', 1939),
('Q94325725', 'Orune', 1908),
('Q84138681', 'Rome', -1),
('Q98102965', 'Florence', -1),
('Q80705985', 'Patti', 1879),
('Q81100287', 'Lecco', 1935),
('Q81738068', 'Rogolo', 1940),
('Q83643534', 'Alessandria', 1717),
('Q83808244', 'Cancello e Arnone', 1921),
('Q96097742', 'Milan', 1982),
('Q96245247', 'Lecce', 1919),
('Q96248786', 'Guardia Lombardi', 1970),
('Q95485499', 'Borgo San Martino', 1907),
('Q99196947', 'Tivoli', 1973)]