Pandas - 3. The Land of Poets Challenge

Download exercises

For a digital humanities project you need to display Italian poets by filtering a csv table according to various criteria. This challenge will be only about querying with pandas, which is something you might find convenient to do during exams for quickly understanding datasets content (using pandas will always be optional, you will never be asked to perform complex modifications with it)

You are given a dataset taken from Wikidata, a project by the Wikimedia foundation which aims to store only machine-readable data, like numbers, strings, and so on interlinked with many references. Each entity in Wikidata has an identifier, for example Dante Alighieri is the entity Q1067 and Florence is Q2044

Wikidata can be queried using the SPARQL language: the data was obtained with this query and downloaded in CSV format (among the many which can be chosen). Even if not necessary for the purposes of the exercise, you are invited to play a bit with the interface, like trying different visualizations (i.e. try select map in the middle-left corner) - or see other examples

What to do

  1. If you haven’t already, install Pandas:

    Anaconda:

    conda install pandas

    Without Anaconda (--user installs in your home):

    python3 -m pip install --user pandas

  2. unzip exercises in a folder, you should get something like this:

pandas
    pandas1-sol.ipynb
    pandas1.ipynb
    pandas2-sol.ipynb
    pandas2.ipynb
    pandas3-chal.ipynb
    jupman.py

WARNING 1: to correctly visualize the notebook, it MUST be in an unzipped folder !

  1. open Jupyter Notebook from that folder. Two things should open, first a console and then browser.

  2. The browser should show a file list: navigate the list and open the notebook pandas3-chal.ipynb

WARNING 2: DO NOT use the Upload button in Jupyter, instead navigate in Jupyter browser to the unzipped folder !

  1. Go on reading that notebook, and follow instuctions inside.

Shortcut keys:

  • to execute Python code inside a Jupyter cell, press Control + Enter

  • to execute Python code inside a Jupyter cell AND select next cell, press Shift + Enter

  • to execute Python code inside a Jupyter cell AND a create a new cell aftwerwards, press Alt + Enter

  • If the notebooks look stuck, try to select Kernel -> Restart

Load the dataset

First load the dataset italian-poets.csv in pandas dataframe df

  • USE UTF-8 as encoding

[1]:
# write here


Tell me more

Show some info about the dataset

[2]:
# write here


<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3198 entries, 0 to 3197
Data columns (total 6 columns):
 #   Column      Non-Null Count  Dtype
---  ------      --------------  -----
 0   subj        3198 non-null   object
 1   subjLabel   3198 non-null   object
 2   place       3198 non-null   object
 3   placeLabel  3198 non-null   object
 4   coord       3198 non-null   object
 5   birthyear   3070 non-null   float64
dtypes: float64(1), object(5)
memory usage: 150.0+ KB

Getting in shape

Show the rows and the columns counts:

[3]:
# write here


rows: 3198
columns: 6

10 rows

Display first 10 rows

[4]:
# write here


[4]:
subj subjLabel place placeLabel coord birthyear
0 http://www.wikidata.org/entity/Q8797 Aemilius Macer http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) NaN
1 http://www.wikidata.org/entity/Q8833 Gaius Maecenas http://www.wikidata.org/entity/Q13378 Arezzo Point(11.878055555 43.463055555) NaN
2 http://www.wikidata.org/entity/Q5592 Michelangelo http://www.wikidata.org/entity/Q52069 Caprese Michelangelo Point(11.985833333 43.640833333) 1475.0
3 http://www.wikidata.org/entity/Q6197 Horace http://www.wikidata.org/entity/Q52691 Venosa Point(15.816666666 40.966666666) -64.0
4 http://www.wikidata.org/entity/Q7170 Sallust http://www.wikidata.org/entity/Q177061 Amiternum Point(13.305769 42.400776) -85.0
5 http://www.wikidata.org/entity/Q7198 Ovid http://www.wikidata.org/entity/Q50157 Sulmona Point(13.926198 42.048025) -42.0
6 http://www.wikidata.org/entity/Q7728 Grazia Deledda http://www.wikidata.org/entity/Q13649 Nuoro Point(9.3280792 40.3200621) 1871.0
7 http://www.wikidata.org/entity/Q7803 Bronzino http://www.wikidata.org/entity/Q2044 Florence Point(11.254166666 43.771388888) 1503.0
8 http://www.wikidata.org/entity/Q8796 Sandra Lombardi http://www.wikidata.org/entity/Q220 Rome Point(12.482777777 41.893055555) 1946.0
9 http://www.wikidata.org/entity/Q8800 Gaius Maecenas Melissus http://www.wikidata.org/entity/Q20571 Spoleto Point(12.733333333 42.733333333) -100.0

Born in Verona

Display all people born in Verona

[5]:
# write here


[5]:
subj subjLabel place placeLabel coord birthyear
0 http://www.wikidata.org/entity/Q8797 Aemilius Macer http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) NaN
135 http://www.wikidata.org/entity/Q163079 Catullus http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) -83.0
232 http://www.wikidata.org/entity/Q318593 Girolamo Fracastoro http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1478.0
256 http://www.wikidata.org/entity/Q539577 Guarino da Verona http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1374.0
375 http://www.wikidata.org/entity/Q1236766 Ippolito Pindemonte http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1753.0
436 http://www.wikidata.org/entity/Q620193 Aleardo Aleardi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1812.0
755 http://www.wikidata.org/entity/Q2293943 Cristina Ali Farah http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1973.0
764 http://www.wikidata.org/entity/Q1587432 Francesco Scipione, marchese di Maffei http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1675.0
858 http://www.wikidata.org/entity/Q3290043 Marco Antonio Zucchi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1750.0
891 http://www.wikidata.org/entity/Q3611735 Alida Airaghi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1953.0
1035 http://www.wikidata.org/entity/Q3638918 Berto Barbarani http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1872.0
1090 http://www.wikidata.org/entity/Q3663490 Caterina Bon Brenzoni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1813.0
1098 http://www.wikidata.org/entity/Q3665350 Cesare Betteloni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1808.0
1143 http://www.wikidata.org/entity/Q3741666 Federico Ceruti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1532.0
1169 http://www.wikidata.org/entity/Q3746475 Flavio Ermini http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1947.0
1393 http://www.wikidata.org/entity/Q3762433 Giambattista Spolverini http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1695.0
1459 http://www.wikidata.org/entity/Q3766734 Giovanni Battista Pighi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1898.0
1489 http://www.wikidata.org/entity/Q3767945 Giovanni Pindemonte http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1751.0
1566 http://www.wikidata.org/entity/Q3768974 Girolamo Pompei http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1731.0
1694 http://www.wikidata.org/entity/Q4015300 Vittorio Betteloni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1840.0
1771 http://www.wikidata.org/entity/Q3081061 Francesco Pona http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1595.0
1869 http://www.wikidata.org/entity/Q3837018 Lorenzo Montano http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1893.0
1935 http://www.wikidata.org/entity/Q3846323 Marco Ongaro http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1956.0
2211 http://www.wikidata.org/entity/Q6999870 Rudy De Cadaval http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1933.0
2322 http://www.wikidata.org/entity/Q15432608 Ortensio Mauro http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1634.0
2361 http://www.wikidata.org/entity/Q15726796 Teresa Albarelli http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1788.0
2390 http://www.wikidata.org/entity/Q16574305 Luigi Nogarola http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1669.0
2460 http://www.wikidata.org/entity/Q17341090 Giovanni Ceriotto http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1883.0
2530 http://www.wikidata.org/entity/Q18945280 Francesco degli Allegri http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1495.0
2531 http://www.wikidata.org/entity/Q18945373 Giorgio Summaripa http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1435.0
2617 http://www.wikidata.org/entity/Q19597229 Giambattista Mutinelli http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1747.0
2634 http://www.wikidata.org/entity/Q20671732 Pietro Caliari http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1841.0
2651 http://www.wikidata.org/entity/Q23014868 Ilario Casarotti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1772.0
2841 http://www.wikidata.org/entity/Q30126093 Girolamo Orti Manara http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1769.0
2850 http://www.wikidata.org/entity/Q30303339 Paolo Zazzaroni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) NaN
2872 http://www.wikidata.org/entity/Q28778065 Angela Nogarola http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1380.0
2885 http://www.wikidata.org/entity/Q30308589 Bartolomeo Tortoletti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1560.0

How many people in Verona

Display how many people were born in Verona

[6]:
# write here


[6]:
37

Python is everywhere

Show poets born in Catania in the year -500

  • mind the minus

  • I swear we did not altered the dataset in any way :-)

[7]:
# write here


[7]:
subj subjLabel place placeLabel coord birthyear
2231 http://www.wikidata.org/entity/Q7263938 Python of Catana http://www.wikidata.org/entity/Q1903 Catania Point(15.087269444 37.502669444) -500.0

Verona after 1500

Display all people born in Verona after the year 1500

[8]:
# write here


[8]:
subj subjLabel place placeLabel coord birthyear
375 http://www.wikidata.org/entity/Q1236766 Ippolito Pindemonte http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1753.0
436 http://www.wikidata.org/entity/Q620193 Aleardo Aleardi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1812.0
755 http://www.wikidata.org/entity/Q2293943 Cristina Ali Farah http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1973.0
764 http://www.wikidata.org/entity/Q1587432 Francesco Scipione, marchese di Maffei http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1675.0
858 http://www.wikidata.org/entity/Q3290043 Marco Antonio Zucchi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1750.0
891 http://www.wikidata.org/entity/Q3611735 Alida Airaghi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1953.0
1035 http://www.wikidata.org/entity/Q3638918 Berto Barbarani http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1872.0
1090 http://www.wikidata.org/entity/Q3663490 Caterina Bon Brenzoni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1813.0
1098 http://www.wikidata.org/entity/Q3665350 Cesare Betteloni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1808.0
1143 http://www.wikidata.org/entity/Q3741666 Federico Ceruti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1532.0
1169 http://www.wikidata.org/entity/Q3746475 Flavio Ermini http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1947.0
1393 http://www.wikidata.org/entity/Q3762433 Giambattista Spolverini http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1695.0
1459 http://www.wikidata.org/entity/Q3766734 Giovanni Battista Pighi http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1898.0
1489 http://www.wikidata.org/entity/Q3767945 Giovanni Pindemonte http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1751.0
1566 http://www.wikidata.org/entity/Q3768974 Girolamo Pompei http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1731.0
1694 http://www.wikidata.org/entity/Q4015300 Vittorio Betteloni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1840.0
1771 http://www.wikidata.org/entity/Q3081061 Francesco Pona http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1595.0
1869 http://www.wikidata.org/entity/Q3837018 Lorenzo Montano http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1893.0
1935 http://www.wikidata.org/entity/Q3846323 Marco Ongaro http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1956.0
2211 http://www.wikidata.org/entity/Q6999870 Rudy De Cadaval http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1933.0
2322 http://www.wikidata.org/entity/Q15432608 Ortensio Mauro http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1634.0
2361 http://www.wikidata.org/entity/Q15726796 Teresa Albarelli http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1788.0
2390 http://www.wikidata.org/entity/Q16574305 Luigi Nogarola http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1669.0
2460 http://www.wikidata.org/entity/Q17341090 Giovanni Ceriotto http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1883.0
2617 http://www.wikidata.org/entity/Q19597229 Giambattista Mutinelli http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1747.0
2634 http://www.wikidata.org/entity/Q20671732 Pietro Caliari http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1841.0
2651 http://www.wikidata.org/entity/Q23014868 Ilario Casarotti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1772.0
2841 http://www.wikidata.org/entity/Q30126093 Girolamo Orti Manara http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1769.0
2885 http://www.wikidata.org/entity/Q30308589 Bartolomeo Tortoletti http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1560.0

First Antonio

Display all people with Antonio as first name

[9]:
# write here


[9]:
subj subjLabel place placeLabel coord birthyear
47 http://www.wikidata.org/entity/Q266482 Antonio Bonfini http://www.wikidata.org/entity/Q3415 Ancona Point(13.516666666 43.616666666) 1427.0
48 http://www.wikidata.org/entity/Q266482 Antonio Bonfini http://www.wikidata.org/entity/Q3897778 Patrignone Point(13.60926 42.98027) 1427.0
77 http://www.wikidata.org/entity/Q348311 Antonio Tebaldeo http://www.wikidata.org/entity/Q13362 Ferrara Point(11.619865 44.835297) 1463.0
120 http://www.wikidata.org/entity/Q470067 Antonio Fogazzaro http://www.wikidata.org/entity/Q6537 Vicenza Point(11.55 45.55) 1842.0
203 http://www.wikidata.org/entity/Q524960 Antonio Ghislanzoni http://www.wikidata.org/entity/Q6237 Lecco Point(9.4 45.85) 1824.0
... ... ... ... ... ... ...
2881 http://www.wikidata.org/entity/Q30250615 Antonio Bruni http://www.wikidata.org/entity/Q52019 Manduria Point(17.634166666 40.402777777) 1593.0
2917 http://www.wikidata.org/entity/Q42941837 Antonio Decio http://www.wikidata.org/entity/Q176180 Orte Point(12.386111111 42.460277777) 1560.0
2979 http://www.wikidata.org/entity/Q56166956 Antonio Rossetti http://www.wikidata.org/entity/Q51313 Vasto Point(14.708219444 42.111588888) 1770.0
3060 http://www.wikidata.org/entity/Q54860414 Antonio Ricci http://www.wikidata.org/entity/Q51240 Guardiagrele Point(14.221591666 42.189222222) 1952.0
3135 http://www.wikidata.org/entity/Q94075340 Antonio Gasparinetti http://www.wikidata.org/entity/Q46503 Ponte di Piave Point(12.466666666 45.716666666) 1777.0

85 rows × 6 columns

Some Antonio

Display all people with Antonio as one of the names (so also include 'Paolo Antonio Rolli')

[10]:
# write here


[10]:
subj subjLabel place placeLabel coord birthyear
47 http://www.wikidata.org/entity/Q266482 Antonio Bonfini http://www.wikidata.org/entity/Q3415 Ancona Point(13.516666666 43.616666666) 1427.0
48 http://www.wikidata.org/entity/Q266482 Antonio Bonfini http://www.wikidata.org/entity/Q3897778 Patrignone Point(13.60926 42.98027) 1427.0
53 http://www.wikidata.org/entity/Q55433 Michelangelo Antonioni http://www.wikidata.org/entity/Q13362 Ferrara Point(11.619865 44.835297) 1912.0
77 http://www.wikidata.org/entity/Q348311 Antonio Tebaldeo http://www.wikidata.org/entity/Q13362 Ferrara Point(11.619865 44.835297) 1463.0
120 http://www.wikidata.org/entity/Q470067 Antonio Fogazzaro http://www.wikidata.org/entity/Q6537 Vicenza Point(11.55 45.55) 1842.0
... ... ... ... ... ... ...
2906 http://www.wikidata.org/entity/Q41566775 Carlo Antonio Bertelli http://www.wikidata.org/entity/Q111705 Salò Point(10.533333333 45.6) 1637.0
2917 http://www.wikidata.org/entity/Q42941837 Antonio Decio http://www.wikidata.org/entity/Q176180 Orte Point(12.386111111 42.460277777) 1560.0
2979 http://www.wikidata.org/entity/Q56166956 Antonio Rossetti http://www.wikidata.org/entity/Q51313 Vasto Point(14.708219444 42.111588888) 1770.0
3060 http://www.wikidata.org/entity/Q54860414 Antonio Ricci http://www.wikidata.org/entity/Q51240 Guardiagrele Point(14.221591666 42.189222222) 1952.0
3135 http://www.wikidata.org/entity/Q94075340 Antonio Gasparinetti http://www.wikidata.org/entity/Q46503 Ponte di Piave Point(12.466666666 45.716666666) 1777.0

110 rows × 6 columns

Cesares during 1800

Display all people named Cesare who were born in 1800 century

[11]:
# write here


[11]:
subj subjLabel place placeLabel coord birthyear
389 http://www.wikidata.org/entity/Q1056872 Cesare Meano http://www.wikidata.org/entity/Q495 Turin Point(7.7 45.066666666) 1899.0
1098 http://www.wikidata.org/entity/Q3665350 Cesare Betteloni http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) 1808.0
1101 http://www.wikidata.org/entity/Q3665409 Cesare De Titta http://www.wikidata.org/entity/Q51292 Sant'Eusanio del Sangro Point(14.333333333 42.166666666) 1862.0
1105 http://www.wikidata.org/entity/Q3665495 Cesare Pascarella http://www.wikidata.org/entity/Q220 Rome Point(12.482777777 41.893055555) 1858.0

The old ones

Show poets in year of birth order

  • DO NOT include in the result NaN values

HINT: see pd.notnull

[12]:
# write here


[12]:
subj subjLabel place placeLabel coord birthyear
292 http://www.wikidata.org/entity/Q332797 Stesichorus http://www.wikidata.org/entity/Q54614 Gioia Tauro Point(15.9 38.433333333) -629.0
293 http://www.wikidata.org/entity/Q332802 Ibycus http://www.wikidata.org/entity/Q8471 Reggio Calabria Point(15.65 38.114438888) -600.0
327 http://www.wikidata.org/entity/Q336115 Theognis of Megara http://www.wikidata.org/entity/Q1457477 Megara Hyblaea Point(15.18194444 37.20388889) -569.0
84 http://www.wikidata.org/entity/Q125551 Parmenides http://www.wikidata.org/entity/Q272968 Velia Point(15.154444444 40.159444444) -514.0
2575 http://www.wikidata.org/entity/Q20002641 Glaucus of Rhegion http://www.wikidata.org/entity/Q8471 Reggio Calabria Point(15.65 38.114438888) -500.0
... ... ... ... ... ... ...
3104 http://www.wikidata.org/entity/Q78162153 Q78162153 http://www.wikidata.org/entity/Q13678 Agrigento Point(13.576547222 37.311075) 1985.0
2989 http://www.wikidata.org/entity/Q58995193 Giovanni Bertoglio http://www.wikidata.org/entity/Q495 Turin Point(7.7 45.066666666) 1986.0
2986 http://www.wikidata.org/entity/Q58308029 Gio Evan http://www.wikidata.org/entity/Q19300 Molfetta Point(16.6 41.2) 1988.0
2374 http://www.wikidata.org/entity/Q14922292 Q14922292 http://www.wikidata.org/entity/Q80652 Battipaglia Point(14.983333333 40.616666666) 1991.0
552 http://www.wikidata.org/entity/Q1151356 D.B.P.I.T. http://www.wikidata.org/entity/Q220 Rome Point(12.482777777 41.893055555) 2000.0

3070 rows × 6 columns

Cities of poets

Find the 5 cities with most poets, sorted from most to least.

  • use groupby and sort_values methods

[13]:
# write here


[13]:
placeLabel
Rome        198
Florence    165
Milan       121
Naples      113
Venice       94
Name: subj, dtype: int64

Most duplicated poets

Find first 8 duplicated poets

[14]:
# write here


[14]:
subjLabel
Sosiphanes               4
Alojz Rebula             4
Eliseo Calenzio          4
Giambattista Andreini    4
Tommaso Grossi           3
Giovanni della Casa      3
Giuseppe Carpani         3
Aulus Gellius            3
Name: subj, dtype: int64

All duplicated poets

Print the number of all duplicated poets

NOTE: a Series object has only one column, even if they look two (the apparent other is the index) - so if you have a Series object you don’t need to specify a column

[15]:
# write here


There are 118 duplicated poets

Northern poets

Find all the poets born north of a given town. In other words, look for town latitude (the second coordinate in coords), print it, and then filter the table.

  • DO NOT put constants like 46.5 in your code!

  • DO NOT add new columns for longitude and latitude

  • NOTE: coord column holds just simple strings!

  • HINT: to get an element at a given numerical index i of a filtered Series (regardless of the original dataframe row index), you need to use .iloc[i] property - note the square brackets!

[ ]:

[16]:
town = 'Bolzano'
#town = 'Trento'

# write here


Latitude of Bolzano : 46.5
[16]:
subj subjLabel place placeLabel coord birthyear
27 http://www.wikidata.org/entity/Q45105 Oswald von Wolkenstein http://www.wikidata.org/entity/Q1013962 Schöneck Castle Point(11.847977 46.820264) 1377.0
41 http://www.wikidata.org/entity/Q122070 Simon Lemnius http://www.wikidata.org/entity/Q257965 Val Müstair Point(10.39009 46.60566) 1511.0
42 http://www.wikidata.org/entity/Q122070 Simon Lemnius http://www.wikidata.org/entity/Q257965 Val Müstair Point(10.42 46.6) 1511.0
88 http://www.wikidata.org/entity/Q137683 Mary de Rachewiltz http://www.wikidata.org/entity/Q185541 Brixen Point(11.65 46.716666666) 1925.0
583 http://www.wikidata.org/entity/Q873784 Ignaz Vincenz Zingerle http://www.wikidata.org/entity/Q131605 Merano Point(11.163888888 46.668888888) 1825.0
636 http://www.wikidata.org/entity/Q1705031 Josef Kostner http://www.wikidata.org/entity/Q255568 Urtijëi Point(11.666666666 46.566666666) 1933.0
637 http://www.wikidata.org/entity/Q1705031 Josef Kostner http://www.wikidata.org/entity/Q255568 Urtijëi Point(11.66748 46.57432) 1933.0
770 http://www.wikidata.org/entity/Q1996716 Norbert Conrad Kaser http://www.wikidata.org/entity/Q185541 Brixen Point(11.65 46.716666666) 1947.0
1905 http://www.wikidata.org/entity/Q3839806 Luigi Maieron http://www.wikidata.org/entity/Q53240 Cercivento Point(12.983333333 46.533333333) 1954.0
2152 http://www.wikidata.org/entity/Q4505559 Karl Ziegler http://www.wikidata.org/entity/Q504216 St. Martin in Passeier Point(11.22727 46.78392) 1812.0
2741 http://www.wikidata.org/entity/Q24073666 Fedele Demetz http://www.wikidata.org/entity/Q499129 Sëlva Point(11.76038 46.55472) 1850.0
2939 http://www.wikidata.org/entity/Q55471982 Roberta Dapunt http://www.wikidata.org/entity/Q644159 Val Badia Point(11.89917 46.68361) 1970.0
2940 http://www.wikidata.org/entity/Q55471982 Roberta Dapunt http://www.wikidata.org/entity/Q644159 Val Badia Point(11.9 46.683333333) 1970.0
3128 http://www.wikidata.org/entity/Q95185585 Anna Katharina Mair http://www.wikidata.org/entity/Q185541 Brixen Point(11.65 46.716666666) 1967.0

Papers please

Extract subject id (i.e. Q8797) and place id (i.e. Q2028) and MODIFY df by putting them into two new columns subj_id and place_id

[17]:
# write here


[17]:
subj subjLabel place placeLabel coord birthyear subj_id place_id
0 http://www.wikidata.org/entity/Q8797 Aemilius Macer http://www.wikidata.org/entity/Q2028 Verona Point(10.992777777 45.438611111) NaN Q8797 Q2028
1 http://www.wikidata.org/entity/Q8833 Gaius Maecenas http://www.wikidata.org/entity/Q13378 Arezzo Point(11.878055555 43.463055555) NaN Q8833 Q13378
2 http://www.wikidata.org/entity/Q5592 Michelangelo http://www.wikidata.org/entity/Q52069 Caprese Michelangelo Point(11.985833333 43.640833333) 1475.0 Q5592 Q52069
3 http://www.wikidata.org/entity/Q6197 Horace http://www.wikidata.org/entity/Q52691 Venosa Point(15.816666666 40.966666666) -64.0 Q6197 Q52691
4 http://www.wikidata.org/entity/Q7170 Sallust http://www.wikidata.org/entity/Q177061 Amiternum Point(13.305769 42.400776) -85.0 Q7170 Q177061
... ... ... ... ... ... ... ... ...
3193 http://www.wikidata.org/entity/Q99308713 Giovanni Marrasio http://www.wikidata.org/entity/Q487174 Noto Point(15.083333333 36.883333333) 1405.0 Q99308713 Q487174
3194 http://www.wikidata.org/entity/Q100775377 Annemarie Innerebner http://www.wikidata.org/entity/Q6526 Bolzano Point(11.35 46.5) 1924.0 Q100775377 Q6526
3195 http://www.wikidata.org/entity/Q100775410 Nesti Lyro Wollek http://www.wikidata.org/entity/Q2044 Florence Point(11.254166666 43.771388888) 1875.0 Q100775410 Q2044
3196 http://www.wikidata.org/entity/Q99655533 Lorenza Meletti http://www.wikidata.org/entity/Q95093 Bondeno Point(11.41542 44.88944) 1940.0 Q99655533 Q95093
3197 http://www.wikidata.org/entity/Q99912619 Nanni Falconi http://www.wikidata.org/entity/Q391218 Pattada Point(9.11 40.582222222) 1950.0 Q99912619 Q391218

3198 rows × 8 columns

Unknown poets

Find all the ids of nameless poets and put them in a python list.

  • DO NOT use loops

  • NOTE a Series object from the point of view of Python is just a sequence

[18]:
# write here


[18]:
['Q4360247',
 'Q14922292',
 'Q19130448',
 'Q21207901',
 'Q19984452',
 'Q21209119',
 'Q21282215',
 'Q23673492',
 'Q29049430',
 'Q29052339',
 'Q31763467',
 'Q28465822',
 'Q48809843',
 'Q27553577',
 'Q48811051',
 'Q48861610',
 'Q55441810',
 'Q47468550',
 'Q50327630',
 'Q50330028',
 'Q55897192',
 'Q65019765',
 'Q51845316',
 'Q60838260',
 'Q64433131',
 'Q71684946',
 'Q93338246',
 'Q59187521',
 'Q61136330',
 'Q61450547',
 'Q52107491',
 'Q61790603',
 'Q61791394',
 'Q61827513',
 'Q61895377',
 'Q59851133',
 'Q59851150',
 'Q62066746',
 'Q66736238',
 'Q66921487',
 'Q85421610',
 'Q61080035',
 'Q87068357',
 'Q64031897',
 'Q64167386',
 'Q64364409',
 'Q69818426',
 'Q64512266',
 'Q78162153',
 'Q78499894',
 'Q88264630',
 'Q89674973',
 'Q94998318',
 'Q94325725',
 'Q84138681',
 'Q98102965',
 'Q80705985',
 'Q81100287',
 'Q81738068',
 'Q83643534',
 'Q83808244',
 'Q96097742',
 'Q96245247',
 'Q96248786',
 'Q95485499',
 'Q99196947']

Better unknown poets

Find all the ids, the birthplace and birthdate of nameless poets born after year 0, and put them in a python list of tuples.

  • birthplaces must be integers - if not specified, put -1

  • print also how many results were found

  • DO NOT use loops nor list comprehensions

[19]:
# write here


Found 66 results
[19]:
[('Q4360247', 'Rome', 1907),
 ('Q14922292', 'Battipaglia', 1991),
 ('Q19130448', 'Vicenza', 1492),
 ('Q21207901', 'Aradeo', -1),
 ('Q19984452', 'Anghiari', -1),
 ('Q21209119', 'Giuliano Teatino', 1711),
 ('Q21282215', 'Palermo', 1590),
 ('Q23673492', 'Butera', -1),
 ('Q29049430', 'Fondi', -1),
 ('Q29052339', 'Taranto', 1733),
 ('Q31763467', 'Caltanissetta', 1755),
 ('Q28465822', 'Orvieto', 1700),
 ('Q48809843', 'Palermo', -1),
 ('Q27553577', 'Cavriana', 1250),
 ('Q48811051', 'Roccabernarda', 1550),
 ('Q48861610', 'Vittorio Veneto', 1452),
 ('Q55441810', 'Vernio', 1844),
 ('Q47468550', 'Florence', 1607),
 ('Q50327630', 'Rome', 1700),
 ('Q50330028', 'Rome', 1680),
 ('Q55897192', 'Narni', 1872),
 ('Q65019765', 'Afragola', 1853),
 ('Q51845316', 'Genoa', -1),
 ('Q60838260', 'Tuscany', -1),
 ('Q64433131', 'Castel Goffredo', -1),
 ('Q71684946', "Trezzo sull'Adda", -1),
 ('Q93338246', 'Viterbo', -1),
 ('Q59187521', 'Padua', 1850),
 ('Q61136330', 'Osilo', 1865),
 ('Q61450547', 'Siderno', 1847),
 ('Q52107491', 'Rome', 1680),
 ('Q61790603', 'Veneto', 1857),
 ('Q61791394', 'Piedimonte del Calvario', 1905),
 ('Q61827513', 'Grimacco', 1847),
 ('Q61895377', 'Como', 1731),
 ('Q59851133', 'Gorizia', 1883),
 ('Q59851150', 'Trieste', 1929),
 ('Q62066746', 'Bologna', 1926),
 ('Q66736238', 'Lucca', 1635),
 ('Q66921487', 'Alghero', 1914),
 ('Q85421610', 'Alghero', 1869),
 ('Q61080035', 'Greci', 1830),
 ('Q87068357', 'Polla', 1970),
 ('Q64031897', 'Castel Goffredo', 1445),
 ('Q64167386', 'Sambiase', 1930),
 ('Q64364409', 'Corleone', 1752),
 ('Q69818426', 'Galati Mamertino', 1923),
 ('Q64512266', 'Rieti', 1890),
 ('Q78162153', 'Agrigento', 1985),
 ('Q78499894', 'San Giorgio a Cremano', 1932),
 ('Q88264630', 'Bari', 1930),
 ('Q89674973', 'Genoa', 1939),
 ('Q94998318', 'Trieste', 1939),
 ('Q94325725', 'Orune', 1908),
 ('Q84138681', 'Rome', -1),
 ('Q98102965', 'Florence', -1),
 ('Q80705985', 'Patti', 1879),
 ('Q81100287', 'Lecco', 1935),
 ('Q81738068', 'Rogolo', 1940),
 ('Q83643534', 'Alessandria', 1717),
 ('Q83808244', 'Cancello e Arnone', 1921),
 ('Q96097742', 'Milan', 1982),
 ('Q96245247', 'Lecce', 1919),
 ('Q96248786', 'Guardia Lombardi', 1970),
 ('Q95485499', 'Borgo San Martino', 1907),
 ('Q99196947', 'Tivoli', 1973)]