Median Ages#

../_images/5092cc583c1ddc1b02a6e7b86cac9636fd185a698a9f94f90cd1ca61947c0e01.png

Population pyramid of Niger (left) and Japan (right) in 2022.

Apr, 2023

Interactive visualization

Background#

Given a certain group, how many people are older than you? And how many are younger than you? When you are a newborn, everybody else is older than you. When you are the oldest one, everybody else is younger than you. In between, there will be a percentage of people who are older than you, and a complementary percentage who are younger than you.

The median age is defined so that 50% of the people are older and 50% are younger. The global median age was about 30 years in 2021: half of the world population was older than 30, and the other half was younger. Japan has one of the highest median ages, at almost 49 years. One of the lowest is Niger's, at about 15 years. The median age in Spain is around 44 years.

When I turned 49 last year, it was clear to me that I had already passed the midpoint of my life (life expectancy here is about 84 years). But the population of my town is aging, and I wondered where I stood among my fellow citizens. Even if I was past the middle of my life, I might still be around the median age!

The data#

Instead of downloading the CSV file, I passed its URL on https://www.gipuzkoairekia.eus/ to read the data directly: the population of Urretxu in 2022 by age, gender and neighbourhood.

         NOMBRE CALLE EDAD  CANTIDAD MUJERES  CANTIDAD HOMBRES
          AREIZAGA    1                 1                 0
          AREIZAGA    2                 2                 0
          AREIZAGA    3                 0                 2
          AREIZAGA    4                 2                 0
          AREIZAGA    5                 0                 2
...               ...  ...               ...               ...
BASAGASTI KALEA   78                 1                 0
BASAGASTI KALEA   79                 0                 0
BASAGASTI KALEA   80                 0                 0
BASAGASTI KALEA  >80                 2                 1
BASAGASTI KALEA  000                 0                 2

[1804 rows x 4 columns]
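
A table like the one above can be read with `pandas.read_csv`, which accepts a URL as well as a local path or a file-like buffer. The following is only a minimal sketch: the exact URL of the dataset is not given here, so a small inline sample (a few rows copied from the table above) stands in for it, and the comma separator and the helper name `load_population` are assumptions.

```python
import io

import pandas as pd

# Stand-in for the remote file: a few rows copied from the table above.
# The comma separator is an assumption; the real file may use another one.
SAMPLE_CSV = """NOMBRE CALLE,EDAD,CANTIDAD MUJERES,CANTIDAD HOMBRES
AREIZAGA,1,1,0
AREIZAGA,2,2,0
BASAGASTI KALEA,80,0,0
BASAGASTI KALEA,>80,2,1
BASAGASTI KALEA,000,0,2
"""


def load_population(source):
    """Read the population table (hypothetical helper).

    `source` may be a URL, a local path or a file-like object.
    EDAD is kept as text because it contains the label ">80".
    """
    return pd.read_csv(source, dtype={"EDAD": str})


df = load_population(io.StringIO(SAMPLE_CSV))
print(df)
```

With the real dataset, the URL of the CSV file would be passed to `load_population` instead of the in-memory buffer.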

Data validation#

Unfortunately, everyone over the age of 80 is aggregated under the label “>80”. I will replace that label with 81 and convert the column to integers, which means treating everyone over 80 as if they were 81 years old (yes, it will look strange in the graphics, but no further information is available).

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1804 entries, 0 to 1803
Data columns (total 4 columns):
 #   Column            Non-Null Count  Dtype 
---  ------            --------------  ----- 
 0   NOMBRE CALLE      1804 non-null   object
 1   EDAD              1804 non-null   int32 
 2   CANTIDAD MUJERES  1804 non-null   int64 
 3   CANTIDAD HOMBRES  1804 non-null   int64 
dtypes: int32(1), int64(2), object(1)
memory usage: 49.5+ KB
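
The relabelling described above can be sketched as follows. The small dataframe is illustrative only (its rows mimic the tail of the table shown earlier), and the resulting integer dtype may differ from the `int32` reported above depending on the platform.

```python
import pandas as pd

# Illustrative rows only: EDAD arrives as text because of the ">80" label.
df = pd.DataFrame({
    "NOMBRE CALLE": ["BASAGASTI KALEA"] * 4,
    "EDAD": ["79", "80", ">80", "000"],
    "CANTIDAD MUJERES": [0, 0, 2, 0],
    "CANTIDAD HOMBRES": [0, 0, 1, 2],
})

# Treat everyone over 80 as 81, then turn the whole column into integers.
df["EDAD"] = df["EDAD"].replace(">80", "81").astype(int)

df.info()
```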

Population pyramid#

Let’s build the population pyramid for my town.

          CANTIDAD MUJERES  CANTIDAD HOMBRES
interval                                    
[0, 4]                  88                95
[5, 9]                 110               124
[10, 14]               196               205
[15, 19]               194               212
[20, 24]               180               203
[25, 29]               153               157
[30, 34]               140               140
[35, 39]               138               175
[40, 44]               195               204
[45, 49]               301               288
[50, 54]               260               274
[55, 59]               272               289
[60, 64]               235               260
[65, 69]               206               176
[70, 74]               180               161
[75, 79]               167               137
[80, 84]               314               174

../_images/836b1d9f9f3e995c9f88a52321c4715e241126de4213fc6e5c1ac90298c1c314.png
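
A five-year grouping like the one in the table above could be produced along these lines. This is a sketch under assumptions, not necessarily how the table was built: it uses `pandas.cut` with half-open bins and hand-made labels, and the toy data is invented for illustration.

```python
import pandas as pd

# Toy data for illustration; ages are already integers (">80" mapped to 81).
df = pd.DataFrame({
    "EDAD": [0, 3, 4, 5, 9, 12, 81],
    "CANTIDAD MUJERES": [1, 2, 0, 3, 1, 4, 5],
    "CANTIDAD HOMBRES": [2, 0, 1, 1, 2, 3, 2],
})

# Bin edges 0, 5, ..., 85 give 17 five-year groups, the last being [80, 84].
edges = list(range(0, 90, 5))
labels = [f"[{lo}, {lo + 4}]" for lo in edges[:-1]]

# right=False makes each bin include its lower edge and exclude its upper edge.
df["interval"] = pd.cut(df["EDAD"], bins=edges, right=False, labels=labels)

pyramid = (
    df.groupby("interval", observed=False)[["CANTIDAD MUJERES", "CANTIDAD HOMBRES"]]
    .sum()
)
print(pyramid)
```

The two resulting columns can then be drawn as horizontal bars on opposite sides of a common axis to obtain the pyramid.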

Certainly, this pyramid is closer to the Japanese one than to the pristine one of Niger. In fact, it looks like a house of cards about to fall apart, with a sort of beret on top. The beret is due to the aggregation of the elderly mentioned earlier. It is strange that the individual ages of those over 80 are not reported, because they make up a large group, and the omission feels inconsiderate nowadays. Among women, the top group in the pyramid is the largest in town.

Median age#

To calculate the median age, I first add men and women into a total, then group by age and sum the totals into a new dataframe.

    TOTAL
    22
    49
    31
    39
    42
..    ...
   53
   58
   41
  447
    0

[83 rows x 1 columns]

Total population in 2022 -> 6603
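
The aggregation just described can be sketched as follows, assuming the column names of the dataset; the toy rows and the street names "A" and "B" are invented for illustration.

```python
import pandas as pd

# Toy data: two streets, a few ages each.
df = pd.DataFrame({
    "NOMBRE CALLE": ["A", "A", "B", "B", "B"],
    "EDAD": [0, 1, 0, 1, 81],
    "CANTIDAD MUJERES": [1, 2, 0, 1, 2],
    "CANTIDAD HOMBRES": [0, 1, 2, 0, 1],
})

# Total per row, then one row per age across all streets.
df["TOTAL"] = df["CANTIDAD MUJERES"] + df["CANTIDAD HOMBRES"]
by_age = df.groupby("EDAD")[["TOTAL"]].sum()

total_population = int(by_age["TOTAL"].sum())
print(by_age)
print(f"Total population -> {total_population}")
```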

Now I am going to calculate the number (and percentage) of people younger and older for each age.

    TOTAL  younger   older  younger_%  older_%
    22      0.0  6603.0        0.0    100.0
    49     22.0  6581.0        0.0    100.0
    31     71.0  6532.0        1.0     99.0
    39    102.0  6501.0        2.0     98.0
    42    141.0  6462.0        2.0     98.0
..    ...      ...     ...        ...      ...
   53   6004.0   599.0       91.0      9.0
   58   6057.0   546.0       92.0      8.0
   41   6115.0   488.0       93.0      7.0
  447   6156.0   447.0       93.0      7.0
    0   6603.0     0.0      100.0      0.0

[83 rows x 5 columns]
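
The columns above can be reproduced with a cumulative sum. The sketch below follows what the figures in the table imply: `younger` counts people strictly younger than a given age, and `older` is everyone else, so it includes people of that same age. The toy totals are invented for illustration.

```python
import pandas as pd

# Toy totals per age (index = age in years).
by_age = pd.DataFrame(
    {"TOTAL": [3, 4, 2, 1]},
    index=pd.Index([0, 1, 2, 3], name="EDAD"),
)
total = by_age["TOTAL"].sum()

# People strictly younger: running total of all previous ages.
by_age["younger"] = by_age["TOTAL"].cumsum().shift(1, fill_value=0)
# Everyone else, including people of the same age (as in the table above).
by_age["older"] = total - by_age["younger"]

# Percentages rounded to whole numbers.
by_age["younger_%"] = (100 * by_age["younger"] / total).round()
by_age["older_%"] = (100 * by_age["older"] / total).round()
print(by_age)
```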

Finally, let’s find the median age: the first age at which the percentage of people older than you drops below 50%.

Median age -> 49 years
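
The rule stated above can be sketched as a simple lookup on the `older_%` column. The values below are toy numbers, not the real ones, and the code assumes the index is sorted by age.

```python
import pandas as pd

# Toy percentages per age (index = age in years, sorted ascending).
by_age = pd.DataFrame(
    {"older_%": [100.0, 70.0, 30.0, 10.0]},
    index=pd.Index([0, 1, 2, 3], name="EDAD"),
)

# Ages at which fewer than half of the people are older.
below_half = by_age.index[by_age["older_%"] < 50]

# The first such age is taken as the median age.
median_age = int(below_half[0])
print(f"Median age -> {median_age} years")
```

Note that, as in the table, `older_%` includes people of your own age, so this rule can return an age one year higher than a median computed from the individual ages.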

Here we have it: when I turned 49 last year (2022), I also reached the median age of my town!

So I am not that old, considering.