Self tracking

Self tracking#

Image: Xiaomi Redmi Smart Band Pro.

May, 2024

Data Analysis Hypothesis testing

Background#

Lately, I had the feeling that I wasn’t resting very well at night. Could it be because of the coffee I drink every day? Normally, it’s just three cups a day (okay, sometimes four), and I’ve been drinking it for many years, in the morning and in the afternoon. It may have some effect, so I decided to conduct a little experiment.

I decided to track the data of my physical activity, my heart rate, and my sleep patterns with my smart wristband in two different periods: during the first one, I would continue drinking coffee as usual, and in the second one, I would stop drinking it altogether (I would stop consuming caffeine). The goal was to see if I noticed any difference in my sleep patterns between these two periods.

So, I put on my Xiaomi activity tracker and didn’t take it off for 6 weeks. The first four weeks were normal (caffeinated), and the last two were the trial ones (decaffeinated).

Yes, I know, I should have extended the coffee-free period to 4 weeks to have as much a number of samples in both cases and give the experiment enough time. But it was hard, especially the first week: I noticed headaches and a lack of energy. The second week was better but I missed my beloved coffee, the punch of it in the morming, its taste having it with someone in a cafe. I think I did enough by abstaining for half a month. I love science, but not as much as coffee.

The data#

In my personal Xiaomi account, I requested the data registered by the device. Among the CSV files they provide you with, I focused on the ones aggregated by day (daily reports).

Show code cell source Hide code cell source

import datetime
import json

import matplotlib.dates as mdates
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns


# Define some user functions for statistical tests
def bootstrap_replicate_1d(data, func):
    """Generate bootstrap replicate of 1D data."""
    bs_sample = np.random.choice(data, len(data))

    return func(bs_sample)


def draw_bs_reps(data, func, size=1):
    """Draw bootstrap replicates."""

    # Initialize array of replicates: bs_replicates
    bs_replicates = np.empty(size)

    # Generate replicates
    for i in range(size):
        bs_replicates[i] = bootstrap_replicate_1d(data, func)

    return bs_replicates


def permutation_sample(data1, data2):
    """Generate a permutation sample from two data sets."""

    # Concatenate the data sets: data
    data = np.concatenate((data1, data2))

    # Permute the concatenated array: permuted_data
    permuted_data = np.random.permutation(data)

    # Split the permuted array into two: perm_sample_1, perm_sample_2
    perm_sample_1 = permuted_data[: len(data1)]
    perm_sample_2 = permuted_data[len(data1) :]

    return perm_sample_1, perm_sample_2


def draw_perm_reps(data_1, data_2, func, size=1):
    """Generate multiple permutation replicates."""

    # Initialize array of replicates: perm_replicates
    perm_replicates = np.empty(size)

    for i in range(size):
        # Generate permutation sample
        perm_sample_1, perm_sample_2 = permutation_sample(data_1, data_2)

        # Compute the test statistic
        perm_replicates[i] = func(perm_sample_1, perm_sample_2)

    return perm_replicates


def diff_of_means(data_1, data_2):
    """Difference in means of two arrays."""

    # The difference of means of data_1, data_2: diff
    diff = np.mean(data_1) - np.mean(data_2)

    return diff


# Read file of aggregated data
band = pd.read_csv(
    "data/20240430_8158242397_MiFitness_hlth_center_aggregated_fitness_data.csv",
    usecols=["Tag", "Key", "Time", "Value"],
)

# Get daily reports only
band = band.loc[band["Tag"] == "daily_report", :].drop("Tag", axis=1)

# Convert time from Unix timestamps: the number of seconds since January 1, 1970.
band["Time"] = pd.to_datetime(band["Time"], unit="s")
band = band.rename(columns={"Time": "date"})

# Filter date range of interest for this project: [2024-03-19 -> 2024-04-30]
band = band.loc[band["date"].between("2024-03-19", "2024-04-30"), :]

band.head()

	Key	date	Value
249	valid_stand	2024-03-19	{"count":7}
250	valid_stand	2024-03-20	{"count":13}
251	valid_stand	2024-03-21	{"count":7}
252	valid_stand	2024-03-22	{"count":9}
253	valid_stand	2024-03-23	{"count":10}

The available data corresponds to:

['valid_stand', 'steps', 'spo2', 'sleep', 'heart_rate', 'calories']

An important aspect of the experiment was to consistently maintain my usual habits, such as daily exercise. I go for a walk every day, and the following step-count record demonstrates that I maintained the habit consistently and at the same level throughout the process.

	date	steps
512	2024-03-19	10462
513	2024-03-20	6586
514	2024-03-21	2082
515	2024-03-22	4815
516	2024-03-23	4129

../_images/0af1ee17d952c48a0a2df9ae7475d8c16d6e17549fa26b79d42539e610f51396.png

Therefore, I kept my physical activity more or less the same during this period. Let’s see how resting heart rate, one of the aspects that can influence sleep, evolved over those weeks.

Heart rate#

The device measures several parameters of heart rate such as maximum, minimum, and daily average, but the interesting one is the resting heart rate because supposedly it is independent of the physical activity carried out during the day.

	date	avg_rhr
888	2024-03-19	56
889	2024-03-20	57
890	2024-03-21	54
891	2024-03-22	57
892	2024-03-23	54

../_images/2d835791403cec7525789bf3198e80e12a67aafa1989862d9593a85360b6d353.png

Just in case, I check if there’s any correlation between resting heart rate and the steps taken that particular day, and as expected I find that there’s no relationship at all.

	steps	avg_rhr
steps	1.000000	0.045974
avg_rhr	0.045974	1.000000

I’m going to separate the data into two groups, 28 measures corresponding to the four normal weeks and 14 measures from the two caffeine-free weeks.

../_images/33de561fbda186ae67845299317833512d6a91d1c5accecc9b56a1da252412d9.png

Let’s resample and take the means to graph their distribution.

../_images/98a2946c41d56411454b3c9bbe1e3bab0ccc4871ee99f598d67ae5236bfb240c.png

Caffeinated mean: 55.9 beats per minute
Decaffeinated mean: 57.6 beats per minute
Difference in means -> 1.7

There is some overlapping between the distributions. Does that mean that the observed difference is not signifficant? We will proceed to conduct two hypothesis tests:

Bootstrapping considering equal mean values as null hypothesis-H0.
Random permutational test as as null hypothesis-H0.

Let’s start with the bootstrapping:

../_images/29792bfc62288a925ae12410581c78e6b3ffbd7da8046b2eb09e42da9cf3ae8b.png

p-bootstrp = 0.0028

So after centering both sample distributions around the same mean and bootstrapping 10000 times, only 28 times the difference between mean values of each group was equal or higher than the observed difference. That is to say, the probability of observing this difference if stopping the caffeine intake had no influence in the resting heart rate would be of just 0.28 %.

Let’s see what happens with the random permutational test, which is more restrictive that the previous bootstrapping test.

../_images/4b0e2987a93f4493b9fb55e5ccc13aec7e0e39453399d97f2c67d4bc800a0543.png

p-perm = 0.0143

So after randomly scrambling (permutating) the data from the two groups 10000 times and making two groups each time, 143 times the difference between mean values of each group was equal or higher than the difference among the original groups. That is to say, the probability of observing this difference if stopping the caffeine intake had no influence in the resting heart rate would be of just 1.43 %.

Therefore, does this mean that quitting caffeine increases resting heart rate? Um, in any case I would have said the opposite: wasn’t it the intake of caffeine that increased it and not the other way around?

This is where I realize that perhaps I should have allowed more time for the new caffeine-free situation to stabilize in my body. Maybe this is nothing more than the result of the transition when changing states.

But then, this is just a statistical test. If we had to make a decision about it, we would take this result into account, considering time restraints. But contemplating it as proof of a fact of an existing reality… that is more complicated. Firstly, there is this already mentioned transitory effect, I doubt that the samples are sufficiently representative. Then, the observed difference is tiny, at around the resolution of the measure. And then, many factors I was anaware of must have intervened during the days when they were taken. I am inclined to think that chance has had a lot to do with this result. But it’s just my impression.

Sleep#

Let’s take a look at sleep patterns.

	date	sleep_deep_duration	sleep_light_duration	sleep_rem_duration	total_duration
736	2024-03-20	148	252	93	493
737	2024-03-21	214	303	26	543
738	2024-03-22	147	293	26	466
739	2024-03-23	197	340	48	585
740	2024-03-24	131	320	91	542

../_images/9c26c7abe19019e37beed716ef632f487551e70eb03ada584bbed9c6187b2262.png

I have to warn that I have noticed that the Xiaomi smartband does not accurately monitor sleep. For example, what happened on April 20th, when it recorded that I slept for 677 minutes, which is 11 hours and 17 minutes? The date corresponds to a Saturday, and the night before, Friday, I went to the cinema. If I look at the mobile app, I see the device considered that I had fallen asleep at 8:43 PM, just when I was watching the movie. It could be, yes, that I had fallen asleep. But no, that was not the case, the film was good and I remember it all. The movie was so good that I watched it without hardly moving throughout its entire duration. This is what must have confused the device, which thought I had fallen asleep (two REM stages included!). In the next screenshot from the mobile app, I can clearly see this misconstruction.

This type of error has also occurred on several days in the evening, when I lie down on the sofa at home to read before going to bed. Therefore, the hours counted as sleep are more than what actually happened. In any case, the study will focus on the proportions of sleep phases so it will not influence the result that much.

../_images/6706724915292816b3e387901423835a826748eee2de1a0ef49d8bb8c97f854b.png

The total sleep time is divided into the three usual types: light sleep, deep sleep, and REM sleep. It’s interesting to see to what extent they occur in the appropriate proportions each night. The Xiaomi app recommends these:

Light: between 20% and 60% of the total sleep time.
Deep: between 20% and 40% of the total sleep time.
REM: between 20% and 40% of the total sleep time.

Let’s see if they stay inside the margins.

Show code cell source Hide code cell source

# Recommended ratios with respect to sleep total duration
deep_min_r = 0.2
deep_max_r = 0.4
light_min_r = 0.2
light_max_r = 0.6
rem_min_r = 0.1
rem_max_r = 0.3

# Create new columns with max and min values for each sleep phase
sleep.loc[:, "deep_min"] = sleep["total_duration"] * deep_min_r
sleep.loc[:, "deep_max"] = sleep["total_duration"] * deep_max_r
sleep.loc[:, "light_min"] = sleep["total_duration"] * light_min_r
sleep.loc[:, "light_max"] = sleep["total_duration"] * light_max_r
sleep.loc[:, "rem_min"] = sleep["total_duration"] * rem_min_r
sleep.loc[:, "rem_max"] = sleep["total_duration"] * rem_max_r

# Subset dataframe for independent plotting
deep = sleep.loc[:, ["date", "sleep_deep_duration", "deep_min", "deep_max"]]
light = sleep.loc[:, ["date", "sleep_light_duration", "light_min", "light_max"]]
rem = sleep.loc[:, ["date", "sleep_rem_duration", "rem_min", "rem_max"]]

# Plot
fig, ax = plt.subplots(3, 1, figsize=(6, 12))
sns.lineplot(
    ax=ax[0],
    x="date",
    y="sleep_light_duration",
    data=light,
    marker="o",
    linewidth=0.5,
    color=colors[0],
    label="Light",
)
sns.lineplot(
    ax=ax[1],
    x="date",
    y="sleep_deep_duration",
    data=deep,
    marker="o",
    linewidth=0.5,
    color=colors[1],
    label="Deep",
)
sns.lineplot(
    ax=ax[2],
    x="date",
    y="sleep_rem_duration",
    data=rem,
    marker="o",
    linewidth=0.5,
    color=colors[2],
    label="REM",
)

for i in range(3):
    ax[i].grid(axis="y", alpha=0.3)
    ax[i].set_axisbelow(True)
    ax[i].tick_params(axis="x", labelsize=10, rotation=0)
    ax[i].tick_params(axis="y", labelsize=10)
    ax[i].set_title("", size=12)
    ax[i].set_xlabel("")
    ax[i].set_ylabel("minutes", size=10)
    ax[i].set_ylim(0, 450)
    ax[i].xaxis.set_major_formatter(mdates.DateFormatter("%d/%m"))
    ax[i].set_xlim([datetime.date(2024, 3, 19), datetime.date(2024, 5, 1)])
    ax[i].axvspan(
        datetime.date(2024, 4, 17),
        datetime.date(2024, 4, 30),
        facecolor="grey",
        alpha=0.15,
    )

h, l = ax[0].get_legend_handles_labels()
ax[0].legend(
    h,
    ["Light"],
    bbox_to_anchor=(1.0, 0.5),
    loc="center left",
    fontsize=10,
    frameon=False,
)
h, l = ax[1].get_legend_handles_labels()
ax[1].legend(
    h,
    ["Deep"],
    bbox_to_anchor=(1.0, 0.5),
    loc="center left",
    fontsize=10,
    frameon=False,
)
h, l = ax[2].get_legend_handles_labels()
ax[2].legend(
    h, ["REM"], bbox_to_anchor=(1.0, 0.5), loc="center left", fontsize=10, frameon=False
)

ax[0].fill_between(
    sleep["date"], sleep["light_max"], sleep["light_min"], color=colors[0], alpha=0.3
)
ax[1].fill_between(
    sleep["date"], sleep["deep_max"], sleep["deep_min"], color=colors[1], alpha=0.3
)
ax[2].fill_between(
    sleep["date"], sleep["rem_max"], sleep["rem_min"], color=colors[2], alpha=0.3
)

sns.despine()

plt.show()

../_images/c122dbbdf29f50cccd281292ad4c677dd321b500dbf4676ca215a2bf35db36e3.png

While deep sleep seems to typically fall within the recommended ranges, light sleep appears to lean towards the higher end and REM towards the lower end.

In the following graph, I show it in terms of ratio.

../_images/d36362be3a87efe54852228f79cc00bf06c6c70ff957361f8c432cfccd0a2fef.png

I’m clearly experiencing less REM sleep at the expense of light sleep. Could this be the reason why I feel like I haven’t been sleeping very well lately? And does coffee have anything to do with it? From what I see in these graphs, it doesn’t seem like anything has changed from not having caffeine, but just in case, let’s take a look.

First at REM sleep:

../_images/bab684304665e07230cc6f32c5537101ac473318eb60d682cbeb55dc2e4a67a6.png

../_images/1ed0560a458e6d1e45b67f078304f71e62c8a58665222b5d76cd494726857bd2.png

Caffeinated mean: 0.09
Decaffeinated mean: 0.12
Difference in means observed -> 0.03

Decaffeinated days improve the observed REM sleep ratio but the difference is very small and the overlapping means that it is not signifficant. There is no need to do a hypothesis test.

Let’s do the same with deep sleep:

../_images/9751dd9a9eea1ee6e2e4b892fb560bf10489bc23d7350aba10c08827d1e34f81.png

../_images/bf444031fe9c89b88c526ba67d8be46539765000914c2f582e1df5fca87193aa.png

Caffeinated mean: 0.32
Decaffeinated mean: 0.29
Difference in means observed -> 0.03

Similarly, the difference in deep sleep does not seem significant.

Conclusions#

I couldn’t conclude that coffee has anything to do with sleep quality. At least not with the current measuring device, the Xiaomi monitoring band is limited. Furthermore, I believe that to draw any conclusion, I should expand the number of days of the experiment, but as I said, I am not willing to do so.

Anyway, I’m glad I documented and studied this data. I’ll keep looking for the reason behind my sleep patterns to find out the cause.