Abstract
Measuring and analyzing personal light exposure has become increasingly important in circadian and myopia research. Very small measurement values in light exposure patterns, especially zero, are regularly recorded in field studies. These zero-lux values are problematic for commonly applied logarithmic transformations and should neither be dismissed nor be unduly influential in visualizations and statistical modeling. We compare 4 ways to visualize such data on a linear, logarithmic, hybrid, or symlog scale, and we model the light exposure patterns with a generalized additive model by removing zero-lux values, adding a very small or -1 log(10) lux value to the dataset, or using the Tweedie error distribution. We show that a symlog-transformed visualization, implemented in LightLogR, displays relevant features of light exposure across scales, including zero-lux, while reducing the emphasis on the small values (<1 lux). Symlog is well-suited to visualize differences in light exposure covering heavy-tailed negative values. We further show that small but not negligible value additions to the light exposure data of -1 log(10) lux for statistical modeling allow for acceptable models on a logarithmic scale, while very small values distort results. We also demonstrate the utility of the Tweedie distribution, which does not require prior transformations, models data on a logarithmic scale, and includes zero-lux values, capturing personal light exposure patterns satisfactorily. Data from field studies of personal light exposure require appropriate handling of zero-lux values in a logarithmic context. Symlog scales for visualizations and an appropriate addition to input values for modeling, or the Tweedie distribution, provide a solid basis. Beyond light exposure, other time-series data relevant to biological rhythms, such as accelerometry for ambulatory sleep scoring in humans or wheel-running in animal models, exhibit zero inflation and can benefit from the methods introduced here.