Abstract
"Data hugging" blocks independent verification of medical AI. Apple claims to estimate age from photoplethysmographic (PPG) signals with a mean absolute error of 2.9 years. Given how noisy PPG signals are, such accuracy is questionable, and it raises concerns about similar claims made by other technology companies. Using UK Biobank data, we were unable to replicate this accuracy: our results were only marginally better than predicting the mean age for every participant. We advocate for curated public benchmark datasets and evaluation platforms to protect the public from unverifiable claims.