Poorly represented and opaque data reduces the generalizability of models, creating a direct health risk: in a diverse ...