When the same content is stored multiple times, data quality is enhanced by extracting the most valid variant of this content, and store it one single time. Deduplicating content in this way enhances data quality, because users will not accidentally use low quality variantions, and data maintenance can focus on fixing and sustaining one instead of multiple versions.
Many collection items have information about their width iisg:widthInCm
and their height iisg:heightInCm
readily available in their properties. While most images do have their values inserted some miss either the height or the width of the image while this is available in the property iisg:size
. As shown in Table 1 below, these items miss either their iisg:heigthInCm
or their iisg:widthInCm
while the iisg:size
is always known.