When the same content is stored multiple times, data quality is enhanced by extracting the most valid variant of this content, and store it one single time. Deduplicating content in this way enhances data quality, because users will not accidentally use low quality variantions, and data maintenance can focus on fixing and sustaining one instead of multiple versions.
Many collection items have information about their width
iisg:widthInCm and their height
iisg:heightInCm readily available in their properties. While most images do have their values inserted some miss either the height or the width of the image while this is available in the property
iisg:size. As shown in the table below, these items miss either their
iisg:heigthInCm or their
iisg:widthInCm while the
iisg:size is always known.