This overview summarizes the quality improvements made to the person entities in the IISG Knowledge Graph.
Figure 1 shows the data before the quality improvements were made (2017 version). As the diagram shows, many persons have more than one identifier. Person identifiers are collected per combination of person name, title, and ‘word & dates’ property. The diagram shows only the persons with the largest number of works attributed to them.
Figure 2 shows the same quality aspects for the current version of the data. A value of 1 for number_of_occurrences (red line) means that the corresponding person (displayed on the horizontal axis) has no duplicates. Some duplicates remain, but many have been resolved.
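
The duplicate check described above can be sketched as follows. This is a minimal illustration, not the procedure actually used for the IISG Knowledge Graph: it assumes that number_of_occurrences is the number of distinct identifiers found for one combination of person name, title, and ‘word & dates’ value, and the function names and sample records are hypothetical.

```python
from collections import defaultdict


def count_identifiers(records):
    """Count distinct identifiers per (name, title, words_and_dates) combination.

    Each record is a tuple (identifier, name, title, words_and_dates).
    The returned count plays the role of number_of_occurrences (assumption).
    """
    ids_per_person = defaultdict(set)
    for identifier, name, title, words_and_dates in records:
        ids_per_person[(name, title, words_and_dates)].add(identifier)
    return {key: len(ids) for key, ids in ids_per_person.items()}


def find_duplicates(records):
    """Keep only the combinations that have more than one identifier."""
    counts = count_identifiers(records)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical sample data, invented for illustration only.
records = [
    ("p1", "Jansen, Anna", None, "1901-1980"),
    ("p2", "Jansen, Anna", None, "1901-1980"),
    ("p3", "Visser, Piet", "Dr.", "1920-"),
]

counts = count_identifiers(records)
duplicates = find_duplicates(records)
```

In this sketch a count of 1 corresponds to a person without duplicates, and the result of find_duplicates corresponds to the kind of listing given in Table 1.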
Table 1 lists all persons who have more than one identifier in the latest version of the data.