Quality Comparison: Person Entities

This overview shows the quality improvements that have been made to the person entities in the IISG Knowledge Graph.

Figure 1 shows the data prior to making the quality improvements (2017 version). As shown in the diagram, there are many persons with more than one identifier. The person identifiers are collected per combination of person name, title, and ‘word & dates’ property. The diagram only shows the persons with the largest number of works attributed to them.

Figure 1 ― Persons with more than one identifier before quality improvements.

Figure 2 shows the same quality aspects, but for the current version of the data. A value of 1 for number_of_occurrences (red line) means that there are no duplicates for the corresponding person (displayed on the horizontal axis). While there are still some duplicates left, many duplicates have been resolved.

Figure 2 ― Persons with more than one identifier after quality improvements.

Table 1 enumerates all persons that have more than one identifier in the latest version of the data.

Table 1 ― Full overview of persons with more than one identifier.