
Wikidata/human/identifier/statistics

From annawiki
Revision as of 2026-03-30T21:52:14 by Tobiasco


Coverage of entities

https://qlever.dev/wikidata/N6iD5k 2026-03-25

The table shows data for the following IDs or groups of IDs ("ID" is used here, and sometimes below, as short for "ID system"): ISNI, ORCID, VIAF (VIAF cluster), ten VIAF contributors, the CBDB (Chinese Biographical Database) and the three most used genealogical identifiers. GND is the VIAF contributor used most on human entities; its subproperties are shown directly after it. The IDs can also be used on entity types other than humans. The most type-restricted are ISNI, ORCID, CBDB and the genealogical identifiers, while the others may also cover works and classes.

The query may not return exact numbers because of insufficient grouping. Nevertheless, the data exposes differences between the IDs.

Topics to be covered by other queries:

  1. Ratio of humans among the entities carrying each identifier (ISNI higher than GND and VIAF)
  2. Quantity of humans (a) not covered by any of the identifiers listed, (b) covered by ISNI, ORCID, GND, VIAF
  3. Quantity of sitelinks, external identifiers and statements on entities that have a given ID

There are 13,099,079 items about humans in total (row 1). The IDs with the largest coverage of these are ISNI (2,136,019), ORCID (1,953,007), VIAF (3,837,770) and GND (2,463,140) in rows 2 to 5, and LC (1,407,979) in row 14.
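To put these counts in proportion, the sketch below divides each ?qty_e value by the row 1 total. The resulting shares are derived figures that do not appear in Table 1, and the calculation assumes that ?qty_e counts distinct human items, which the grouping caveat above may not fully guarantee. The function name is illustrative.

```python
TOTAL_HUMANS = 13_099_079  # ?qty_e in row 1 of Table 1

# ?qty_e per ID, copied from Table 1 (rows 2 to 5 and row 14)
QTY_E = {
    "ISNI": 2_136_019,
    "ORCID": 1_953_007,
    "VIAF/cluster": 3_837_770,
    "GND": 2_463_140,
    "LC": 1_407_979,
}


def coverage_share(qty_e, total):
    """Fraction of all human items that carry each ID (assumes distinct items)."""
    return {prop: n / total for prop, n in qty_e.items()}


shares = coverage_share(QTY_E, TOTAL_HUMANS)

# Print the IDs from largest to smallest share of human items.
for prop, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{prop}: {share:.1%}")
```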

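VIAF has the highest value for distinct IDs per item (1.021021843414). For IDs per item, VIAF (1.030991174562) is exceeded by BNE (1.046335445148), which has the highest value, and by SBN, ISNI, GND/FID/PA and GND/JudaicaLink. WikiTree is the only ID with a value below 1 for distinct IDs per item (0.9988), which indicates that the same ID is used on different items.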
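The per-item columns in Table 1 are consistent with simple quotients of the count columns: ?did_per_e = ?qty_did / ?qty_e and ?id_per_e = ?qty_id / ?qty_e. This is an inference from the table values, not taken from the query text. A minimal Python sketch that recomputes them for three rows copied from Table 1 (the function name is illustrative):

```python
# Rows copied from Table 1: (prop, qty_e, qty_did, qty_id)
ROWS = [
    ("VIAF/cluster", 3_837_770, 3_918_447, 3_956_707),
    ("BNE", 224_925, 229_211, 235_347),
    ("WikiTree", 411_654, 411_146, 413_981),
]


def per_item(rows):
    """Return {prop: (distinct IDs per item, IDs per item)}."""
    return {
        prop: (qty_did / qty_e, qty_id / qty_e)
        for prop, qty_e, qty_did, qty_id in rows
    }


ratios = per_item(ROWS)

for prop, (did_per_e, id_per_e) in ratios.items():
    # Fewer distinct IDs than items means some ID value sits on several items.
    note = " (some IDs shared between items)" if did_per_e < 1 else ""
    print(f"{prop}: {did_per_e:.4f} distinct, {id_per_e:.4f} total{note}")
```

A distinct-ID ratio below 1 can only arise when fewer distinct ID values exist than items carrying them, which is why it points to reuse of one ID across items.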

The ISNI ratio (?p213_per_e) is 0.1648 in total. By ID it is: ORCID 0.05989, VIAF 0.5738, GND 0.6339, LC 0.8453, CBDB 0.006707 (lowest overall), Genealogics 0.07137, Geni 0.2345, WikiTree 0.2182. Among the VIAF contributors it is lowest on GND (0.6339), which has a higher total quantity, followed by NLCR (0.6482), which has a lower total quantity, and then LC (0.8453). It is highest on NTA (0.9831). Among the GND subproperties the ISNI ratio is highest on JudaicaLink (0.9790), followed by FID/PA (0.8950), and lowest on RPPD (0.3452), followed by HessBio (0.4231).

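Outside GND and its subproperties, the GND ratio (?p227_per_e) is highest on NTA (0.8063). It is higher than the ISNI ratio only in the total row (0.1895 vs. 0.1648), on the VIAF cluster (0.6629 vs. 0.5738) and on Genealogics (0.09128 vs. 0.07137).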

Among the VIAF contributors other than GND, NLCR has the lowest ISNI ratio (0.6482) and the lowest GND ratio (0.5602).
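Comparisons between the two ratio columns can be checked mechanically. The sketch below uses ?p213_per_e and ?p227_per_e for all rows outside GND and its subproperties, copied from Table 1 and rounded to four decimals. It lists the rows where the GND ratio exceeds the ISNI ratio and finds the VIAF contributor with the lowest value in each column. The variable names are illustrative.

```python
# (ISNI ratio, GND ratio) per row, rounded from Table 1; GND rows are left out.
RATIOS = {
    "-": (0.1648, 0.1895),
    "ISNI": (1.0341, 0.7413),
    "ORCID": (0.0599, 0.0541),
    "VIAF/cluster": (0.5738, 0.6629),
    "LC": (0.8453, 0.6103),
    "IdRef": (0.8966, 0.7137),
    "NLCR": (0.6482, 0.5602),
    "NUKAT": (0.9216, 0.7574),
    "BNF": (0.9656, 0.7461),
    "NTA": (0.9831, 0.8063),
    "PLWABN": (0.8646, 0.7107),
    "SBN": (0.8854, 0.7682),
    "BNE": (0.8961, 0.7313),
    "CBDB": (0.0067, 0.0052),
    "Genealogics": (0.0714, 0.0913),
    "Geni": (0.2345, 0.2293),
    "WikiTree": (0.2182, 0.1898),
}

# VIAF contributors other than GND, in table order.
CONTRIBUTORS = ["LC", "IdRef", "NLCR", "NUKAT", "BNF", "NTA", "PLWABN", "SBN", "BNE"]

# Rows where the GND ratio is higher than the ISNI ratio.
gnd_higher = [prop for prop, (isni, gnd) in RATIOS.items() if gnd > isni]

# Contributor with the lowest ISNI ratio and with the lowest GND ratio.
lowest_isni = min(CONTRIBUTORS, key=lambda p: RATIOS[p][0])
lowest_gnd = min(CONTRIBUTORS, key=lambda p: RATIOS[p][1])

print(gnd_higher, lowest_isni, lowest_gnd)
```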

Table 1

# ?prop ?qty_e ?qty_did ?qty_id ?did_per_e ?id_per_e ?qty_p213 ?p213_per_e ?qty_p227 ?p227_per_e
1 - 13,099,079 - - - - 2,158,808 0.1648060905656 2,482,925 0.1895495858907
2 ISNI 2,136,019 2,153,336 2,208,800 1.008107137624 1.034073198787 2,208,800 1.034073198787 1,583,379 0.7412757096262
3 ORCID 1,953,007 1,956,955 1,958,768 1.002021498131 1.002949810216 116,958 0.05988611407947 105,726 0.05413498261911
4 VIAF/cluster 3,837,770 3,918,447 3,956,707 1.021021843414 1.030991174562 2,202,116 0.57380093127 2,543,961 0.662874794477
5 GND 2,463,140 2,467,866 2,504,381 1.001918689153 1.016743262665 1,561,331 0.63387830168 2,504,381 1.016743262665
6 GND/DDB 1,185,804 1,186,540 1,197,585 1.00062067593 1.009935031422 704,509 0.5941192642292 1,197,557 1.00991141875
7 GND/DtBio 863,855 865,311 875,709 1.001685468047 1.013722210325 502,854 0.5821046356159 875,693 1.013703688698
8 GND/Kalliope 281,876 282,020 286,703 1.000510862933 1.017124551221 198,063 0.702660034909 286,701 1.017117455903
9 GND/FID/PA 67,801 67,826 70,058 1.000368726125 1.033288594564 60,681 0.8949867996047 70,053 1.033214849339
10 GND/JudaicaLink 31,193 31,199 32,207 1.000192350848 1.032507293303 30,537 0.9789696406245 32,207 1.032507293303
11 GND/HessBio 16,267 16,269 16,379 1.0001229483 1.006885104813 6,883 0.4231265752751 16,375 1.006639208213
12 GND/RPPD 11,825 11,825 11,920 1.0 1.008033826638 4,082 0.345200845666 11,920 1.008033826638
13 GND/Saebi 1,823 1,823 1,843 1.0 1.010970927043 1,257 0.6895227646736 1,842 1.010422380691
14 LC 1,407,979 1,413,524 1,434,260 1.003938268966 1.01866576135 1,190,224 0.8453421535406 859,283 0.610295324007
15 IdRef 964,605 967,383 982,861 1.00287993531 1.018925881578 864,866 0.8966011994547 688,392 0.713651701992
16 NLCR 729,223 736,819 748,495 1.010416566674 1.026428129667 472,673 0.6481871800533 408,505 0.5601921497265
17 NUKAT 700,692 702,309 713,555 1.002307718655 1.01835756652 645,747 0.9215846620198 530,706 0.7574026819202
18 BNF 621,793 627,020 640,162 1.008406334584 1.029541985838 600,385 0.9655705355319 463,917 0.746095565566
19 NTA 585,639 589,104 601,936 1.00591661416 1.027827723222 575,763 0.9831363689918 472,185 0.8062731477924
20 PLWABN 415,540 418,450 426,879 1.007002935939 1.027287385089 359,268 0.8645810270973 295,313 0.7106728594118
21 SBN 240,414 243,478 248,844 1.01274468209 1.035064513714 212,867 0.8854184864442 184,687 0.7682040147412
22 BNE 224,925 229,211 235,347 1.019055240636 1.046335445148 201,562 0.8961298210515 164,483 0.7312793153273
23 CBDB 413,145 418,247 418,325 1.012349175229 1.01253797093 2,771 0.006707088310399 2,143 0.005187040869428
24 Genealogics 324,214 325,024 325,667 1.002498349855 1.004481607827 23,140 0.07137261191682 29,594 0.09127921681359
25 Geni 504,968 506,869 509,710 1.003764594984 1.009390694064 118,390 0.2344504998337 115,798 0.229317501307
26 WikiTree 411,654 411,146 413,981 0.9987659539322 1.005652805511 89,832 0.2182220991415 78,117 0.189763733621

Gender and parents

https://qlever.dev/wikidata/jY5NiD

Gender per item:

  1. Total 0.8234. ORCID is below the total (0.3719); all others are above it, the lowest of these being GND (0.9260).

Parents per item:

  1. Mother claims per father claim: highest on the genealogical identifiers (0.62 to 0.86); ISNI and each of the VIAF sources have ca. 0.5; lowest is CBDB (0.18), followed by ORCID (0.3929).
  2. Father per item: highest on the genealogical IDs, followed by CBDB; the highest VIAF source is BNE (0.09046). Lowest is ORCID (0.001131), followed by a GND subproperty (GND/DDB, 0.04138), then NLCR (0.04795), another GND subproperty (GND/RPPD, 0.04820) and GND itself (0.04902).
  3. Father total: the overall total is 1,180,514. The largest per-ID totals are the three genealogical IDs (ca. 240K each), then VIAF (190K), then GND, ISNI, CBDB and LC.
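The last column of Table 2 is consistent with ?p25_per_p22 = ?qty_p25 / ?qty_p22, that is, mother claims divided by father claims. This is an inference from the table values, not taken from the query text. A minimal Python sketch for four rows copied from Table 2 (the function name is illustrative):

```python
# Rows copied from Table 2: (prop, qty_p22 father claims, qty_p25 mother claims)
PARENT_ROWS = [
    ("-", 1_180_514, 734_727),
    ("ORCID", 2_209, 868),
    ("CBDB", 111_694, 20_121),
    ("Genealogics", 234_543, 201_638),
]


def mothers_per_father(rows):
    """Return {prop: mother claims per father claim}."""
    return {prop: qty_p25 / qty_p22 for prop, qty_p22, qty_p25 in rows}


p25_per_p22 = mothers_per_father(PARENT_ROWS)

# Print from the lowest to the highest ratio.
for prop, ratio in sorted(p25_per_p22.items(), key=lambda kv: kv[1]):
    print(f"{prop}: {ratio:.4f}")
```

A value well below 1, as on CBDB, means that mothers are recorded far less often than fathers on the items carrying that ID.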

Table 2

# ?prop ?qty_e ?qty_p21 ?p21_per_e ?qty_p22 ?p22_per_e ?qty_p25 ?p25_per_e ?p25_per_p22
1 - 13,100,785 10,787,616 0.8234327942944 1,180,514 0.09011017278736 734,727 0.05608266985528 0.6223788959724
2 ISNI 2,136,140 2,001,230 0.9368440270769 118,036 0.05525667793309 56,294 0.02635314164802 0.4769222948931
3 ORCID 1,953,063 726,312 0.3718835490714 2,209 0.001131043903858 868 0.0004444301079893 0.3929379809869
4 VIAF/cluster 3,837,890 3,669,112 0.9560232315152 190,516 0.04964081826212 92,204 0.02402465938315 0.4839698503013
5 GND 2,463,249 2,280,891 0.9259685074469 120,760 0.04902468244177 59,256 0.0240560333121 0.4906922822127
6 GND/DDB 1,185,802 1,108,602 0.934896382364 49,067 0.04137874619878 24,990 0.02107434462077 0.5093036052744
7 GND/DtBio 863,851 863,613 0.9997244895242 72,034 0.08338706559349 36,140 0.04183591846279 0.5017075270011
8 GND/Kalliope 281,876 281,951 1.000266074444 32,720 0.1160794108048 16,675 0.05915721806752 0.5096271393643
9 GND/FID/PA 67,801 67,604 0.9970944381351 6,510 0.09601628294568 3,616 0.0533325467176 0.5554531490015
10 GND/JudaicaLink 31,192 31,038 0.9950628366248 2,502 0.08021287509618 1,181 0.03786227237753 0.4720223820943
11 GND/HessBio 16,267 16,272 1.000307370751 3,059 0.1880494252167 1,399 0.0860023360177 0.4573389996731
12 GND/RPPD 11,825 11,830 1.000422832981 570 0.04820295983087 300 0.02536997885835 0.5263157894737
13 GND/Saebi 1,823 1,826 1.001645639056 344 0.1886999451454 198 0.108612177729 0.5755813953488
14 LC 1,408,089 1,368,884 0.9721572997161 101,634 0.07217867620584 51,878 0.03684284161015 0.5104394198792
15 IdRef 964,663 961,030 0.9962339179589 63,691 0.06602409338805 30,352 0.03146383763034 0.4765508470585
16 NLCR 729,277 736,187 1.009475137705 34,972 0.04795434382272 16,887 0.02315581048079 0.4828720118952
17 NUKAT 700,717 686,705 0.9800033394366 44,916 0.06410005751252 21,179 0.03022475550044 0.4715246237421
18 BNF 621,836 624,554 1.004370927383 51,400 0.08265845013798 24,822 0.03991727722422 0.4829182879377
19 NTA 585,671 585,370 0.9994860595795 52,159 0.08905853286231 25,953 0.04431327485909 0.4975747234418
20 PLWABN 415,562 415,419 0.9996558876894 36,337 0.08744062257858 18,562 0.04466722173827 0.5108291823761
21 SBN 240,462 243,582 1.012975023081 21,412 0.08904525455165 10,415 0.04331245685389 0.4864094900056
22 BNE 224,940 226,632 1.007522005868 20,348 0.09045967813639 10,487 0.04662132124122 0.5153823471594
23 CBDB 413,145 419,489 1.015355383703 111,694 0.2703506032991 20,121 0.04870202955379 0.1801439647609
24 Genealogics 324,214 326,057 1.005684517017 234,543 0.7234203334834 201,638 0.6219287260883 0.859705896147
25 Geni 505,037 507,771 1.005413464756 235,907 0.4671083504773 145,777 0.2886461783988 0.6179426638463
26 WikiTree 411,684 412,624 1.002283304671 249,325 0.6056222733942 192,847 0.4684345274531 0.7734763862429