query-27d527af6e846c99cb870711e97db883

rq turtle/ttl

Motivation. (Q654724)SIMBAD and (sometimes) (Q5420639)Exoplanet Archive , (Q1385430)Extrasolar Planets Encyclopaedia I mainly work with astronomical data. That science is actively moving forward, data quickly becomes outdated and needs to be regularly updated. Unfortunately, direct information extraction from the text of scientific articles (even using ChatGPT) still seems impractical, so one have to use information from astronomical databases. If we narrow scope, for example, to exoplanets, then there are 2.5 major databases: . Now it is supported by three references: 2.33 ± 0.27(P2067)mass (Q9184117)CoRoT-11 b : from hereAny sufficiently large database contains errors. Consider for example the first statement CoRoT-11b: (Q1385430)Extrasolar Planets Encyclopaedia Gandolfi et al. 2012: (Q59246784)Doppler tomography of transiting exoplanets: a prograde, low-inclined orbit for the hot Jupiter CoRoT-11b Gandolfi et al. 2013: (Q56168679)CoRoT: Harvest of the exoplanet program ). table on page 3, you will see that the second article (Gandolfi et al. 2012) estimate is slightly higher: 2.49 ± 0.27 (check corresponding NASA Exoplanet Archive page, which contains the first two references. The problem is that if you open the third referenceAlthough they seem "equal", technically they were obtained by parsing I can write a SPARQL-query to identify problematic statements (the same object/predicate + the same source, but different values):

Use at

PREFIX wikibase: <http://wikiba.se/ontology#>
PREFIX wd: <http://www.wikidata.org/entity/>
PREFIX psv: <http://www.wikidata.org/prop/statement/value/>
PREFIX prov: <http://www.w3.org/ns/prov#>
PREFIX pr: <http://www.wikidata.org/prop/reference/>
PREFIX p: <http://www.wikidata.org/prop/>
SELECT * {
   { SELECT ?item ?source {
  VALUES ?item { wd:Q9184117 } # Limit to CoRoT-11b for demo purposes
  ?item p:P2067/prov:wasDerivedFrom/pr:P248 ?source
  MINUS { VALUES ?source { wd:1385430 wd:Q5420639 wd:654724} } # Exclude aggregators
} GROUP BY ?item ?source HAVING(COUNT(*) > 1)}  ?item p:P2067[psv:P2067[wikibase:quantityAmount ?value; wikibase:quantityUpperBound ?upper]; prov:wasDerivedFrom/pr:P248 ?source]
}

Query found at

graph TD classDef projected fill:lightgreen; classDef literal fill:orange; classDef iri fill:yellow; v3("?_anon_42b5416610844a34809f33a099ae9de157377") v1("?item"):::projected v3("?source"):::projected v5("?upper"):::projected v4("?value"):::projected a2((" ")) a3((" ")) a1((" ")) a5((" ")) a4((" ")) a6((" ")) f0[[" > '1^^xsd:integer'"]] f0 --> a1 bind1[/VALUES ?item/] bind1-->v1 bind10(["wd:Q9184117"]) bind10 --> bind1 v1 --"p:P2067"--> a2 a2 --"prov:wasDerivedFrom"--> a3 a3 --"p:reference/P248"--> v3 subgraph minus2["MINUS"] style minus2 stroke-width:6px,fill:pink,stroke:red; bind3[/VALUES ?source/] bind3-->v3 bind30(["wd:1385430"]) bind30 --> bind3 bind31(["wd:Q5420639"]) bind31 --> bind3 bind32(["wd:654724"]) bind32 --> bind3 end bind5[/"count(*)"/] bind5 --as--o v3 a4 --"wikibase:quantityAmount"--> v4 a4 --"wikibase:quantityUpperBound"--> v5 a5 --"p:statement/value/P2067"--> a4 a5 --"prov:wasDerivedFrom"--> a6 a6 --"p:reference/P248"--> v3 v1 --"p:P2067"--> a5