query-27d527af6e846c99cb870711e97db883
Motivation. (Q654724)SIMBAD and (sometimes) (Q5420639)Exoplanet Archive , (Q1385430)Extrasolar Planets Encyclopaedia I mainly work with astronomical data. That science is actively moving forward, data quickly becomes outdated and needs to be regularly updated. Unfortunately, direct information extraction from the text of scientific articles (even using ChatGPT) still seems impractical, so one have to use information from astronomical databases. If we narrow scope, for example, to exoplanets, then there are 2.5 major databases: . Now it is supported by three references: 2.33 ± 0.27(P2067)mass (Q9184117)CoRoT-11 b : from hereAny sufficiently large database contains errors. Consider for example the first statement CoRoT-11b: (Q1385430)Extrasolar Planets Encyclopaedia Gandolfi et al. 2012: (Q59246784)Doppler tomography of transiting exoplanets: a prograde, low-inclined orbit for the hot Jupiter CoRoT-11b Gandolfi et al. 2013: (Q56168679)CoRoT: Harvest of the exoplanet program ). table on page 3, you will see that the second article (Gandolfi et al. 2012) estimate is slightly higher: 2.49 ± 0.27 (check corresponding NASA Exoplanet Archive page, which contains the first two references. The problem is that if you open the third referenceAlthough they seem "equal", technically they were obtained by parsing I can write a SPARQL-query to identify problematic statements (the same object/predicate + the same source, but different values):
Use at
- https://query.wikidata.org/sparql
PREFIX wikibase: <http://wikiba.se/ontology#>
PREFIX wd: <http://www.wikidata.org/entity/>
PREFIX psv: <http://www.wikidata.org/prop/statement/value/>
PREFIX prov: <http://www.w3.org/ns/prov#>
PREFIX pr: <http://www.wikidata.org/prop/reference/>
PREFIX p: <http://www.wikidata.org/prop/>
SELECT * {
{ SELECT ?item ?source {
VALUES ?item { wd:Q9184117 } # Limit to CoRoT-11b for demo purposes
?item p:P2067/prov:wasDerivedFrom/pr:P248 ?source
MINUS { VALUES ?source { wd:1385430 wd:Q5420639 wd:654724} } # Exclude aggregators
} GROUP BY ?item ?source HAVING(COUNT(*) > 1)} ?item p:P2067[psv:P2067[wikibase:quantityAmount ?value; wikibase:quantityUpperBound ?upper]; prov:wasDerivedFrom/pr:P248 ?source]
}