Skip to content

Publication Retrieval

Publication fields in MaRDMO can be filled in two ways: the user can select an existing item directly from the MaRDI Portal or Wikidata, or they can enter a DOI and let MaRDMO fetch the metadata automatically. This page describes the DOI lookup path and how authors and journals are resolved.

Source Cascade

When a DOI is entered MaRDMO queries multiple sources in two stages.

Stage 1 — Knowledge graphs (parallel): The MaRDI Portal and Wikidata are queried simultaneously via SPARQL. If either source returns a result, Stage 2 is skipped entirely and the found metadata is used.

Stage 2 — External APIs (parallel, only if Stage 1 yields nothing): CrossRef, DataCite, zbMath, and the DOI metadata service are queried simultaneously:

Source Endpoint
CrossRef https://api.crossref.org/works/{doi}
DataCite https://api.datacite.org/dois/{doi}
zbMath https://api.zbmath.org/v1/document/_structured_search?DOI={doi}
DOI metadata https://citation.doi.org/metadata?doi={doi}

The first source in that order that returns a valid response wins.

Once the primary metadata has been retrieved, the ORCiD public API is queried to find researchers associated with the DOI:

https://pub.orcid.org/v3.0/search/?q=doi-self:"{doi}"

The results enrich author records with ORCiD identifiers regardless of which primary source was chosen.

Author Resolution

After the primary publication metadata has been retrieved, MaRDMO attempts to resolve each author to a persistent identifier:

  1. Each author may carry an ORCiD (from CrossRef, DataCite, DOI API, or the ORCiD enrichment step) and/or a zbMath author code (from zbMath).
  2. These identifiers are used to search Wikidata for the author's QID.
  3. The ORCiD, zbMath code, and Wikidata QID together are then used to search the MaRDI Portal for the author's portal QID.
  4. If the author cannot be matched to any MaRDI Portal or Wikidata item, they are stored as a plain string attached to the publication record.

Journal Resolution

  1. The ISSN is extracted from whichever primary source provided the publication metadata.
  2. The ISSN is used to search Wikidata for the journal's QID.
  3. The ISSN and the Wikidata QID together are used to search the MaRDI Portal for the journal's portal QID.
  4. If a match is found anywhere the existing item is reused; if not, a new journal item is created during export with the ISSN as an identifier (and the Wikidata QID stored as a MaRDI Portal property if available).

Field Mapping

The table below shows which field each source provides and how it maps to the MaRDMO publication record.

Field MaRDI Portal / Wikidata (SPARQL) CrossRef DataCite zbMath DOI API
Title label title[0] titles[0].title title.title title
Description arbitrary string generated by MaRDMO
Entry type entrytypelabel type types.bibtex document_type.description type
Date date published.date-parts dates (Issued) year published.date-parts
Volume volume volume relatedItems[0].volume source.series[0].volume volume
Issue issue issue relatedItems[0].issue source.series[0].issue issue
Page page page firstPagelastPage source.page page
DOI doi DOI doi links[doi].identifier DOI
Language languagelabel language language language.languages language
Authors authorInfos (SPARQL) author[].given + family + ORCiD creators[].givenName + familyName + nameIdentifiers[ORCID] contributors.authors[].name + codes[0] author[].given + family + ORCiD
Journal journalInfo (SPARQL) ISSN + container-title relatedIdentifiers + relatedItems source.series ISSN + container-title