Skip to content

fix creators parsing#83

Open
slint wants to merge 3 commits into
inveniosoftware:mainfrom
slint:fix-creators-parsing
Open

fix creators parsing#83
slint wants to merge 3 commits into
inveniosoftware:mainfrom
slint:fix-creators-parsing

Conversation

@slint

@slint slint commented Jun 30, 2026

Copy link
Copy Markdown
Member
  • fix(schema): flatten creators into parallel lists
    gpt-oss-20b drops an ORCID nested under each creator in tool calls; parallel name/orcid/affiliation lists survive. Adds synthetic field examples.

  • fix(extractor): inline ORCIDs next to their author
    A bare appended list misaligned when some authors had no ORCID; inline placement keeps each ID tied to its author.

  • fix(schema): drop ORCIDs that fail the check digit
    gpt-oss fabricates 0000-0000-0000-0000 for authors with no ORCID. Filter by the ISO 7064 check digit in to_suggestions, not on the LLM output schema, since a schema error feeds back and the model invents a checksum-valid fake instead of leaving it null.

slint added 3 commits June 30, 2026 21:14
gpt-oss-20b drops an ORCID nested under each creator in tool calls; parallel name/orcid/affiliation lists survive. Adds synthetic field examples.
A bare appended list misaligned when some authors had no ORCID; inline placement keeps each ID tied to its author.
gpt-oss fabricates 0000-0000-0000-0000 for authors with no ORCID. Filter by the ISO 7064 check digit in to_suggestions, not on the LLM output schema, since a schema error feeds back and the model invents a checksum-valid fake instead of leaving it null.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant