{"id":1602,"date":"2018-05-08T10:22:54","date_gmt":"2018-05-08T08:22:54","guid":{"rendered":"https:\/\/p686699.mittwaldserver.info\/?p=1602"},"modified":"2025-03-20T09:33:46","modified_gmt":"2025-03-20T08:33:46","slug":"converting-print-data-into-xml","status":"publish","type":"post","link":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/","title":{"rendered":"Converting print data into XML"},"content":{"rendered":"\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><em>A scientific reference work only available in print was processed electronically, so that the information in it is now searchable and usable in a multitude of ways. Diagrams at different levels of detail (similar to a map at different scales), in which data was embedded visibly or invisibly, posed a particular challenge.<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Project duration:<\/strong> 6 months<br>Even today, many reference books and standard textbooks are primarily produced for print and are therefore available for secondary use via electronic media only to a limited extent. Search options and navigation via cross-references in particular are heavily restricted. To enable search and navigation in electronic media as well, we developed, together with a leading academic publisher, a concept for transferring print data into an augmented <abbr title=\"XML (Extensible Markup Language) is a markup language for the representation of hierarchically structured data in the format of a text file that is readable both by humans and machines.\">XML format<\/abbr>. In a first step, we created raw files using a <abbr title=\"A parser is a computer program that is responsible for the analysis of entered data and its conversion into a format more suitable for further processing.\">parser<\/abbr>, exploiting information from the typesetting data as far as possible, e.g. 
type size, font, or color, as these have a clearly defined meaning for the content of the printed work. Even at this stage, it turned out that these features were often used ambiguously, e.g. italics both for emphasis and for the names of biological species. As a result, <abbr title=\"XML (Extensible Markup Language) is a markup language for the representation of hierarchically structured data in the format of a text file that is readable both by humans and machines. Tags supplement the data stock with additional information.\">XML tags<\/abbr> can only be assigned unambiguously by human editors with the requisite expert knowledge. In addition, the publisher decided to enrich the text at the content level as well.<br><br>Practical examples include:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>insertion of synonym relations (e.g. between term and abbreviation)<\/li>\n\n\n\n<li>insertion of additional information, definitions, etc.<\/li>\n\n\n\n<li>links between index and text<\/li>\n\n\n\n<li>meaningful additions (e.g. differentiating between base substance and product of a chemical reaction)<\/li>\n\n\n\n<li>additional search options through embedded invisible synonyms and spelling variants<\/li>\n\n\n\n<li>integration and indexing of purely graphical elements<\/li>\n<\/ul>\n\n\n\n<p>Once this post-processing and the subsequent validation against a <abbr title=\"A Document Type Definition defines the details for the use of a certain XML language.\">DTD (Document Type Definition)<\/abbr> are complete, a high-quality data stock is available. The high formal and content-related consistency of the data is the precondition for its further use on electronic platforms and in various applications.<br>All requisite steps can be carried out either remotely in the client\u2019s system or in GIMD\u2019s <a href=\"https:\/\/gimd.de\/en\/software\/\">ARTIS database<\/a>. 
The <a href=\"https:\/\/gimd.de\/en\/software\/\">ARTIS software<\/a> then supports editors with needs-based checking routines, keyboard shortcuts, automated data import and export, allocation of work packages, and much more.<\/p>\n\n\n\n<p class=\"has-text-align-right\"><a href=\"https:\/\/gimd.de\/en\/contact\/\">Contact<\/a> <a href=\"https:\/\/gimd.de\/en\/category\/projects\/#projectsContent\">Show all<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A scientific reference work only available in print was processed electronically, so that the information in it is now searchable [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[109,115,76,78,118,71],"tags":[],"class_list":["post-1602","post","type-post","status-publish","format-standard","hentry","category-converting","category-digitalizing","category-e-books-en","category-life-science-en","category-processing-editing","category-projects"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Converting print data into XML &#8211; 
GIMD<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Converting print data into XML &#8211; GIMD\" \/>\n<meta property=\"og:description\" content=\"A scientific reference work only available in print was processed electronically, so that the information in it is now searchable [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\" \/>\n<meta property=\"og:site_name\" content=\"GIMD\" \/>\n<meta property=\"article:published_time\" content=\"2018-05-08T08:22:54+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-03-20T08:33:46+00:00\" \/>\n<meta name=\"author\" content=\"gimd-redaktion\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"gimd-redaktion\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\"},\"author\":{\"name\":\"gimd-redaktion\",\"@id\":\"https:\/\/gimd.de\/en\/#\/schema\/person\/ba78560ef83b195d9b67f42a1c6e0a6e\"},\"headline\":\"Converting print data into XML\",\"datePublished\":\"2018-05-08T08:22:54+00:00\",\"dateModified\":\"2025-03-20T08:33:46+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\"},\"wordCount\":382,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/gimd.de\/en\/#organization\"},\"articleSection\":[\"converting\",\"digitalizing\",\"E-Books\/E-Journals\",\"Life Science\",\"processing\/editing\",\"Projects\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\",\"url\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\",\"name\":\"Converting print data into XML &#8211; 
GIMD\",\"isPartOf\":{\"@id\":\"https:\/\/gimd.de\/en\/#website\"},\"datePublished\":\"2018-05-08T08:22:54+00:00\",\"dateModified\":\"2025-03-20T08:33:46+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Startseite\",\"item\":\"https:\/\/gimd.de\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Converting print data into XML\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/gimd.de\/en\/#website\",\"url\":\"https:\/\/gimd.de\/en\/\",\"name\":\"GIMD\",\"description\":\"Limited Corporation for Information Management and Documentation\",\"publisher\":{\"@id\":\"https:\/\/gimd.de\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/gimd.de\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/gimd.de\/en\/#organization\",\"name\":\"Gesellschaft f\u00fcr Informations-Management und Dokumentation mbH\",\"url\":\"https:\/\/gimd.de\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/gimd.de\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/gimd.de\/wp-content\/uploads\/2025\/03\/cropped-GIMD-Logo-002.png\",\"contentUrl\":\"https:\/\/gimd.de\/wp-content\/uploads\/2025\/03\/cropped-GIMD-Logo-002.png\",\"width\":800,\"height\":140,\"caption\":\"Gesellschaft f\u00fcr Informations-Management und Dokumentation 
mbH\"},\"image\":{\"@id\":\"https:\/\/gimd.de\/en\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/gimd.de\/en\/#\/schema\/person\/ba78560ef83b195d9b67f42a1c6e0a6e\",\"name\":\"gimd-redaktion\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g\",\"caption\":\"gimd-redaktion\"},\"url\":\"https:\/\/gimd.de\/en\/author\/gimd-redaktion\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Converting print data into XML &#8211; GIMD","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/","og_locale":"en_US","og_type":"article","og_title":"Converting print data into XML &#8211; GIMD","og_description":"A scientific reference work only available in print was processed electronically, so that the information in it is now searchable [&hellip;]","og_url":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/","og_site_name":"GIMD","article_published_time":"2018-05-08T08:22:54+00:00","article_modified_time":"2025-03-20T08:33:46+00:00","author":"gimd-redaktion","twitter_card":"summary_large_image","twitter_misc":{"Written by":"gimd-redaktion","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#article","isPartOf":{"@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/"},"author":{"name":"gimd-redaktion","@id":"https:\/\/gimd.de\/en\/#\/schema\/person\/ba78560ef83b195d9b67f42a1c6e0a6e"},"headline":"Converting print data into XML","datePublished":"2018-05-08T08:22:54+00:00","dateModified":"2025-03-20T08:33:46+00:00","mainEntityOfPage":{"@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/"},"wordCount":382,"commentCount":0,"publisher":{"@id":"https:\/\/gimd.de\/en\/#organization"},"articleSection":["converting","digitalizing","E-Books\/E-Journals","Life Science","processing\/editing","Projects"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/","url":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/","name":"Converting print data into XML &#8211; GIMD","isPartOf":{"@id":"https:\/\/gimd.de\/en\/#website"},"datePublished":"2018-05-08T08:22:54+00:00","dateModified":"2025-03-20T08:33:46+00:00","breadcrumb":{"@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/gimd.de\/en\/2018\/05\/08\/converting-print-data-into-xml\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Startseite","item":"https:\/\/gimd.de\/en\/"},{"@type":"ListItem","position":2,"name":"Converting print data into XML"}]},{"@type":"WebSite","@id":"https:\/\/gimd.de\/en\/#website","url":"https:\/\/gimd.de\/en\/","name":"GIMD","description":"Limited Corporation for Information Management and 
Documentation","publisher":{"@id":"https:\/\/gimd.de\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/gimd.de\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/gimd.de\/en\/#organization","name":"Gesellschaft f\u00fcr Informations-Management und Dokumentation mbH","url":"https:\/\/gimd.de\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/gimd.de\/en\/#\/schema\/logo\/image\/","url":"https:\/\/gimd.de\/wp-content\/uploads\/2025\/03\/cropped-GIMD-Logo-002.png","contentUrl":"https:\/\/gimd.de\/wp-content\/uploads\/2025\/03\/cropped-GIMD-Logo-002.png","width":800,"height":140,"caption":"Gesellschaft f\u00fcr Informations-Management und Dokumentation mbH"},"image":{"@id":"https:\/\/gimd.de\/en\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/gimd.de\/en\/#\/schema\/person\/ba78560ef83b195d9b67f42a1c6e0a6e","name":"gimd-redaktion","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e0982fccb74046b96083b824011f76dd98b3644c85bed7cc8c056ebb40fcb2cd?s=96&d=mm&r=g","caption":"gimd-redaktion"},"url":"https:\/\/gimd.de\/en\/author\/gimd-redaktion\/"}]}},"_links":{"self":[{"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/posts\/1602","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":t
rue,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/comments?post=1602"}],"version-history":[{"count":3,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/posts\/1602\/revisions"}],"predecessor-version":[{"id":1612,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/posts\/1602\/revisions\/1612"}],"wp:attachment":[{"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/media?parent=1602"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/categories?post=1602"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/gimd.de\/en\/wp-json\/wp\/v2\/tags?post=1602"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}