Towards plug-and-play integration of archetypes into legacy electronic health record systems: the ArchiMed experience

Duftschmid, Georg; Chaloupka, Judith; Rinner, Christoph

doi:10.1186/1472-6947-13-11

Research article
Open access
Published: 22 January 2013

Towards plug-and-play integration of archetypes into legacy electronic health record systems: the ArchiMed experience

Georg Duftschmid¹,
Judith Chaloupka¹ &
Christoph Rinner¹

BMC Medical Informatics and Decision Making volume 13, Article number: 11 (2013) Cite this article

4308 Accesses
10 Citations
7 Altmetric
Metrics details

Abstract

Background

The dual model approach represents a promising solution for achieving semantically interoperable standardized electronic health record (EHR) exchange. Its acceptance, however, will depend on the effort required for integrating archetypes into legacy EHR systems.

Methods

We propose a corresponding approach that: (a) automatically generates entry forms in legacy EHR systems from archetypes; and (b) allows the immediate export of EHR documents that are recorded via the generated forms and stored in the EHR systems’ internal format as standardized and archetype-compliant EHR extracts. As a prerequisite for applying our approach, we define a set of basic requirements for the EHR systems.

Results

We tested our approach with an EHR system called ArchiMed and were able to successfully integrate 15 archetypes from a test set of 27. For 12 archetypes, the form generation failed owing to a particular type of complex structure (multiple repeating subnodes), which was prescribed by the archetypes but not supported by ArchiMed’s data model.

Conclusions

Our experiences show that archetypes should be customized based on the planned application scenario before their integration. This would allow problematic structures to be dissolved and irrelevant optional archetype nodes to be removed. For customization of archetypes, openEHR templates or specialized archetypes may be employed. Gaps in the data types or terminological features supported by an EHR system will often not preclude integration of the relevant archetypes. More work needs to be done on the usability of the generated forms.

Peer Review reports

Background

According to the EHR IMPACT study, interoperability is a key factor for the success of electronic health record (EHR) systems [1]. In today’s heterogeneous world of health information technology with many different EHR systems on the market, the employment of EHR standards is widely seen as a prerequisite for interoperability [2–4]. In this scenario, EHR systems transform the data to be exchanged from their internal format to a common standard format called EHR extract[5] and vice versa. To obtain optimum information management for integrated care, semantic interoperability should be strived for [5].

The dual model approach represents a promising method for achieving semantic interoperability [6]. It combines two kinds of models, the Reference Model (RM) and Archetype Model (AM), to represent EHR content [7]. By specifying the structure of an individual EHR content and providing an interface to medical terminology, archetypes are an important means of achieving semantic interoperability. Currently, ISO/EN 13606, HL7 Clinical Document Architecture, and openEHR represent the most important dual model based EHR standards [8–10]. HL7 is currently working on the so-called templates concept [11], which is conceptually similar to archetypes. In the following, we refer only to archetypes.

A frequently stated benefit of the dual model approach is that, unlike the single model approach, EHR systems do not have to be programmatically updated each time new types of EHR content have to be introduced or existing ones need to be modified [4, 7, 12]. In the dual model approach only the stable RM is “hardcoded” in the EHR system. Modifications of existing and additions of new archetypes can be handled without having to reprogram the EHR system, as shown in several pilot implementations of the dual model approach [13–15].

Existing implementations typically require some sort of manual system parameterization when integrating an archetype, such as a manual mapping between the archetype and the internal data model of the EHR system. If the effort involved in this system parameterization exceeds a certain limit, the dual model approach will still not be practicable. The ideal solution would be automatic integration of archetypes into an EHR system without any manual effort. This corresponds to the so-called “plug-and-play” integration of archetypes in [12].

Previous work on plug-and-play integration of archetypes focused primarily on the automatic generation of forms from archetypes within EHR systems where the latter are already internally based on a dual data model [16–19]. In contrast to these, our present work concentrates on the integration of archetypes into legacy EHR systems with proprietary internal data models. In accordance with [9], we assume that the dual model approach is used only to standardize the communication layer. This complicates the task insofar as the limitations of the legacy EHR system data models have to be considered.

In [20] Chen et al. present an approach for an automatic bi-directional conversion between openEHR archetypes and the internal data model of an EHR system called COSMIC. They describe how the AM and RM can be semantically mapped to so-called COSMIC templates, which can be directly used to record data within the COSMIC system.

Our goal is to extend the work of Chen et al. with respect to the following:

Based on their semantic mapping, we develop a more generalized approach for automatically generating entry forms in legacy EHR systems from archetypes. As a prerequisite for applying our approach, we define a set of basic requirements for the EHR systems, which are in accordance with ISO/TS 18308 “Requirements for an Electronic Health Record Architecture” [21].

Additionally, we introduce a method for the immediate export of EHR documents that are recorded via the generated forms and stored in the EHR systems’ internal proprietary format as standardized and archetype-compliant EHR extracts.

To test our approach, we implemented a corresponding prototype within the EHR system ArchiMed [22].

We chose the openEHR architecture for our study, as it currently provides the most mature public library of archetypes [23].

In this study, our focus is on the integration of archetypes into legacy EHR systems, where the archetypes have been published by an organisation that adheres to the principles of domain knowledge governance [24]. Therefore, we do not address the transformation of EHR system forms into archetypes.

Methods

In the following, we first address the requirements that must be satisfied by an EHR system’s data model as a prerequisite for applying our approach for plug-and-play integration of archetypes. We then describe the first part of our approach, i.e., the automatic generation of entry forms within legacy EHR systems from archetypes (cf. Figure 1). Finally, we explain the second part of our approach, that is, how EHR documents that are recorded via the generated forms and stored in the EHR system’s internal format may immediately be exported as standardized and archetype-compliant EHR extracts.

Prerequisites for applying our approach

We restrict our prerequisites to a small number of basic requirements to enhance the general applicability of our approach. The following requirements for an EHR system’s data model that are needed in order to apply our approach are supported by corresponding statements in the ISO/TS 18308 “Requirements for an Electronic Health Record Architecture” [21]:

It must contain a component that represents entry forms. This is supported by ISO/TS 18308 requirement PRO1.1: “The EHR architecture shall support the recording of any type of clinical event […] relevant to the care of a patient”, insofar as clinical events are typically recorded via forms in an EHR system.
It must contain a component that represents labelled entry fields. This demand is supported by ISO/TS 18308 requirement STR2.4: “The EHR architecture shall enable storage of data such that simple name/value pairing is preserved”.
It must support a dynamic duplication of entry fields during documentation (e.g., via tables with extendable rows). This is essential for the representation of repeating archetype nodes, i.e., nodes with an upper occurrence limit greater than 1. This demand is supported by ISO/TS 18308 requirement STR2.2: “The EHR architecture shall enable storage of data in tables such that the relationships of data with the row and column headings are preserved”.
It should support at least textual, numeric, date, and time data types. This demand is supported by ISO/TS 18308 requirements STR2.6: “The EHR architecture shall support the inclusion of narrative free text”, STR3.1: “The EHR architecture shall support the definition of the logical structure of numeric and quantifiable data […]”, and STR3.6: “The EHR architecture shall support the definition of the logical structure of dates and times”. The closer an EHR system’s set of supported data types matches the set of data types used in archetypes, the smaller is the loss of data quality when transforming an archetype to an EHR system form.

The EHR system must further allow individual access of all form components and all data recorded via forms. Depending on the underlying database, SQL queries, XQueries, or similar technologies may be applied for this purpose.

Automatic generation of EHR system forms from archetypes

An openEHR archetype (see Figure 2) consists of a tree-like hierarchical structure of nodes, which define valid instantiations of the openEHR RM. Each node constrains a class of the RM or a data type. Archetype leaf nodes constrain a primitive data type. The data that are to be collected in the generated EHR system form are exclusively described by the leaf nodes. All other nodes serve to describe the structural and semantic context of the data to be collected.

For the generation of EHR system forms from archetypes, the latter have to be augmented to comprehensive archetypes[15] in the first step. This is essential as archetypes only include those constraints, which they tighten with respect to the RM. Mandatory attributes of the RM that are not further constrained by the archetype can be seen as “implicit” constraints, which also have to be considered and must be “looked up” in the RM. As an example, the openEHR RM prescribes a mandatory attribute origin for class HISTORY, which is not addressed in node at0001 of Figure 2, but still has to appear in the generated form. When creating the comprehensive archetypes, the archetypes are augmented with the implicit constraints. In the following steps, comprehensive archetypes are used exclusively.

EHR system forms may be derived from archetypes based on a three-layered semantic mapping that addresses structural constraints, data value constraints, and terminology related constraints [20].

Structural constraints mapping

The goal of this step is to map the hierarchical structures of archetype nodes to semantically comparable structures within the EHR system data model.

The entry points in the two models, which are mapped to each other, are the archetype root node and the EHR system form. Semantically, an archetype node of class COMPOSITION represents an obvious counterpart of the EHR system form, as both describe the structure for a class of documents. However, archetypes frequently start with a root node that resides below the COMPOSITION class in the RM hierarchy. In this case, the form that is derived from the root node may be seen as an artificial container that is required in the EHR system to document the data described by the archetype.

Leaf nodes may be mapped to entry fields. Intermediate nodes describing the context of “their” leaf node may be mapped to textual labels, which precede the label of the entry field (e.g., compare column “Single. BodyMassIndex.value.units” in Figure 3). Naturally, the nodes’ local terms as defined in the archetype ontology section can be the source of the labels. In Figure 2 the local terms are shown as comments of the corresponding nodes.

If the EHR system supports additional “organisational” form components (e.g., pages, sections, or groups) corresponding to the semantics of RM classes, the context may alternatively be expressed by mapping the intermediate nodes to these form components.

Repeating archetype nodes, i.e., nodes with an upper occurrence limit greater than 1 (such as node at0002 in Figure 2), must be mapped to a form component, which allows entry fields to be dynamically duplicated during documentation (e.g., a table). The leaf nodes of the different branches “below” the repeating node represent the entry fields, which may be dynamically duplicated.

EHR system data models do not usually support a recursive duplication of entry fields during documentation. Thus, the mapping will fail if an archetype includes multiple levels of repeating nodes, i.e., a repeating node holding a repeating subnode (cf. Figure 4).

Figure 5 summarizes the structural constraints mapping using pseudo-code notation.

Data value constraints mapping

The goal of this step is to map the data types and associated constraints that may occur within archetypes to those supported by the EHR system model. The AM provides eight primitive types, which may be constrained by an archetype leaf node [26]. These primitive types have to be mapped to corresponding data types in the EHR system model.

Terminology related mapping

The goal of this step is to map the terminology bindings within archetypes to the EHR system model. Archetypes may define locally defined terms and associated display strings as allowed value sets for their nodes. They may also define bindings for their nodes to terms in external terminologies within their term_bindings section.

Generation of standardized and archetype-compliant EHR extracts from the collected documents

To prepare for the generation of standardized and archetype-compliant EHR extracts from the data collected via the generated forms, we record the complete path of the original comprehensive archetype node during the generation of each form component. This is necessary since, owing to the fact that the RM is typically more expressive than the EHR system data model [20], different RM classes will usually be mapped to the same EHR system data model class. Thus, the class of the original archetype node cannot be unambiguously recovered from the types of the generated form components.

Each time a generated form is populated with data, a document is created in the EHR system. For each document that needs to be exported as a standardized and archetype-compliant EHR extract, the underlying form components are retrieved. Based on the structure of the form and the paths of the comprehensive archetype nodes, which were recorded during the creation of the form components, the XML-based EHR extract is composed from the source document data. If the complete paths of the archetype nodes associated with each form component are stored instead of only the node identifiers, the structure of the EHR extract can be assembled without having to access the original archetype. In our prototype we stored the paths of the archetype nodes in XPath format.

To export data from a legacy EHR system as openEHR conformant EHR extracts, the Generic_extract package of the Extract Information Model specification [27] must be used. Existing XML schemas [28] of the openEHR RM and the Extract Information Model may be used to validate the EHR extract.

Results

In the following, we present our prototype implemented within the EHR system ArchiMed. For ease of explanation, we refer to an example, which shows how an ArchiMed form (see Figure 3) is automatically generated from the archetype depicted in Figure 2.

Automatic generation of ArchiMed forms from OpenEHR archetypes

As part of our prototype we used the open source Java-version of the archetype parser that was developed in the course of the openEHR Java Reference Implementation Project [29] and is available from [25].