The Interop Glossary
An evolving vocabulary for the law of interoperability governing electronic data format and communication protocol technical specifications, standards, and technical regulations.
- AGP
- Agreement on Government Procurement. A treaty governing procurement by all levels of government in signatory nations, which include the U.S.A. The AGP is intended to remove unnecessary obstacles to international trade in government procurement practices and procedures in part by requiring use of international standards adopted pursuant to the ATBT as government procurement specifications at all levels of government in member nations except to the extent international standards were prepared, adopted, or applied with a view to or with the effect of creating unnecessary obstacles to international trade. See text of the AGP (annotated version).
- ANSI
- American National Standards Institute, a major voluntary standards and conformance assessment procedure development organization in the U.S. ANSI is organized as an industry consortium. Through a memorandum of understanding that is of questionable legality, NIST has delegated responsibilities for representing the U.S. government's interests in international standard development efforts to ANSI and its chosen delegees, whereas the ATBT and at least arguably the U.S. Administrative Procedures Act rests those responsibilities with the federal government itself. On ANSI generally, see About ANSI.
- application-level interoperability
- Interoperability among ICT systems established through means other than by adherence to a data format or communications protocol specification, for example where knowledge about another ICT system's interface for a given interchange of information is not fully and publicly specified and such information must be obtained through collaboration among developers or through non-collaborative techniques such as litigation, legislation, or reverse engineering. Another variant of application-level interoperability requires the use of an application's programming interfaces in lieu of writing directly to a file format or communications protocol. JTC 1 Directives, Annex I, prohibits standards and technical regulations from enabling only application-level interoperability by requiring that international standards in the ICT sector "clearly and unambiguously specify the conformity requirements essential to achieve the interoperability." The Agreement on Government Procurement has the effect of extending that requirement to technical (procurement) specifications.
- ASCII
- American Standard Code for Information Interchange, often referred to as plain text. ASCII is a character encoding based on characters of the English language plus various non-printing device control codes. There are many variants. ASCII has been extended to accommodate a large number of human languages and for other purposes using, e.g., Unicode and ISO/IEC:10646-Universal Character Set. ASCII is the foundation of many markup languages, including all variants of XML. See generally An annotated history of some character codes or ASCII. ASCII was orginally developed as a telegraphy code, i.e., a communications protocol, but quickly became a "file format" as electromagnetic data storage media displaced punched paper tape.
- ATBT
- Agreement on Technical Barriers to Trade. A treaty intended to remove unnecessary obstacles to international trade in the preparation, adoption, and application of technical standards, technical regulations, and conformity assessment procedures. Like the AGP, the ATBT applies at all levels of government in signatory nations, as well as to all non-governmental standards development organizations in member nations. See text of the ATBT (annotated version).
- business process
- A set of linked business activities that take one or more inputs and transform them to create an output. Ideally, the transformation that occurs in the process should add value to the input and create an output that is more useful and effective to the recipient either upstream or downstream in the processing chain. See H. Johansson, et al, Business Process Reengineering: BreakPoint Strategies for Market Dominance (1993).
- CDF
- Compound Document Formats. A set of World-Wide Web Consortium candidate recommendations (standards) for generating and rendering compound documents. It includes a markup language-neutral interoperability framework (CDRF + CDIF) and WICDprofiles for different combinations of XML-derived markup languages. See How Does Compound Document Framework Benefit Us? and W3C CDF Working Group public home page.
- CDIF
- Compound Document by Inclusion Framework. An interoperability framework and markup profile under development by the W3C Compound Document Formats Working Group. Using CDIF, compound documents will be created by direct inclusion of copied content rather than by reference. CDIF is planned as a layer paralleling that of WICD Core profile between CDRF and the WICD Mobile profile. Planned version 2.0 WICD profiles must integrate with both CDRF and CDIF. See W3C CDIF Draft.
- CDRF
- Compound Document by Reference Framework. An interoperability framework specification that is one of the W3C CDF recommendations. CDRF addresses interoperability issues not addressed by individual markup languages incorporated in a compound document such as the propagation of events across namespaces, the combination of rendering, and the user interaction model. CDRF is designed for the interoperability of implementations of the various Web Integration Compound Document (WICD) profiles and integration of the forthcoming CDIF. The Compound Document Framework is language-independent. While CDRF is clearly meant to serve as the basis for integrating W3C's family of XML formats within its Interaction Domain (e.g., MathML, SMIL, SVG, VoiceXML, XForms, XHTML, XSL) with each other, together with CSS and the W3C Document Object Model, it can also be used to integrate non-W3C formats with W3C formats or to integrate non-W3C formats with other non-W3C formats. See latest CDRF specification.
- communications protocol
- A common signaling language used for communications purposes. Such protocols predate the advent of electronic computing, e.g., viewed broadly, speech, writing systems, smoke signals, semaphore code, sign language, and telegraphy codes are all communications protocols. In the ICT context, the signals making up a communication may be transmitted by any of numerous methods such as copper wire, electrical circuits, radio frequencies, microwave, fiber optic cable, quantum mechanics, etc. See also definition in State of New York v. Microsoft, pp. 35-36. The line between communications protocol and data formats is difficult to draw in a consistent fashion. E.g., TTS is unmistakably both.
- compatibility
- In the information technology context, the qualitative degree to which one information technology system can properly process information provided by another IT system. Compatibility involves consideration of fidelity and is not a synonym for interoperability.
- composite application
- a perspective of software engineering that defines an application built by combining multiple existing functions into a new application. People often compare composite applications to mashups. However, composite applications leverage enterprise and enterprise-ready sources (e.g., existing modules or even enterprise web services) of information, while mashups usually rely on web-based, and often free, sources. -- Wikipedia. The creation of composite applications may be considered one method of achieving interoperability.
- compound document
- A document that combines multiple electronic document formats in a single document.
- conformance
- Fulfilment by a product, process, or service of specified requirements, e.g., as required by a standard, technical regulation, or procurement specification.
- conformance assessment
- Any procedure used, directly or indirectly, to determine that relevant requirements in a technical regulation, standard, or procurement specification is fulfilled. Such procedures include, inter alia, procedures for sampling, testing and inspection; evaluation, verification and assurance of conformity; registration, accreditation and approval as well as their combinations. See ATBT Article XVII § 3; ISO/IEC JTC 1 Directives, pp. 11, 83-84, 145. Validation is only one conformity assessment procedure that will often be insufficient by itself to ensure full conformity because of rules that cannot be expressed in a markup language's grammar.
- convert
- In the ICT context, conversion is the process of translating data or a carrier to another data format or carrier.
- data format
- A specification for the consistent structuring of data of a specified type. Data structures include, inter alia, markup languages, file formats, file storage systems, database records, and communications protocols. Most data formats include multiple sub-structures. Specification of data formats is essential to interoperability in the ICT sector.
- document-level interoperability
- As opposed to application-level interoperability, document-level interoperability in the standards context is achieved without knowledge of another implementation's unique data formats or communications protocols because a standardized interface for the interchange of information is fully specified by a standard, technical regulation, or technical specification.
- DTD
- Document Type Definition defined slightly differently within the XML and SGML standards (XML is a subset of SGML). DTD is one of several SGML and XML schema languages, and is also the term used to describe a document or portion thereof that is authored in the particular DTD language, e.g., HTML. A DTD is primarily used for the expression of a schema via a set of declarations that conform to a particular markup syntax and that describe a class, or type, of SGML or XML documents, in terms of constraints on the structure of those documents. A DTD may also declare constructs that are not always required to establish document structure, but that may affect the interpretation of some documents. As a starting point, see Wikipedia article.
- E-SIGN
- Electronic Signatures in Global and National Commerce Act. A U.S. federal statute widely authorizing the substitution of electronically signed digital records for paper records to the extent the Act's requirements are followed. E-SIGN covers a broad range of record types. The Act applies at all levels of government and throughout the private sector. 15 U.S.C. §§ 7001, et seq.
- Ecma
- Ecma International, also known as the European Association for Standardizing Information and Communication Systems. Ecma is a software industry consortium that develops standards for information and communication technology as well as for consumer electronics. Formerly known as the European Computer Manufacturers Association ("ECMA"), the organization's present name is commonly mistakenly written in all capital characters; however, the name was formally changed in 1994. The OOXML standard was developed under the auspices of Ecma. See What Is Ecma International[?]
- end point solution
- A software application or device incapable of serving as a router of information in a business process processing chain and commonly using a data format that is incompatible with other applications, requiring application-level interoperability to the extent interoperability can be achieved. If an end point solution's data is stored in formats that are incompatible with applications capable of functioning as a router of information, such data is said to be stored in a data silo. End point solutions were common before the advent of ubiquitous networking but are increasingly being obsoleted by router-capable software.
- fidelity
- A key measure of interoperability and compatibility, the quality of document content, presentation and metadata preserved on exchange between applications. Fidelity is an objective measure, usually the percentage extent to which data and metadata can be properly mapped from one format to the other without being lost or garbled in the conversion process.
- ICT
- Information and communications technology. The term encompasses both hardware and software.
- IEC
- International Electrotechnical Commission. An international standardization body with membership limited to national standards bodies, designated by the ATBT (along with ISO and the International Telecommunications Union body of the United Nations) as a body for preparation and adoption of technical international standards and conformity assessment procedures. See About IEC.
- INCITS
- InterNational Committee for Information Technology Standards, a standards development organization sponsored by the Information Technology Industry Council, which bills itself as "[o]ne of Washington's most effective lobbying groups." INCITS is an industry consortia that develops and maintains de jure standards and is accredited by ANSI as the U.S. government's representative at ISO/IEC/JTC 1. INCITS board of directors includes three federal entities, NIST, Dept. of Homeland Security, and Dept. of Defense; the remainder of the board members are private corporations. See generally, What is INCITS?
- inclusion
- Electronic document creation embedding content from other documents by creating a new complete and permanent document that must itself be edited to make changes. Inclusion is in contrast to transclusion, which allows a single editing point to be maintained for content that recurs in multiple documents. Inclusion is generally preferred to inclusion when a document must not be changed except by editing the same document. A familiar manual example of inclusion is copying a portion of one electronic document and pasting the content into a new document.
- incompatible
- To the extent that information received by one ICT system from another cannot be mapped and processed correctly without data loss or garbling, the systems are said to be incompatible.
- interoperability
- The ability of ICT systems to exchange information at one or more standardized interfaces and to make equal mutual use of the information that has been exchanged, without differences in use attributable to inadequacies in technical regulations, standards, or technical specifications. ICT systems that achieve interoperability are said to be interoperable. Interoperabiity is commonly referred to in abbreviated form as interop.
- interoperability framework
- In the context of electronic data formats and communications protocols, a standardized interface for information exchange that implementations must adhere to in order to achieve interoperability, normally specified as interoperability conformity requirements essential to achieve the interoperability. However, in minor applications, there may be no specification other than source code and comments in it. An example of an interoperability framework is the W3C's CDRF.
- ISO
- International Organization for Standardization. A non-governmental standardization body composed of a network of standardization bodies, both governmental and non-governmental, which works closely with IEC. Like IEC and the International Telecommunications Union body of the United Nations, ISO has a quasi-government ministerial role recognized under the ATBT in the creation and maintenance of international standards and conformity assessment procedures See About ISO.
- JTC 1
- Joint Technical Committee 1. A joint technical committee jointly established by ISO and IEC to develop and process international standards and conformity assessment procedures in the software information technology sector. See JTC 1 home page.
- mapping
- In conversions between data formats or communications protocols, the process of relating each source encoding to a target encoding and identifying the translation that satisfies the relationship. In an over-simplified example, a mapping might specify that all occurrences of <foo> will be translated to <bar> where <foo> and <bar> identify the same functionality in two different markup languages. It can be difficult or impossible to satisfactorily map between two formats/protocols to the extent they do not share the same or very similar functionality, a condition known as incompatibility.
- markup
- In the context of XML markup languages, text consists of intermingled character data and markup. Markup consists of syntactically delimited characters added to the character data of a document to represent and define its structure and presentation. XML markup takes the form of start-tags, end-tags, empty-element tags, entity references, character references, comments, CDATA section delimiters, document type declarations, processing instructions, XML declarations, text declarations, and any white space that is at the top level of the document entity (that is, outside the document element and not inside any other markup). All text that is not markup constitutes the character data of the document. See W3C XML 1.0 § 2.4. The original markup language was a handwritten code used in the typographical trade that evolved over a period of some 500 years.
- metadata
- Literally, data about data. For example, a library card index is data about data, the books in the library's collection. In the context of data formats, metadatamarkup. Metadata can identify data types, classify data, define data and document structures, specify how an application should process and present data, and specify other data about data.
- multi-tripping
- An essential aspect of interoperability when multiple ICT systems must function as routers of information in a work flow or business process, passing data through a succession of applications without loss of data. In business processes and workflows, all particular applications that must process data in a processing chain often are unknown at the time the data is created. As data recycling becomes more common, the ability of ICT systems to participate in multi-tripping without data loss is becoming mission critical. Compare with round-tripping.
- NAFTA
- North American Free Trade Agreement. A regional treaty among Canada, Mexico, and the U.S.A. that implements a number of the Free Trade Agreements, including the AGP and ATBT. See NAFTA text; NAFTA Secretariat web site; NAFTA Glossary.
- namespace
- XML namespaces provide a simple method for qualifying element and attribute names used in XML documents by associating them with namespaces identified by Uniform Resource Identifier ("URI"). In XML compound documents, namespaces indicate the particular markup language in use for a specified portion of the document. XML namespaces are generally specified according to the W3C Namespaces in XML 1.0 Recommendation. For a short overview, see Wikipedia article.
- NB
- An acronym commonly used to refer to national standardization bodies that act on behalf of national governments in the preparation, adoption, and maintenance of technical regulations, international standards, and conformity assessment procedures developed pursuant to the requirements of the ATBT. In most nations that are ATBT members, NBs are units of government. However, some nations such as the U.S. have delegated such responsibilities to private industry consortia.
- NIST
- National Institute of Standards and Technology. A U.S. federal agency with broad jurisdiction over standards activities. See NIST General Information web page.
- normative
- A normative document is any document that provides rules, guidelines or characteristics for activities or their results and covers such documents as standards, technical specifications, codes of practice, and technical regulations. A normative reference is another document incorporated by reference in a normative document in such a way as to make it indispensable for the implementation of the specification in the normative document. A normative requirement is a requirement that is indispensable for the implementation of a normative document, as opposed to optional or recommendatory requirements. Competent normative documents incorporate definitions of requirements keyword terms that may differ. E.g., compare definition of may in RFC 2119 with that in ISO/IEC Directive Part 1, Annex H. (The former imposes a nuanced requirement of interoperability while the latter does not.)
- OASIS
- Organization for the Advancement of Structured Information Standards. A software industry consortium based in the U.S. that develops data format voluntary standards. See About OASIS web page.
- ODF
- An acronym widely used to refer to any version or derivative of a particular family of XML compound document file formats for office productivity software, one version of which is the international standard ISO/IEC:26300-2006 OpenDocument. ODF is derived from the XML schemas originally developed for Sun Microsystems' StarOffice/OpenOffice.org office suites. Contrary to widespread misinformation, the OpenDocument formats were not designed for interoperability and are grossly under-specified for that purpose.
- OOXML
- Office Open XML. A term used to describe various versions of a family of office productivity XML compound document formats originally developed by Microsoft. There are significant differences among versions. One version has been adopted as the international standard ISO/IEC:29500-2008. Contrary to widespread misinformation, OOXML was not designed for interoperability and is grossly under-specified for that purpose.
- A three-letter extension to a software file name indicating to the system that the file type is one of the Portable Document Formats originally developed by Adobe. PDF is widely used for pre-print and archival purposes, as well as for portable documents. PDF/A ("Archival") is now ISO 19005-1:2005 and PDF/X is standardized as ISO 15929 and 15930. See generally White Paper PDF Primer by PDF Tools AG. PDF is expected to be succeeded by XML-based formats, with Adobe Mars and Microsoft XPS as leading contenders.
- processing chain
- Common to workflows and other workgroup-related document exchange, a processing chain is the routing of electronic documents between multiple information processing systems. Multiple applications are involved, with the possibility of document content changing frequently through extraction of information from one or more documents to automatically create another. Applications in a processing chain must respect the processing instructions, scripts, macros, the embedded linking of objects, and security settings needed by other applications in the chain.
- profile
- In electronic document standards work, a profile consists of a defined superset or subset of a specification with fully specified conformance requirements essential to achieve interoperability among implementing applications. Complex technical specifications often have many optional features, making interoperability problematic because of differences as to which options are supported. Although profiles are widely used in other types of data formats, their utility was generally neglected in standards for the office productivity software sector before the W3C's CDRF was introduced. E.g., neither ODF nor OOXML have been profiled for interoperability purposes.
- rip out and replace
- The process of removing existing applications and replacing them with competing but incompatible alternatives. This stands in contrast to the process of upgrading existing applications to access new features, capabilities and emerging technologies. More importantly, the success of any Service Oriented Architecture/Software as a Service approach is predicated on the value of a non-disruptive leveraging of existing information systems to access the benefits and features of emerging technologies. Rip out and replace is costly and disruptive, and should only be used as a last resort to break free of vendor controlled information systems. The term is sometimes condensed to rip and replace.
- round-tripping
- An essential aspect of interoperability whereby two ICT systems such as software applications are able to exchange data without loss and make full mutual use of the same data. Each transfer of data from one IT system to the other in one direction is referred to as a trip, i.e., the outbound trip and the return trip. Compare with multi-tripping.
- router of information
- An ICT system design based on achieving the highest levels of interoperability with other applications whether in a specific multi-ICT system processing chain or not. The phrase encapsulates the ability of an ICT system to "route" or pass along information rather than consume information as an end point solution. Compare with end point solution.
- SC 34
- Subcommittee 34. A particular technical subcommittee of ISO/IEC/JTC 1 established for development and processing of Document Description and Processing Languages standards, technical reports, and conformity assessment procedures relating to structured markup languages (specifically the Standard Generalized Markup Language (SGML) and the Extensible Markup Language (XML) in the areas of information description, processing and association. See SC 34 Scope.
- schema
- A schema is a description of a type of XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntax constraints imposed by XML itself. An XML schema provides a view of the document type at a relatively high level of abstraction. See Wikipedia article.
- SDO
- Standard development organization. SDOs may be governmental or non-governmental.
- SOA
Service Oriented Architecture. An architectural style for ICT systems that is largely a response to a crisis of complexity in legacy systems. An SOA guides all aspects of creating and using business processes, packaged as services, throughout their life cycle, as well as defining and provisioning the IT infrastructure that allows different applications to exchange data and participate in business processes loosely coupled from the operating systems and programming languages underlying those applications.
SOA represents a model in which functionality is decomposed into distinct units (information services) that can be distributed over a network and can be combined together and reused to create business applications. These services communicate with each other by passing data from one service to another, or by coordinating an activity between two or more services. SOA concepts are often seen as being built upon, and the evolution of, older concepts of distributed computing and modular programming. See Wikipedia article.
- standard
- Under the Agreement on Technical Barriers to Trade, "[a] document approved by a recognized body, that provides, for common and repeated use, rules, guidelines or characteristics for products or related processes and production methods, with which compliance is not mandatory. It may also include or deal exclusively with terminology, symbols, packaging, marking or labeling requirements as they apply to a product, process or production method." The definition is found in ATBT Annex 1 § 2.
- technical regulation
- Under the Agreement on Technical Barriers to Trade, "a document which lays down product characteristics or their related processes and production methods, including the applicable administrative provisions, with which compliance is mandatory. It may also include or deal exclusively with terminology, symbols, packaging, marking or labeling requirements as they apply to a product, process or production method." For a leading case interpreting the definition, see WTDS 135 EC - Asbestos ¶¶ 66-70. The definition is found in ATBT Annex 1 § 1.
- technical specification
- A term of art under the Agreement on Government Procurement. "Technical specifications lay[] down the characteristics of the products or services to be procured, such as quality, performance, safety and dimensions, symbols, terminology, packaging, marking and labelling, or the processes and methods for their production[."] (Quoted from AGP article VI § 1.) A specification may simultaneously be a standard, a technical regulation, or a technical specification in different contexts. Also commonly referred to as a "procurement specification."
- transclusion
- The inclusion of a document or document part into another document, temporarily by reference rather than by embedding the same content. It is a feature of substitution templates. For example, a web page document viewed in a browser often includes content transcluded from a variety of documents or document parts originating from different servers, producing a compound document. Transclusion has among its advantages the maintenance of a single editing point for recurring content, allowing changes in one document "part" to be automatically propagated to all other documents that transclude the edited document. Transclusion can be thought of as a method of interoperability. See generally Wikipedia article.
- transition technologies
- Technologies capable of connecting legacy systems and applications to emerging information systems. In SOA-speak, these technologies are often called "connectors," which is different from that of "converters." SOA converters are used to connect legacy systems via a common XML layer, which is then used to wire legacy systems and applications into advancing web ready systems. A common transition would be from client/server workgroup oriented workflows to that of client/Web/server business processes. An example is the use of the OOXML Microsoft Office plug-in (Microsoft Compatibility Pack) to perfect a transition to a new stack alignment of Microsoft Office/Exchange/ SharePoint/MS-SQL.
- triple
- An n-tuple with three objects.
- TTS
- Teletypesetter. The TTS telegraphy code was introduced in 1928 to enable the Associated Press telegraphy network to distribute news articles in a format that did not require rekeyboarding to be typeset. TTS was input and output as 6-bit punched paper tape. TTS was the first commercially successful digital markup language/communication protocol that included code for presentation purposes. Most modern desktop publishing systems and word processors are based on fundamental software methods and concepts first commercialized in the newspaper publishing industry for use with TTS code.
- tuple
- In mathematics and computer science, a tuple is a finite sequence (also known as an "ordered list") of objects, each of a specified type. A tuple containing n objects is known as an "n-tuple". For example the 4-tuple (or "quadruple"), with components of respective types PERSON, DAY, MONTH and YEAR, could be used to record that a certain person was born on a certain day of a certain month of a certain year. When the term 'tuple" is used by itself without specifying the number of objects, in programming the automated processing of ordered word processing lists, the general assumption is that n=2, the ordered alphanumeric character or symbol and the text or other content that follows in the same paragraph. See Wikipedia article.
- user agent
- In a broad sense, an entity that acts on behalf of the user in a software application process. More narrowly, a client application used with a particular network protocol. Still more narrowly in the W3C recommendations context, an implementation of a specification that retrieves data across a network and processes compound documents.
- validation
- The process of verifying whether an instance of a data format or communications protocol actually follows the rules for the language(s) it uses is called validation, and the automated tool used for that purpose is called a validator. A signal or data that passes this process with success is said to be valid or validated. Validation against an incomplete or insufficient set of criteria can lead to a state of validation that does not confer the confidence that the term intends. Thus validation of the validation criteria is an important aspect that is often overlooked.
- vendor lock-in
- In the information technology sector, the design and implementation of a product in such a manner that barriers are created for users wishing to switch to a competing product. Often accomplished by using unique data formats or communications protocols protected from use by competitors by one or more patents. The corollary is that other vendors are "locked out" from developing interoperable applications.
- vendor lock-out
- Manipulation of data formats or communications protocols to deny competitors' applications the ability to interoperate, a term that achieved some notoriety in a Bill Gates internal speech quoted by the Court in Commission v. Microsoft, ¶ 771: "‘What we are trying to do is use our server control to do new protocols and lock out Sun and Oracle specifically ..."
- W3C
- World Wide Web Consortium. An international consortium that develops information technology standards ("recommendations") with a particular emphasis on standards development for the World Wide Web. W3C is generally regarded as the leading standards organization in regard to development and maintenance of XML standards. See About the World Wide Web Consortium.
- WICD
- Web Integration Compound Document. WICD profiles describe rules for combining various combinations of XML and related markup languages into a compound document. WICD profiles all depend on W3C CDRF, a compound document interoperability framework standard. As of this writing, three WICD profiles have been published as candidate recommendations: [i] WICD Core 1.0; [ii] WICD Mobile 1.0; and [ii] WICD Full 1.0. WICD Mobile supersets WICD Core and WICD Full supersets WICD Mobile. WICD Full is targeted at desktop user agents.
- XML
- Extensible Markup Language. XML is a relatively simple, very flexible standard for creating markup languages that describe the structure of data, a metalanguage for the specification of markup languages. There are thousands of derivatives, generically referred to as XML languages. XML and its derivatives have as particular hallmarks their human legibility, separation of presentation and data, easy transformability to other XML languages, and ease of repurposing and reformatting data extracted using a parser. See generally, W3C XML activities web site.
- XML transformation language
- A computer language designed specifically for transforming an input XML document into an output format, usually another XML markup language. See generally Wikipedia article for a list of XML transformation languages.
