What is OpenII?
OpenII (pronounced open-eye-eye) is a freely downloadable, open source information integration (II) tool suite. It includes 1) an extensible, plug-and-play platform for II tools and 2) several tools that assist with common integration tasks, including fully- or semi-automated support in the following scenarios:

  • An integration engineer building a data warehouse must determine how diverse component data schemas map to the schema of the warehouse.
  • An XML document that conforms to one schema needs to be converted into an equivalent document that conforms to a second (different) schema.
  • To support data exchanges, a community needs to create a shared data model based on the models of its members. When a new member joins, the community needs to identify promising data exchange partners, and to what extent its shared model needs to be extended. Similarly, a chief information officer must identify data integration opportunities and make level-of-effort estimates after an acquisition or merger.
To support these scenarios, OpenII provides a schema repository into which diverse data models can easily be imported. It also provides tools that 1) assist with identifying semantic correspondences across data models (Harmony), 2) compare a set of data models against a common reference model (Proximity), 3) visually organize a set of data models into clusters of related data models (Affinity), and 4) establish a common data model for a set of inter-related data models (Unity).

What are the benefits of OpenII?
Why should I use OpenII? Here are some reasons:
  • OpenII is the only open-source platform for information integration tools. OpenII and its source code are freely available using the Apache 2.0 license, so you are free to borrow, extend or resell any portions of OpenII.
  • The OpenII schema and mapping repository is based on a neutral modeling language. Thus, all of the OpenII tools can be used regardless of the underlying modeling language. For example, Harmony can identify correspondences among an XML schema, a relational database, and an OWL ontology. By comparison, most commercial tools are tied to a particular modeling language.
  • OpenII is based on the Eclipse framework. As a result, the environment is already familiar to many programmers. Non-programmers can choose, instead, to use OpenII off-the-shelf without needing to first install Eclipse.
  • OpenII is fully extensible. If needed components do not exist, they can be readily added. For example, adding a new importer or exporter is a straightforward task that can be completed in only a few hours. Moreover, each of the tools supports the introduction of new algorithms. And, programmers familiar with the Eclipse environment can add new views with moderate effort.

Data models