BIRD Development
Contents
Origin BIRD System
BIRD was based on main principe of Saada project [1].
SAADA - Systèm d’Archivage Automatique des Données Astronomiques First Goal : Archive & Exploitation of Data of the European XMM Newton satellite [2].The 2XMM catalogue of X-ray sources, the largest of its kind ever, has now been released. In a PhD Framework (2002-2005, Prototype Saada V.1.3) of Dr.NGUYEN at University of Strasbourg I,Supported by the CNES[www.cnes.fr] and the Alsace Region, Supervised by Dr. Michel and Dr.Motch.
Conceptual Data Model
In order to automatically integrate heterogeneous data, we have designed several business data model corresponding to the real format of the data banks. Figure below illustrates the conceptual data model of the BIRD system. It can simultaneously host several bank types. Each type can itself concern several User Defined banks having the same format. Thanks to this conceptual model, BIRD can host different versions of a given data bank and manage them so that the programs launched on grid computing within an application can exploit the same data version during their computation time.
The data model of a data bank is predefined in an XML configuration file . This metadata is used to create the Java and SQL code. The code generation is launched at the moment of the configuration of the BIRD data bank or at data loading or reloading. In the example given figure 3, the metadata of Genbank are used to create Genbank-EST, Genbank-Refseq.
This Figure illustrates the business model for Genbank. Each bank can have several associated entries. Each entry has its associated information like Dbref, SEQData, FTSource,... According to our design, the Java classes of the business models are automatically generated by BIRD. Only instances of classes inherited from super ObjectPersistence will be recognized by the BIRD API. This super class contains common attributes and methods for all generated classes.
Query Engine
Data Integration
The creation of a database goes through some principal stages : Initially the relational schema system (meta-model) is created when BIRD is installed. In the second phase, the configuration module creates the business data model including SQL and Java codes corresponding to the predefined metadata given by the XML configuration. Then, the system analyzes some integration rules to select the data files and proceed to their conversion and loading into the relational tables .
Architecture
Key Technologies
Relational Core store
IBM DB2 WareHouse V9.1 WebSphere Federation Server
WEB Server & Services
IBM WebSphere Application Server ( main Portal) Tomcat Server (services, non graphic) Hibernate and JSF-Java Server Face Object Relational Mapping Web component
XML & JAVA
Project Distribution
Not net to public