Pedigree Import. IBP Activity 2.2.2, Subactivity 2: Develop Genealogy Manager Application. Principal Investigator: Mylah Anacleto, IRRI. Presenter: Alex Cañeda, IRRI

Upload: deasia-nunn

Post on 02-Apr-2015


TRANSCRIPT

Page 1:

Pedigree Import

IBP Activity 2.2.2, Subactivity 2: Develop Genealogy Manager Application

Principal Investigator: Mylah Anacleto, IRRI

Presenter: Alex Cañeda, IRRI

Page 2:

Pedigree Import

• Import of germplasm entries using a predefined file format containing pedigree strings

• Software allows the user to verify whether the entries in the imported file exist in the database

• Software parses the pedigree strings based on the selected rice nomenclature rules and name standardization

• If a pedigree string is not recognized, the user can edit portions of it, check the resulting split for correctness, and apply the changes back to the pedigree entry

Page 3:

Pedigree Import

• The target end users of the application are the data managers and data curators whose tasks include the bulk loading of historical pedigree entries

Page 4:

Timeline and milestones

• October 2013 – Beta version

• January 2014 – Release candidate 1

• June 2014 – Release candidate 2

Page 5:

Timeline and milestones

Page 6:

Collection of nomenclature rules for Rice

Page 7:

Collection of nomenclature rules for Rice

Page 8:

Sample input pedigree strings

"IR" cross number designation: for all crosses made in IRRI; assigned by the database administrator of PBGB

• IR 88888 is an F2 plant.

• IR 88888-21 is an F3: the 21st selection from the IR 88888 F2 population.

• IR 88888-21-2 is an F4: the 2nd selection from IR 88888-21 (F3).

• IR 88888-21-2-2 is an F5: the 2nd selection from IR 88888-21-2 (F4).

• IR 88888-21-2-2-2 is an F6: the 2nd selection from IR 88888-21-2-2 (F5).
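The naming scheme on this slide implies a simple rule: the bare cross number is the F2, and each dash-separated selection number adds one generation. The following is a minimal sketch of that rule in Java, not the project's parser. The class name `IrDesignation`, its methods, and the regular expression are assumptions made for illustration; real IRRI names have more variants than this pattern accepts.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class IrDesignation {
    // Assumed shape: "IR", one optional space, a cross number, then zero or
    // more "-<selection number>" parts.
    private static final Pattern IR_NAME =
            Pattern.compile("IR ?(\\d+)((?:-\\d+)*)");

    /** Filial generation (2 for F2, 3 for F3, ...), or -1 if the name does not fit the pattern. */
    static int generation(String name) {
        Matcher m = IR_NAME.matcher(name);
        if (!m.matches()) {
            return -1;
        }
        int selections = 0;
        for (char c : m.group(2).toCharArray()) {
            if (c == '-') {
                selections++;
            }
        }
        // The bare cross number is the F2; each selection adds one generation.
        return 2 + selections;
    }

    /** The line this one was selected from, or null for a bare cross number or a non-matching name. */
    static String source(String name) {
        if (generation(name) <= 2) {
            return null;
        }
        return name.substring(0, name.lastIndexOf('-'));
    }

    public static void main(String[] args) {
        String name = "IR 88888-21-2";
        System.out.println(name + " is an F" + generation(name)
                + " selected from " + source(name));
    }
}
```

For the slide's last example, `generation("IR 88888-21-2-2-2")` counts four selections and returns 6, which agrees with the F6 stated above.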

Page 9:

Features

• Loading of large volumes of historical germplasm data into the ICIS GMS database

• Parses cross history strings of unknown crosses and looks for the parents in the database

• The user controls the selection of parents found in the database, or creates a new entry for the parents

• Derivative names of parents are recognized by looking for a cross number followed by letters and numbers separated by dashes
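The last bullet describes a pattern match: a cross number followed by dash-separated parts made of letters and numbers. A hedged sketch of such a check in Java is shown here. The class `DerivativeName` and the exact regular expression are assumptions for illustration only; the slides do not give the pattern the application actually uses.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class DerivativeName {
    // Assumed pattern: a cross number (letter prefix, optional space, digits)
    // followed by one or more "-<letters or digits>" parts.
    private static final Pattern DERIVATIVE =
            Pattern.compile("([A-Z]+ ?\\d+)((?:-[A-Za-z0-9]+)+)");

    /** True if the name looks like a derivative of a cross. */
    static boolean isDerivative(String name) {
        return DERIVATIVE.matcher(name.trim()).matches();
    }

    /** Cross-number part of a derivative name, or null if the name is not a derivative. */
    static String crossNumber(String name) {
        Matcher m = DERIVATIVE.matcher(name.trim());
        return m.matches() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        String parent = "IR 88888-21-2";
        if (isDerivative(parent)) {
            // The cross number could then be used as the search key in the database.
            System.out.println("Cross number to look up: " + crossNumber(parent));
        }
    }
}
```

Extracting the cross number separately is one possible design: it gives a stable key for the database lookup even when the full derivative name has no entry yet.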

Page 10:

Features

• Pedigree Importer will use the same parsing algorithm that was developed for GMSInput, the Delphi-based ICIS GMS parser application

– Note: the parser was developed in Java based on the expected output specified in the guide on nomenclature rules; it is not certain that it is the same algorithm

• Basic data validation (for example, the parent should not be younger than its offspring) – not yet done

• The other features can be viewed on the IBP site
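The validation rule mentioned above (a parent should not be younger than its offspring) was not yet implemented at the time of the presentation. One way it could look is sketched here, assuming each germplasm entry carries a creation date. The `Entry` class and its fields are hypothetical and do not reflect the ICIS GMS schema or the IBP middleware API.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class PedigreeDateCheck {
    /** Hypothetical minimal record: a name plus the date the germplasm was created. */
    static final class Entry {
        final String name;
        final LocalDate created;

        Entry(String name, LocalDate created) {
            this.name = name;
            this.created = created;
        }
    }

    /** Returns one message per parent whose creation date is after the offspring's. */
    static List<String> check(Entry offspring, List<Entry> parents) {
        List<String> problems = new ArrayList<>();
        for (Entry parent : parents) {
            if (parent.created.isAfter(offspring.created)) {
                problems.add(parent.name + " is younger than its offspring " + offspring.name);
            }
        }
        return problems;
    }

    public static void main(String[] args) {
        // Illustrative dates only; they are not taken from any database.
        Entry offspring = new Entry("IR 88888-21", LocalDate.of(2010, 6, 1));
        Entry parent = new Entry("IR 88888", LocalDate.of(2011, 1, 15));
        for (String problem : check(offspring, Arrays.asList(parent))) {
            System.out.println(problem);
        }
    }
}
```

Returning a list of messages instead of throwing on the first problem lets a data curator review all suspicious entries of a bulk load in one pass.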

Page 11:

Technologies used

• Web-based: the GUI was written in the PHP programming language

• Java Web Services (Jersey)

• Web Servers: Apache Tomcat and Apache httpd

• Database: MySQL while testing the IBP middleware, but will eventually need to use PostgreSQL

Page 12:

GUI

• Home page

Page 13:

GUI

• The file is uploaded. Pedigree strings that are not in the standardized format are shown in red.

Page 14:

GUI

• Pedigree strings are standardized after clicking the Standardize button. Unrecognized patterns are shown in red. Click a pedigree string to edit it.

Page 15:

GUI

• Edit Germplasm Name

Page 16:

GUI

• Changes are applied after editing germplasm names with unrecognized patterns.

Page 17:

GUI

• Create GIDs.

• After clicking the Create GID button, the link to display the strings created with GIDs is shown.

Page 18:

GUI

• Sample of the Create GIDs page.

Page 19:

Features: string parsers

Pedigree string parser

• Identify tokens using ‘–’ as delimiter

For each token:

• Name standardization

• Use of Regex libraries

• Check spaces and patterns

• Check cross notations (single cross, compound cross, backcross)
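The parser steps listed on this slide can be sketched roughly as follows. This is an illustration under stated assumptions, not the application's code: the class `PedigreeTokens`, the standardization rule (upper case, single spaces, one space between the letter prefix and the cross number), and the cross notation (`/` for a single cross, `//` or `/n/` for a compound cross, `*` for a backcross) are all assumed here. The nomenclature guide the slides refer to may define these differently.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

final class PedigreeTokens {
    enum CrossType { NOT_A_CROSS, SINGLE, COMPOUND, BACKCROSS }

    /** Splits on hyphens (or en dashes) and standardizes each token. */
    static List<String> tokens(String pedigree) {
        List<String> out = new ArrayList<>();
        for (String raw : pedigree.split("[-\u2013]")) {
            String t = raw.trim().toUpperCase(Locale.ROOT).replaceAll("\\s+", " ");
            if (out.isEmpty()) {
                // Assumed rule: one space between the letter prefix and the cross number.
                t = t.replaceAll("^([A-Z]+) ?(\\d+)$", "$1 $2");
            }
            out.add(t);
        }
        return out;
    }

    /** True if the first token looks like a cross number and the rest are alphanumeric. */
    static boolean isRecognized(List<String> tokens) {
        if (tokens.isEmpty() || !tokens.get(0).matches("[A-Z]+ \\d+")) {
            return false;
        }
        for (String t : tokens.subList(1, tokens.size())) {
            if (!t.matches("[A-Z0-9]+")) {
                return false;
            }
        }
        return true;
    }

    /** Classifies a cross string using the assumed notation described above. */
    static CrossType crossType(String cross) {
        if (!cross.contains("/")) {
            return CrossType.NOT_A_CROSS;
        }
        if (cross.contains("*")) {
            return CrossType.BACKCROSS;
        }
        // "//" or "/3/" style markers indicate more than one crossing step.
        if (cross.matches(".*(//|/\\d+/).*")) {
            return CrossType.COMPOUND;
        }
        return CrossType.SINGLE;
    }

    public static void main(String[] args) {
        List<String> t = tokens("ir88888 -21-2");
        System.out.println(t + " recognized: " + isRecognized(t));
        System.out.println("A/B//C is " + crossType("A/B//C"));
    }
}
```

A string that fails `isRecognized` would correspond to the entries the GUI shows in red and offers for manual editing.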

Page 20:

Interaction of application components

Components shown in the diagram:

• Pedigree Import GUI

• Apache Web Server

• RESTful Web Services

• Pedigree Import Web Services Component (pedigree string parsers)

• IBP middleware (.jar file): create germplasm, search germplasm

• IRIS GMS

Data exchanged between components (arrow labels in the diagram):

• Pedigree string(s) to process

• Standardized name(s); filename (where result was stored)

• Matching germplasm, created GIDs; filename (where result was stored)
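The interaction in this diagram (the GUI sends pedigree strings, the service standardizes them, and the middleware searches for or creates germplasm) can be sketched in plain Java. Everything named here is hypothetical: `ImportFlowSketch`, the `Middleware` interface, and the in-memory stub are stand-ins and not the real IBP middleware API. The sketch also simplifies two things on purpose: it picks an existing match automatically, whereas the application lets the user choose, and it returns the result directly instead of storing it in a file and passing back the filename.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ImportFlowSketch {
    /** Hypothetical stand-in for the two middleware operations named on the slide. */
    interface Middleware {
        Integer searchGermplasm(String name); // GID of a match, or null

        int createGermplasm(String name);     // GID of the new entry
    }

    /** In-memory stub so the sketch runs without a database. */
    static final class InMemoryMiddleware implements Middleware {
        private final Map<String, Integer> gidByName = new LinkedHashMap<>();
        private int nextGid = 1;

        @Override
        public Integer searchGermplasm(String name) {
            return gidByName.get(name);
        }

        @Override
        public int createGermplasm(String name) {
            int gid = nextGid++;
            gidByName.put(name, gid);
            return gid;
        }
    }

    /** Service step: standardize each string, then reuse a matching GID or create one. */
    static Map<String, Integer> process(List<String> pedigreeStrings, Middleware middleware) {
        Map<String, Integer> result = new LinkedHashMap<>();
        for (String raw : pedigreeStrings) {
            // Minimal standardization for the sketch: trim and collapse spaces.
            String name = raw.trim().replaceAll("\\s+", " ");
            Integer gid = middleware.searchGermplasm(name);
            if (gid == null) {
                gid = middleware.createGermplasm(name);
            }
            result.put(name, gid);
        }
        return result;
    }
}
```

Keeping the middleware behind an interface mirrors the layering in the diagram: the parsing component does not need to know whether germplasm is stored in MySQL, PostgreSQL, or a test stub.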

Page 21:

With availability of IBP Web services

Components shown in the diagram:

• Pedigree Import GUI

• Apache Web Server

• RESTful Web Services (shown twice in the diagram)

• Pedigree Import Web Services Component (pedigree string parsers)

• IBP middleware: create germplasm, search germplasm

• IRIS GMS

*Only 1 deployment in IRRI

Page 22:

Next module

Pedigree Editor

• To communicate with IBP/Efficio on middleware requirements

• Scope is limited to pedigree/genealogy editor needs (limited germplasm editing): to be able to edit the information uploaded using the Pedigree Import tool.

Page 23:

Project Information

Software developer: Nikki Carumba

Nikki is part of the Breeding Information Management group of IRRI-PBGB

With user inputs from IRRI:

Dr. Ruaraidh Sackville-Hamilton

William Eusebio

Page 24:

Thank you!