sonia phillips phd supervisors: h van heerden ia dubery molecular characterization of...

Post on 01-Apr-2015

218 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Sonia Phillips

PhD Supervisors: H van Heerden IA Dubery

Sonia Phillips

PhD Supervisors: H van Heerden IA Dubery

Molecular Characterization of Elicitor-responsive Genes in

Cotton

Background Cotton is a multi-billion rand industry (> $ 500 billion) and Gossypium hirsutum accounts for > 95%

cotton fibre

Verticillium wilt = vascular plant disease caused by Verticillium dahliae, a destructive soil-borne, multi-host fungus (> 400 plant species afflicted)

Cotton production severely hampered by V. dahliae: in SA yield losses are around 30 % and can be as high as 70 %

Symptoms:

Wilting

Discolouration

Stunted growth

Defoliation

Smaller fruits

Poor fiber quality

Ultimately - death

Soil fumigation and fungicides are costly and have a negative ecological and health impact – alternative eco-friendly control measures are required: need to investigate the genetic components of plant immunity to V. dahliae

DDRT-PCR with V. dahliae elicitor in cotton (Michelle Zwiegellar, 2003, MSc dissertation) ID several ESTs differentially induced upon treatment of cell suspensions with V. dahliae elicitor

Objectives

Obtain, analyze and characterize the full genomic sequences (+ promoter) of 3 genes represented by ESTs ID in DDRT-PCR study by genome walking, RACE, PCR, RT-PCR, bioinformatics analysis and Southern blots:

C4B5 (kinase?)

C4B4 (arm repeat?)

C1B10 (?)

Elicitation studies: Isolate elicitor from V. dahliae cell wall

Investigate if CWD elicitor induces PR protein production

Viability studies to determine appropriate elicitor concentrations for expression analysis

RT-qPCR to determine the time frame and kinetics of induction with CWD elicitor

Molecular characterization and genomic assignment of GhLIPN homoeologs (GhLIPN N and I)

GhLRK = Lectin receptor-like kinase (RLK) gene

GhARM = Armadillo-repeat gene

GhLIPN = Lipin gene

Polyploidy in cotton

AADD tetraploid2n = 521 – 2 mya

G. hirsutum

G. darwinii

G. barbadense

G. mustelinum

G. tomentosum

AA diploid 2n = 26represented byG. herbaceum and G. arboreum

DD diploid 2n = 26represented byG. raimondii

Common ancestor5 – 10 mya

Hybridization and Endoreduplication

New allotetraploid species

B

Endoreduplication

New autotetraploid species

A

Autoployploidy Alloployploidy

Genome Walking

Genomic DNA digested with four different restriction enzymes to obtain a genomic library from each

Genomic DNA isolated from the young leaves of Gossipium hirsutum cv. ‘OR-19’

Adapters ligated to digested fragments. Primary PCR with first gene-specific primers and adaptor primer 1

SP 1

AP 1

Secondary or ‘nested’ PCR with the second gene-specific primers and adaptor primer 2

SP 2

AP 2

Major bands cut from the gel, cloned, sequenced and characterized

5` RACE

PCR product ready for agarose gel analysis and cloning

SP 1

(A)n 3`5`Synthesis of first strand cDNA with gene-specific primer SP 1 and reverse transcriptase

mRNA

(A)n 3`3`

cDNA

Degradation of the mRNA template by the RNase H activity of reverse transcriptase

3` (A)nAAA

Tailing of the purified cDNA with dATP by terminal transferase

Purify cDNA

Amplification of the tailed cDNA by primary PCR using the oligo (dT)-anchor primer and a nested gene-specific primer SP2

(A)nAAAA (T)nTTTTV

SP 2

Oligo dT-anchor primer

*V = A, C or G

SP 3

Inner PCR anchor primer

Secondary nested PCR with the PCR anchor primer and a SP 3 primer

3` RACE

Amplification of the cDNA by PCR using gene-specific primers SP 4 and the outer PCR anchor primer

SP 4

Outer PCR anchor primer

*V = A, C or G

cDNA synthesis using the oligo (dT)- anchor primer by reverse transcriptase

5`mRNA AAAAAAAVTTTT

3`

Degradation of the mRNA by the RNase H activity of reverse transcriptase

cDNA

TTTT

PCR product ready for agarose gel analysis and cloning

SP 5

Inner PCR anchor primer

Nested secondary PCR, with the inner PCR anchor primer and gene-specific primer SP5

Viability Study

0

10

20

30

40

50

60

70

80

90

100

110

-2.5 0 2.5 5 7.5 10 12.5 15 17.5 20

V. dahliae elicitor (g/ml)

% V

iabi

lity

Genome Walking: GhLRK

2171 – 2501 bp

C4B5 EST

A

Stu

I

Dra

I

Eco

RI

PvuII

M

B

Stu

I

Dra

I

Eco

RI

PvuII

MUpstream from EST

Primary PCR Secondary/nested PCR

2240 bp

Stu

I

Dra

I

Eco

RI

PvuII

M

A B

Stu

I

Dra

I

Eco

RI

PvuII

M

Downstream from EST

Primary PCR Secondary/nested PCR

657 bp

1080 bp

1 2

1259 - 2339

5` RACE: GhLRK

21

255 bp

312 - 567

3` RACE: GhLRK

279 bp

132 bp2306 – 2583 bp

2306 – 2436 bp

2171 – 2501 bp

C4B5 EST

1 2

Genomic analysis: GhLRK

1: SacI2: XbaI3: HindIII4: EcoRI

2277 bp

2007 bp30 bp 240 bp

309 bp2171 – 2501 bp

Full genomic sequence obtained from genome walking: 3011 bp

Promoter analysis: GhLRK-309 + AAAGAAATTT TGAAATTTGT CGCAACATTC TAATCCCTCT TTACATGATA TTATTATGCT CTATTTTCAA CCCAAAATAT TTATTAGGGT GATTAAATAA AAAAAAGGAC - TTTCTTTAAA ACTTTAAACA GCGTTGTAAG ATTAGGGAGA AATGTACTAT AATAATACGA GATAAAAGTT GGGTTTTATA AATAATCCCA CTAATTTATT TTTTTTCCTG -199 + ATTCCGTGAT AAATTACTAA TGGGTCAATA TTTCAAGTTA GTACCTAAAA TTCAAACCCC ACTTCCCCAC GGCAACTGCC CAGATCATAT GTTTTGAGCC AATGCCATTT - TAAGGCACTA TTTAATGATT ACCCAGTTAT AAAGTTCAAT CATGGATTTT AAGTTTGGGG TGAAGGGGTG CCGTTGACGG GTCTAGTATA CAAAACTCGG TTACGGTAAA -89 + GATCAGTGTA TGACTTTGAA ATGTTCAAGT CATCCATTGG GGACTAATTG TTTCTATTCA CATATTTCTT ATGCTATATT ATTTTTTAG* - CTAGTCACAT ACTGAAACTT TACAAGTTCA GTAGGTAACC CCTGATTAAC AAAGATAAGT GTATAAAGAA TACGATATAA TAAAAAATC

Elicitor/pathogen-responsive

-309 + AAAGAAATTT TGAAATTTGT CGCAACATTC TAATCCCTCT TTACATGATA TTATTATGCT CTATTTTCAA CCCAAAATAT TTATTAGGGT GATTAAATAA AAAAAAGGAC - TTTCTTTAAA ACTTTAAACA GCGTTGTAAG ATTAGGGAGA AATGTACTAT AATAATACGA GATAAAAGTT GGGTTTTATA AATAATCCCA CTAATTTATT TTTTTTCCTG -199 + ATTCCGTGAT AAATTACTAA TGGGTCAATA TTTCAAGTTA GTACCTAAAA TTCAAACCCC ACTTCCCCAC GGCAACTGCC CAGATCATAT GTTTTGAGCC AATGCCATTT - TAAGGCACTA TTTAATGATT ACCCAGTTAT AAAGTTCAAT CATGGATTTT AAGTTTGGGG TGAAGGGGTG CCGTTGACGG GTCTAGTATA CAAAACTCGG TTACGGTAAA -89 + GATCAGTGTA TGACTTTGAA ATGTTCAAGT CATCCATTGG GGACTAATTG TTTCTATTCA CATATTTCTT ATGCTATATT ATTTTTTAG* - CTAGTCACAT ACTGAAACTT TACAAGTTCA GTAGGTAACC CCTGATTAAC AAAGATAAGT GTATAAAGAA TACGATATAA TAAAAAATC

Myb/Myc TF-binding

-309 + AAAGAAATTT TGAAATTTGT CGCAACATTC TAATCCCTCT TTACATGATA TTATTATGCT CTATTTTCAA CCCAAAATAT TTATTAGGGT GATTAAATAA AAAAAAGGAC - TTTCTTTAAA ACTTTAAACA GCGTTGTAAG ATTAGGGAGA AATGTACTAT AATAATACGA GATAAAAGTT GGGTTTTATA AATAATCCCA CTAATTTATT TTTTTTCCTG -199 + ATTCCGTGAT AAATTACTAA TGGGTCAATA TTTCAAGTTA GTACCTAAAA TTCAAACCCC ACTTCCCCAC GGCAACTGCC CAGATCATAT GTTTTGAGCC AATGCCATTT - TAAGGCACTA TTTAATGATT ACCCAGTTAT AAAGTTCAAT CATGGATTTT AAGTTTGGGG TGAAGGGGTG CCGTTGACGG GTCTAGTATA CAAAACTCGG TTACGGTAAA -89 + GATCAGTGTA TGACTTTGAA ATGTTCAAGT CATCCATTGG GGACTAATTG TTTCTATTCA CATATTTCTT ATGCTATATT ATTTTTTAG* - CTAGTCACAT ACTGAAACTT TACAAGTTCA GTAGGTAACC CCTGATTAAC AAAGATAAGT GTATAAAGAA TACGATATAA TAAAAAATC

Dof TF-binding

-309 + AAAGAAATTT TGAAATTTGT CGCAACATTC TAATCCCTCT TTACATGATA TTATTATGCT CTATTTTCAA CCCAAAATAT TTATTAGGGT GATTAAATAA AAAAAAGGAC - TTTCTTTAAA ACTTTAAACA GCGTTGTAAG ATTAGGGAGA AATGTACTAT AATAATACGA GATAAAAGTT GGGTTTTATA AATAATCCCA CTAATTTATT TTTTTTCCTG -199 + ATTCCGTGAT AAATTACTAA TGGGTCAATA TTTCAAGTTA GTACCTAAAA TTCAAACCCC ACTTCCCCAC GGCAACTGCC CAGATCATAT GTTTTGAGCC AATGCCATTT - TAAGGCACTA TTTAATGATT ACCCAGTTAT AAAGTTCAAT CATGGATTTT AAGTTTGGGG TGAAGGGGTG CCGTTGACGG GTCTAGTATA CAAAACTCGG TTACGGTAAA -89 + GATCAGTGTA TGACTTTGAA ATGTTCAAGT CATCCATTGG GGACTAATTG TTTCTATTCA CATATTTCTT ATGCTATATT ATTTTTTAG* - CTAGTCACAT ACTGAAACTT TACAAGTTCA GTAGGTAACC CCTGATTAAC AAAGATAAGT GTATAAAGAA TACGATATAA TAAAAAATC

Hormone-responsive

Lectin domain Ser/Thr Protein Kinase domainTMS P

Protein analysis: GhLRK

Predicted protein structure: GhLRK

Predicted carbohydratebinding site

Additional loopN-ter

N28

N52

C-ter

Activation segment

C-ter

N-ter

N577

Catalytic loop

Phosphate anchor

Lectin domain Protein kinase domainHomology to legume lectin from bark of Robinia pseudoacacia: 30% ID, E-score = 9.8e-25, EP = 100%

Homology to PK domain from AvrPto: 33% ID; E-score = 1e-35, EP = 100%

RT-qPCR: GhLRK

0 2 4 6 8 100.000

1.000

2.000

3.000

4.000

5.000

6.000

7.000

8.000

9.000

10.000

Time (h)

Rela

tive

conc

entr

ation *

*

**

*

* p-value > 0.05

Genome Walking: GhARMC4B4 EST

2759 - 3218

Stu

ID

raI

Eco

RI

PvuII

MB

Stu

I

Dra

I

Eco

RI

PvuII

MA

Secondary/nested PCRPrimary PCR

Downstream from EST

1078 bp

BStu

I

Dra

I

Eco

RI

PvuII

MA

Stu

I

Dra

I

Eco

RI

Pvu

II

M

Secondary/nested PCRPrimary PCR

Upstream from EST

2883 bp

5` RACE: GhARM

2

2

1

1

832 bp

2202 - 3034

2126 - 2321

197 bp

1

3` RACE: GhARM

1

2

2

1

2 1

2595 - 2969

2512 - 3213

3107 - 3213

367 bp

402 bp231 bp

1762 bp

C4B4 EST

2759 - 3218

Genomic analysis: GhARM

Retrotransposon(777 bp)

1: SacI2: XbaI3: EcoRI4: HindIII

2759 – 3218 bp

1780 bp

1713 bp67 bp

986 bp

Full genomic sequence obtained from genome walking: 4131 bp

-986 + GAATAA TAGCAGTTAA CAGTTGCAGT TAACAATTAA GGAAGTTAAG TTTTCTGTTA CAGTTGGTTA GCTAGACTCT TTATATACCT - CTTATT ATCGTCAATT GTCAACGTCA ATTGTTAATT CCTTCAATTC AAAAGACAAT GTCAACCAAT CGATCTGAGA AATATATGGA -900 + AAAGTTGTAT TCATTCGATA CAATGAAAAT ATAGAAGTAA TTCAGTTTCT ACAGCTTGTT TGCTGAGATT TAACAAGCAA GCACAGCCAA ATGCGGAGTG - TTTCAACATA AGTAAGCTAT GTTACTTTTA TATCTTCATT AAGTCAAAGA TGTCGAACAA ACGACTCTAA ATTGTTCGTT CGTGTCGGTT TACGCCTCAC -800 + AGTGAGGCTC GGCGCAGTTA TAGGAAACAA AGTCTAAATT TGATACAGTA AAAAGAGGCT AAAAAAGCGG GTCCCAAGAA GCAAGAGTAC AGCGACATCA - TCACTCCGAG CCGCGTCAAT ATCCTTTGTT TCAGATTTAA ACTATGTCAT TTTTCTCCGA TTTTTTCGCC CAGGGTTCTT CGTTCTCATG TCGCTGTAGT -700 + CATTCATCTA GGAAAAAGAA ACATTATAAA AATATACTAC TTGACTACTC CAACTCTAGA TTTATCCACC CCCAAGGATT GAGTGGTCCA TTTTTTCTTT - GTAAGTAGAT CCTTTTTCTT TGTAATATTT TTATATGATG AACTGATGAG GTTGAGATCT AAATAGGTGG GGGTTCCTAA CTCACCAGGT AAAAAAGAAA -600 + CATCCCCCCT CCCCAACACC TTCACATGCT AATGCTTTTG CTTCATGCTC ATTTCCTCCC TTTTCTTTAA TCTTTGATGT ACAGTTCAAT TAGTGGTCGT - GTAGGGGGGA GGGGTTGTGG AAGTGTACGA TTACGAAAAC GAAGTACGAG TAAAGGAGGG AAAAGAAATT AGAAACTACA TGTCAAGTTA ATCACCAGCA -500 + TATTTCCAAT ATATACAGTG AAAAACTGAA AATTACCCTA CCAATGCCTG CAGTCACTAT GTATCAATAT GCCTGCTAAG AGTTAAATGC TAAATGCTAA - ATAAAGGTTA TATATGTCAC TTTTTGACTT TTAATGGGAT GGTTACGGAC GTCAGTGATA CATAGTTATA CGGACGATTC TCAATTTACG ATTTACGATT -400 + GCAAACTGTT CCACTTTTCT TTTTTTATTT ATCGTTTAAT AGTGTTGGTA AAAGGTGATA AATAATGTGC CCCTTATACC CTTTTATTTA AAGCTGCTGT - CGTTTGACAA GGTGAAAAGA AAAAAATAAA TAGCAAATTA TCACAACCAT TTTCCACTAT TTATTACACG GGGAATATGG GAAAATAAAT TTCGACGACA -300 + GTGAACAAGC TGTAAAGCTT CATGCTGCCT CCTCTATTGC TTCTATCATT ACCTTTCACA ATTGTGTTCA GCCTTTTCCC TCTCTCATCT CTCGCGCCAG - CACTTGTTCG ACATTTCGAA GTACGACGGA GGAGATAACG AAGATAGTAA TGGAAAGTGT TAACACAAGT CGGAAAAGGG AGAGAGTAGA GAGCGCGGTC -200 + ATACAACCAA CATTACTAAT TTACTGTTAA TGTTATTGGT ATTTCTATTT TTTTCGAATC CAATGTTATT TGCCCCGTGA ACTCAGTCCA GTACTGGACT - TATGTTGGTT GTAATGATTA AATGACAATT ACAATAACCA TAAAGATAAA AAAAGCTTAG GTTACAATAA ACGGGGCACT TGAGTCAGGT CATGACCTGA -100 + TGTTTTTTCA TTATTGGAAC TGTCTTTCTC TCCATTTCCT TGGCTCCCCT TTTCCGTCCA ACCAAAGTGC CTTTTTTTTC CCAAGTCTTT TCATGTTTTT * - ACAAAAAAGT AATAACCTTG ACAGAAAGAG AGGTAAAGGA ACCGAGGGGA AAAGGCAGGT TGGTTTCACG GAAAAAAAAG GGTTCAGAAA AGTACAAAAA

Promoter analysis: GhARM

Elicitor/pathogen-responsive

Promoter analysis: GhARM

Hormone-responsive

-986 + GAATAA TAGCAGTTAA CAGTTGCAGT TAACAATTAA GGAAGTTAAG TTTTCTGTTA CAGTTGGTTA GCTAGACTCT TTATATACCT - CTTATT ATCGTCAATT GTCAACGTCA ATTGTTAATT CCTTCAATTC AAAAGACAAT GTCAACCAAT CGATCTGAGA AATATATGGA -900 + AAAGTTGTAT TCATTCGATA CAATGAAAAT ATAGAAGTAA TTCAGTTTCT ACAGCTTGTT TGCTGAGATT TAACAAGCAA GCACAGCCAA ATGCGGAGTG - TTTCAACATA AGTAAGCTAT GTTACTTTTA TATCTTCATT AAGTCAAAGA TGTCGAACAA ACGACTCTAA ATTGTTCGTT CGTGTCGGTT TACGCCTCAC -800 + AGTGAGGCTC GGCGCAGTTA TAGGAAACAA AGTCTAAATT TGATACAGTA AAAAGAGGCT AAAAAAGCGG GTCCCAAGAA GCAAGAGTAC AGCGACATCA - TCACTCCGAG CCGCGTCAAT ATCCTTTGTT TCAGATTTAA ACTATGTCAT TTTTCTCCGA TTTTTTCGCC CAGGGTTCTT CGTTCTCATG TCGCTGTAGT -700 + CATTCATCTA GGAAAAAGAA ACATTATAAA AATATACTAC TTGACTACTC CAACTCTAGA TTTATCCACC CCCAAGGATT GAGTGGTCCA TTTTTTCTTT - GTAAGTAGAT CCTTTTTCTT TGTAATATTT TTATATGATG AACTGATGAG GTTGAGATCT AAATAGGTGG GGGTTCCTAA CTCACCAGGT AAAAAAGAAA -600 + CATCCCCCCT CCCCAACACC TTCACATGCT AATGCTTTTG CTTCATGCTC ATTTCCTCCC TTTTCTTTAA TCTTTGATGT ACAGTTCAAT TAGTGGTCGT - GTAGGGGGGA GGGGTTGTGG AAGTGTACGA TTACGAAAAC GAAGTACGAG TAAAGGAGGG AAAAGAAATT AGAAACTACA TGTCAAGTTA ATCACCAGCA -500 + TATTTCCAAT ATATACAGTG AAAAACTGAA AATTACCCTA CCAATGCCTG CAGTCACTAT GTATCAATAT GCCTGCTAAG AGTTAAATGC TAAATGCTAA - ATAAAGGTTA TATATGTCAC TTTTTGACTT TTAATGGGAT GGTTACGGAC GTCAGTGATA CATAGTTATA CGGACGATTC TCAATTTACG ATTTACGATT -400 + GCAAACTGTT CCACTTTTCT TTTTTTATTT ATCGTTTAAT AGTGTTGGTA AAAGGTGATA AATAATGTGC CCCTTATACC CTTTTATTTA AAGCTGCTGT - CGTTTGACAA GGTGAAAAGA AAAAAATAAA TAGCAAATTA TCACAACCAT TTTCCACTAT TTATTACACG GGGAATATGG GAAAATAAAT TTCGACGACA -300 + GTGAACAAGC TGTAAAGCTT CATGCTGCCT CCTCTATTGC TTCTATCATT ACCTTTCACA ATTGTGTTCA GCCTTTTCCC TCTCTCATCT CTCGCGCCAG - CACTTGTTCG ACATTTCGAA GTACGACGGA GGAGATAACG AAGATAGTAA TGGAAAGTGT TAACACAAGT CGGAAAAGGG AGAGAGTAGA GAGCGCGGTC -200 + ATACAACCAA CATTACTAAT TTACTGTTAA TGTTATTGGT ATTTCTATTT TTTTCGAATC CAATGTTATT TGCCCCGTGA ACTCAGTCCA GTACTGGACT - TATGTTGGTT GTAATGATTA AATGACAATT ACAATAACCA TAAAGATAAA AAAAGCTTAG GTTACAATAA ACGGGGCACT TGAGTCAGGT CATGACCTGA -100 + TGTTTTTTCA TTATTGGAAC TGTCTTTCTC TCCATTTCCT TGGCTCCCCT TTTCCGTCCA ACCAAAGTGC CTTTTTTTTC CCAAGTCTTT TCATGTTTTT * - ACAAAAAAGT AATAACCTTG ACAGAAAGAG AGGTAAAGGA ACCGAGGGGA AAAGGCAGGT TGGTTTCACG GAAAAAAAAG GGTTCAGAAA AGTACAAAAA

Promoter analysis: GhARM

-986 + GAATAA TAGCAGTTAA CAGTTGCAGT TAACAATTAA GGAAGTTAAG TTTTCTGTTA CAGTTGGTTA GCTAGACTCT TTATATACCT - CTTATT ATCGTCAATT GTCAACGTCA ATTGTTAATT CCTTCAATTC AAAAGACAAT GTCAACCAAT CGATCTGAGA AATATATGGA -900 + AAAGTTGTAT TCATTCGATA CAATGAAAAT ATAGAAGTAA TTCAGTTTCT ACAGCTTGTT TGCTGAGATT TAACAAGCAA GCACAGCCAA ATGCGGAGTG - TTTCAACATA AGTAAGCTAT GTTACTTTTA TATCTTCATT AAGTCAAAGA TGTCGAACAA ACGACTCTAA ATTGTTCGTT CGTGTCGGTT TACGCCTCAC -800 + AGTGAGGCTC GGCGCAGTTA TAGGAAACAA AGTCTAAATT TGATACAGTA AAAAGAGGCT AAAAAAGCGG GTCCCAAGAA GCAAGAGTAC AGCGACATCA - TCACTCCGAG CCGCGTCAAT ATCCTTTGTT TCAGATTTAA ACTATGTCAT TTTTCTCCGA TTTTTTCGCC CAGGGTTCTT CGTTCTCATG TCGCTGTAGT -700 + CATTCATCTA GGAAAAAGAA ACATTATAAA AATATACTAC TTGACTACTC CAACTCTAGA TTTATCCACC CCCAAGGATT GAGTGGTCCA TTTTTTCTTT - GTAAGTAGAT CCTTTTTCTT TGTAATATTT TTATATGATG AACTGATGAG GTTGAGATCT AAATAGGTGG GGGTTCCTAA CTCACCAGGT AAAAAAGAAA -600 + CATCCCCCCT CCCCAACACC TTCACATGCT AATGCTTTTG CTTCATGCTC ATTTCCTCCC TTTTCTTTAA TCTTTGATGT ACAGTTCAAT TAGTGGTCGT - GTAGGGGGGA GGGGTTGTGG AAGTGTACGA TTACGAAAAC GAAGTACGAG TAAAGGAGGG AAAAGAAATT AGAAACTACA TGTCAAGTTA ATCACCAGCA -500 + TATTTCCAAT ATATACAGTG AAAAACTGAA AATTACCCTA CCAATGCCTG CAGTCACTAT GTATCAATAT GCCTGCTAAG AGTTAAATGC TAAATGCTAA - ATAAAGGTTA TATATGTCAC TTTTTGACTT TTAATGGGAT GGTTACGGAC GTCAGTGATA CATAGTTATA CGGACGATTC TCAATTTACG ATTTACGATT -400 + GCAAACTGTT CCACTTTTCT TTTTTTATTT ATCGTTTAAT AGTGTTGGTA AAAGGTGATA AATAATGTGC CCCTTATACC CTTTTATTTA AAGCTGCTGT - CGTTTGACAA GGTGAAAAGA AAAAAATAAA TAGCAAATTA TCACAACCAT TTTCCACTAT TTATTACACG GGGAATATGG GAAAATAAAT TTCGACGACA -300 + GTGAACAAGC TGTAAAGCTT CATGCTGCCT CCTCTATTGC TTCTATCATT ACCTTTCACA ATTGTGTTCA GCCTTTTCCC TCTCTCATCT CTCGCGCCAG - CACTTGTTCG ACATTTCGAA GTACGACGGA GGAGATAACG AAGATAGTAA TGGAAAGTGT TAACACAAGT CGGAAAAGGG AGAGAGTAGA GAGCGCGGTC -200 + ATACAACCAA CATTACTAAT TTACTGTTAA TGTTATTGGT ATTTCTATTT TTTTCGAATC CAATGTTATT TGCCCCGTGA ACTCAGTCCA GTACTGGACT - TATGTTGGTT GTAATGATTA AATGACAATT ACAATAACCA TAAAGATAAA AAAAGCTTAG GTTACAATAA ACGGGGCACT TGAGTCAGGT CATGACCTGA -100 + TGTTTTTTCA TTATTGGAAC TGTCTTTCTC TCCATTTCCT TGGCTCCCCT TTTCCGTCCA ACCAAAGTGC CTTTTTTTTC CCAAGTCTTT TCATGTTTTT * - ACAAAAAAGT AATAACCTTG ACAGAAAGAG AGGTAAAGGA ACCGAGGGGA AAAGGCAGGT TGGTTTCACG GAAAAAAAAG GGTTCAGAAA AGTACAAAAA

Dof TF-binding

-986 + GAATAA TAGCAGTTAA CAGTTGCAGT TAACAATTAA GGAAGTTAAG TTTTCTGTTA CAGTTGGTTA GCTAGACTCT TTATATACCT - CTTATT ATCGTCAATT GTCAACGTCA ATTGTTAATT CCTTCAATTC AAAAGACAAT GTCAACCAAT CGATCTGAGA AATATATGGA -900 + AAAGTTGTAT TCATTCGATA CAATGAAAAT ATAGAAGTAA TTCAGTTTCT ACAGCTTGTT TGCTGAGATT TAACAAGCAA GCACAGCCAA ATGCGGAGTG - TTTCAACATA AGTAAGCTAT GTTACTTTTA TATCTTCATT AAGTCAAAGA TGTCGAACAA ACGACTCTAA ATTGTTCGTT CGTGTCGGTT TACGCCTCAC -800 + AGTGAGGCTC GGCGCAGTTA TAGGAAACAA AGTCTAAATT TGATACAGTA AAAAGAGGCT AAAAAAGCGG GTCCCAAGAA GCAAGAGTAC AGCGACATCA - TCACTCCGAG CCGCGTCAAT ATCCTTTGTT TCAGATTTAA ACTATGTCAT TTTTCTCCGA TTTTTTCGCC CAGGGTTCTT CGTTCTCATG TCGCTGTAGT -700 + CATTCATCTA GGAAAAAGAA ACATTATAAA AATATACTAC TTGACTACTC CAACTCTAGA TTTATCCACC CCCAAGGATT GAGTGGTCCA TTTTTTCTTT - GTAAGTAGAT CCTTTTTCTT TGTAATATTT TTATATGATG AACTGATGAG GTTGAGATCT AAATAGGTGG GGGTTCCTAA CTCACCAGGT AAAAAAGAAA -600 + CATCCCCCCT CCCCAACACC TTCACATGCT AATGCTTTTG CTTCATGCTC ATTTCCTCCC TTTTCTTTAA TCTTTGATGT ACAGTTCAAT TAGTGGTCGT - GTAGGGGGGA GGGGTTGTGG AAGTGTACGA TTACGAAAAC GAAGTACGAG TAAAGGAGGG AAAAGAAATT AGAAACTACA TGTCAAGTTA ATCACCAGCA -500 + TATTTCCAAT ATATACAGTG AAAAACTGAA AATTACCCTA CCAATGCCTG CAGTCACTAT GTATCAATAT GCCTGCTAAG AGTTAAATGC TAAATGCTAA - ATAAAGGTTA TATATGTCAC TTTTTGACTT TTAATGGGAT GGTTACGGAC GTCAGTGATA CATAGTTATA CGGACGATTC TCAATTTACG ATTTACGATT -400 + GCAAACTGTT CCACTTTTCT TTTTTTATTT ATCGTTTAAT AGTGTTGGTA AAAGGTGATA AATAATGTGC CCCTTATACC CTTTTATTTA AAGCTGCTGT - CGTTTGACAA GGTGAAAAGA AAAAAATAAA TAGCAAATTA TCACAACCAT TTTCCACTAT TTATTACACG GGGAATATGG GAAAATAAAT TTCGACGACA -300 + GTGAACAAGC TGTAAAGCTT CATGCTGCCT CCTCTATTGC TTCTATCATT ACCTTTCACA ATTGTGTTCA GCCTTTTCCC TCTCTCATCT CTCGCGCCAG - CACTTGTTCG ACATTTCGAA GTACGACGGA GGAGATAACG AAGATAGTAA TGGAAAGTGT TAACACAAGT CGGAAAAGGG AGAGAGTAGA GAGCGCGGTC -200 + ATACAACCAA CATTACTAAT TTACTGTTAA TGTTATTGGT ATTTCTATTT TTTTCGAATC CAATGTTATT TGCCCCGTGA ACTCAGTCCA GTACTGGACT - TATGTTGGTT GTAATGATTA AATGACAATT ACAATAACCA TAAAGATAAA AAAAGCTTAG GTTACAATAA ACGGGGCACT TGAGTCAGGT CATGACCTGA -100 + TGTTTTTTCA TTATTGGAAC TGTCTTTCTC TCCATTTCCT TGGCTCCCCT TTTCCGTCCA ACCAAAGTGC CTTTTTTTTC CCAAGTCTTT TCATGTTTTT * - ACAAAAAAGT AATAACCTTG ACAGAAAGAG AGGTAAAGGA ACCGAGGGGA AAAGGCAGGT TGGTTTCACG GAAAAAAAAG GGTTCAGAAA AGTACAAAAA

Promoter analysis: GhARM

Myb/Myc TF-binding

Protein analysis: GhARM

SP ARM ARM ARM Armadillo-type fold AB C

Predicted protein structure: GhARM

• Homology to human β-catenin protein from ARM repeat family• 14% ID; E-score = 3.6e-25,

estimated precision = 100%

RT-qPCR: GhARM

0 0.5 1 2 4 6 8 100.000

0.200

0.400

0.600

0.800

1.000

1.200

Time (h)

Rela

tive

conc

entr

ation

* *

*

*

*

* p-value > 0.05

Genome Walking: GhLIPNC1B10 EST

6488 - 6704

B a

Stu

ID

raI

Eco

RI

PvuII

M SnaB

IS

spI

Pd

iIEco

47

III

M SacI

Xb

aI

Hin

dIII

Eco

RV

M

b

Stu

I

SacI

Xb

aI

Hin

dIII

Eco

RV

MSsp

I

Pd

iI

MDra

I

Eco

RI

PvuII

M Eco

47

III

SnaB

Secondary/nested PCRPrimary PCR

1462 bp

B

Stu

I

Dra

I

Eco

RI

PvuII

M

Secondary/nested PCR

A

Stu

I

Dra

I

Eco

RI

PvuII

M

Primary PCR

Downstream from EST

1649 bp

Upstream from EST

Stu

I

Dra

I

Eco

RI

PvuII

M Stu

I

Dra

I

Eco

RI

PvuII

M

aA b

Primary PCR Secondary/nested PCR

676 bp

5` RACE: GhLIPN

21

4

41290 – 3685 bp

1 2

33

3577 – 4073 bp

21

2

23877 – 5243 bp1 2

1

15805 – 6542 bp

3` RACE: GhLIPN

1 2

C1B10 EST

6488 - 6704

6458 - 6837

6458 - 6704

246 bp

379 bp

Genomic analysis: GhLIPN

75666-5738

9 6179-6307

11 6617-7021

106394-6492

86029-6072

65421-5507

44375-4448

54767-4851

33682-4280

22146-3602

11465-1633

1: XbaI 2: EcoRI3: HindIII

Genomic sequence GhLIPN N: 8 423 bpGhLIPN I: 6 704 bp

Transcribed sequenceGhLIPN N: 4 467 bpGhLIPN I: 4 479 bp

ORFBoth: 2 643 bp

PromoterGhLIPN N: 1 464 bpGhLIPN I: 1 289 bp

Several indels: mostly SNPs; 13 and 17bp

GhLIPN_I ccaaaaaataaaaaaataaaaattcaaagctaccagagaatcctaacagcgtcggatcgttg 62 1 GhLIPN_N ccaaaaaataaaaaaataaaaattcaaagctaccagagaatcctaacagcgtcggatcgttg 62 GhLIPN_I tat-----------------agtaaagagaatcttacaattgtttcgatttcgagctgccag 107 GhLIPN_N tatcatcattcatcgatcatagtaaagagaatcttacaattgttttgatttcgagctgccag 124 GhLIPN_I attcgttcatcgcgggaattttcttctttgcgcaattcttaatctgtaagcttaccgacttt 169 1 GhLIPN_N gttcgttcatcccgggaattttcttctttgtgcagttcttaatctgtaagtttactgacttt 186 GhLIPN_I gactgtatagttatattttagttaaattttgaaattagaaagttcgattttttttcccacaa 231 GhLIPN_N ggctgtatagttatattttagttcaattttgaaattagaaagttcgattttttctcc-acaa 247 GhLIPN_I ttctcgtcgtttaatctgtatgttatcaatcttgtttgattgcttttactgtatactggaca 293 GhLIPN_N ttctcgtcgtttaatctgtatgttatcgatcttgtttgattgcttttaccgtatactggaca 309 GhLIPN_I tttattttatgtttggttgctgagaaaattggaggttttttttcc---cttgtgtttctaaa 352 GhLIPN_N tttattttatgtttggttgctgagaaaattggaggtttttttccctttcttgtgtttctaaa 371 GhLIPN_I gtttaagcttttcttcttttacgataaatatgttttagcattctatattgggatcatacttc 414 GhLIPN_N gtttaagcttttcttcttttatgagaaatatgttttagcattctatattgggatcatacttc 433 GhLIPN_I atataatattcgttaaacttggatttctatatttggttccaagtggcagcttatagattccc 476 GhLIPN_N atataatattcgttaaacttggatttctatatttggttccaagtgacagcttatagattccc 495 GhLIPN_I atacatgctttcttccttttttt--ccccattttattttcttgctgagctaaagggaacatg 536 GhLIPN_N atacatgctttcttccttttttttaccccattttattttctcgctgagcaaaagggaacatg 557 GhLIPN_I ttatggtttaagttggcttcgtttaattttgaactgaacgcagatacctgaaagtaaagaac 598 GhLIPN_N ttatggtttaagttggcttcgtttaattttgaactgaacgcagataactgaaagtaaagaac 619 GhLIPN_I aaagatccttgattagaacgattttggatgtttatattgatttgtggattcattttttgata 660 GhLIPN_N aaagatccttgattagaacgattttggatgtttatattgatttgtggattcattttt-gata 680 GhLIPN_I gaattaa-ggaaactgcagaaggatttttgttaatctaagaaaggtttaattttgacaaATG 721 2 GhLIPN_N gaattaaaggaaactgcagaaggatttttgttaatctaaggaaggtttaattttgacaaATG 742 GhLIPN_I AATGTGGTTGGCAAAGTTGGGAGTTTAATTTCACAAGGTGTATATTCTGTTGCTACTCCTTT 783 GhLIPN_N AATGTGGTTGGCAAAGTTGGGAGTTTAATTTCACAAGGTGTATATTCTGTTGCTACTCCTTT 804 GhLIPN_I CCATCCTTTCGGTGGAGCGGTTGATATAATTGTTGTTCGGCAACCAGATGGGACTTTTCGGA 845 GhLIPN_N CCATCCTTTCGGTGGAGCGGTTGATATAATTGTTGTTCGGCAAGCAGATGGGACTTTTCGGA 866 GhLIPN_I GCACACCTTGGTATGTTCGGTTTGGGAAGTTTCAGGGTGTCTTGAAAGGGGCTGAGAAGGTC 907 GhLIPN_N GCTCGCCTTGGTATGTTCGTTTTGGGAAGTTTCAGGGTGTCTTGAAAGGGGCTGAAAAGGTC 928 GhLIPN_I GTTCGTATAACTGTTAATGGCATTGAAGTAGATTTTCATATGTATCTTGATAACTCTGGGCA 969 GhLIPN_N GTTCGTATAACTGTTAATGGCATTGAAGCAGATTTTCATATGTATCTTGATAACTCTGGGCA 990 GhLIPN_I AGCATATTTTTTAAAGGAGGTTGAATCTGGTAAAGGATTTGAGACAAATGGAGATTTGAAGG 1031 GhLIPN_N AGCATATTTTTTAAAGGAGGTTGAATCTGGTAAAGGATTTCAGACAAATGGAGATTTGAAGG 1052 GhLIPN_I ATTCTGATGGTGAAGTTGTTTCAGATTCCTGGGTGGCTCAATTGAGGGATGAATGTGATGCC 1093 GhLIPN_N ATTCTGATGGTGAAGTTGTTTCAGATTCCTCGGTGGCTCAATTGAGGGATGAATGTGATGCC 1114 GhLIPN_I ACAAATCAAAAACAGCTTGAAAGGGCAGAATCTGATACTAGGTTCTATGATTTTCAAGATGA 1155 GhLIPN_N ACAAATCACAAACAGCTTGAAAGGGCAGAATCTGATACTAGGTTCTATGATTTTCAAGATGA 1176 GhLIPN_I TCAGTTTTCTCAAGAGGGTCTGGTTAATTTTTCAGAATATGGGTCCAACCGATATGAGGGTT 1217 GhLIPN_N TCAGTTTTATCAAGAGGGTCTGGTTAATTTTTCAGAATATGGGTCCGACCGATATGGGGGTT 1238 GhLIPN_I TAGATAGTGAGTGTTTTGGGGAAGCAAAAGGTTTAGATTCAGTTGTCTTCTTCAGTGAGGAT 1279 GhLIPN_N TAGATAGTGAGTGTTTTGGGGAAGCACAAGGTTTAGATTCAGTTGTCTTCTTCAGTAAGGAT 1300 GhLIPN_I GGCCATATTCTTACTGCCCCTGTTTTGGCATCAGACAGGAGTGCTGAAAATGTGCAACTAAG 1341 GhLIPN_N GGCCATATTCTCACAGCCCCTGTTTTGGCATCAGACAGGAGTGCTGAAAATGTGCAACTAAG 1362 GhLIPN_I CACACCTCTGTTCCATATAGGAGCAGGTGAAGGGCCTGACTCTTGTGACGGTAATGGGGAAT 1403 GhLIPN_N CACACCTCTGTTCCATATAGGCGCAGGTGAAGGACCTGACTTTTGTGACGGTAATGGGGAAT 1424 GhLIPN_I TTAGTCCAGGTGGCAATGAATCAGATGCTGACTATATAGGTAAGCTCAATGCTGCGGCACCT 1465 GhLIPN_N TTAGTCCAGGTGGCAATGAATCAGATGCTGACTATATAGGTAAGCTCAATGCTGCGGCACCT 1486 GhLIPN_I AAGAATTCCTCTGATATTGTTTGCAGCTTGGACAATGATTCTACTGCTTTGAGACACCATCT 1527 GhLIPN_N AAGAATTCCTCTGATATTGTTTGCAGCTTGGACAATGATTCTACTGCTTTGAGACACCATCT 1548

GhLIPN_I AGAAGTTTGTGAAAAAGGGGGGGAACATGCTTGTCAAACTGAAGAAACTCGAAACCTTTTCA 1589 GhLIPN_N AGAAGTTTGTGAAAGAGGGGGGGAACATGCCTGTCATACTGAAGAAACTCGAAACCTTTTCA 1610 GhLIPN_I AGCATGAAAATGAATTTATCAGGCAAAGTTACAGTGAAGATGCATCCGTGCATATTAAGGAT 1651 GhLIPN_N AGCATGAAAATGAATTTATCAGGCAAAGTGACATTGAAGATGCATCCATGCATATTAAGGAT 1672 GhLIPN_I GATGTTTTTAAGAGCTGTCTTGAACTATCTGAATTGGGCAGACACGATGAGAACACCAATTC 1713 GhLIPN_N GATGTTTTTAAGAGCAGTCTTGAACTATCTGAATTGGGCAGACACGAAGAGAACACCAATTC 1734 GhLIPN_I TGAAGAAATAGATAGCCCACTGCAAGCTCAGATTTCACAAGATAAGCCTTCTTGCAGCCCCC 1775 GhLIPN_N TGAAGGAATAGATAGCCCACTGCAAGCTCAGATTTCACAAGATAAGCGTTCTTGCAGCCCCC 1796 GhLIPN_I CAGAAGTTGGTGAAACTGAAAATGGAGCTATTGGTGGCTCCAGAAATAAAGATGTGTTGTCT 1837 GhLIPN_N CAGAAGTTGGTGAAACTGAAGATGGAGCTATTGGTGGCTCCAGAAATAAAGATGTGTTGTCT 1858 GhLIPN_I TCCTCTTGTAGTCCCAATTATTCCAATGAAAATGGGTCTCCTGATTTACCAGTTGAAAAGAC 1899 GhLIPN_N TCCTCTTGTATTCCCAATTATTCCAATGAAATTGGGTCTCCTGATTTACCAGTTGAAAAGAC 1920 GhLIPN_I AATGTTGGCTACAGATAATATGGGTTCTAACAATGCATCTGTTGATTTGGTTGTTAATGACC 1961 GhLIPN_N AATGTTGGCTACAGATAATAAGGGTTCTAACAATGCATCTGTTGATTTGGTTGTTAATGACC 1982 GhLIPN_I CTGAATTGAGAGATGAACAGTTCAATACTTCAGCAGCAACTGAAGGGATGAATAGCAGTCTG 2023 GhLIPN_N CTGAATTGAGAGATGAACAGTTCGATACATCAGCAGCAACTGAGGGGATGAATAGCAGTCTG 2044 GhLIPN_I CAAAGTCCTCCACCTGAGGACAAGAGTAGCATAAGTGAGACTGTGGAAACTGAAACAAGTTG 2085 GhLIPN_N CAAAGTCCTCCACCTGAGGACAAGAGTAGCATAAGTGAGACTGTGGAAACTGAAACAAGTTG 2106 GhLIPN_I TGCTAAAGAGATAGATGTTCGTGCTAGCCTGGgtaaggaactaagtgcttttcctcctttaa 2147 2 GhLIPN_N TGCTAAAGAGATAGATGTTAGTGCTAGCCTGGgtaaggaactaagtgcttttcctccttgaa 2168 GhLIPN_I ttacttaacctcccttattcttgctctgattctcgccttaaatttccagGATTTGAGATCTC 2209 3 GhLIPN_N ttacttaacttcccttattcttgctctgattctcgccttaaatttccagGGTTTGAGATCTC 2230 GhLIPN_I ACTCTGTGGCAATGAACTTTATGTGGGTATGGGTTCAGATGCTGCAGCAGAAGTCTTTGAAT 2271 GhLIPN_N ACTCTGTGGCAATGAACTTTATGTGGGTATGGGCTCAGATGCTGCAGCAGAAGTCTTTGAAG 2292 GhLIPN_I CACATCGGATATCAATGGAGGAATACAAAAATAATGCAATGTCAATTATTAAGAATACAAAC 2333 GhLIPN_N CACATCGGATATCAATGGAGGAATACAAAAATAATGCAATTTCAATTATTAAGAATACAAAC 2354 GhLIPN_I CTAATCATCCGATTTGGAGAGATGTACTTCACATGGGAAAAAGCTGCTCCTGTTGTTCTCGG 2395 GhLIPN_N CTAATCATCCGATTTGGAGAGATGTACTTCACATGGGAAAAAGCTGCTCCTGTTGTTCTCGG 2416 GhLIPN_I AATGGCTGCATTTGGTTTAGAGTTAGCTATTGAGCCCCAGGATGCAATCCCTGTTGAAAAGG 2457 GhLIPN_N AATGGCTGCATTTGGTTTAGAATTAGCTATTGAGCCCCAGGATGCAATCCCTGTTGAAAAGG 2478 GhLIPN_I ATGAGTCATCAAAGCCTAAGGGTGGTGCTTCTGGTGTCACTTCTGCACCTTCTGGCCGCAGA 2519 GhLIPN_N ATGAGTCACCAAAGCCTAAGGGTGCTGCTTCTGGTGTCACTTCTGCACCTTCTGGCTGCAGA 2540 GhLIPN_I TGGAGGCTTTGGTCTATTCCCTTAAAAAGGGTCAAAACACTTGAGAAGACCGGTAGCAATTT 2581 GhLIPN_N TGGAGGCTTCGGTCTATTCCCTTAAAAAGGGTCAAAACACTTGAGAAGACCGGTAGCAATTT 2602 GhLIPN_I ATCTAGTGAAGAGGTATTTCTTGATACTGAATCTTCTCTACAAAATTCACCAGAAGATTTAA 2643 GhLIPN_N ATCTAGTGAAGAGGTATTTCTTGATACTGAATCTTCTCTACAAAATTCACCAGCAGATTTAA 2664 GhLIPN_I TTCCAACATCCAGTGGAAGGATTGAATCTCCTGGCAAACAATTTGTGAGGACAAATATTCCC 2705 GhLIPN_N TTCCAACATCCAGTGGAAGGATTGAATCTCCTGGCAAACAATTTGTGAGGACAAATATTCCC 2726 GhLIPN_I ACCAACGAGCAGATTGCTTCCTTGAATCTGAAAAATGGTCAAAATATGATTACTTTCAGTTT 2767 GhLIPN_N ACCAACGAGCAGATTGCTTCCTTGAATCTGAAAAATGGTCAAAACATGATTACTTTCAGTTT 2788 GhLIPN_I CTCCTCCAGGGTTCTGGGAATACAACAGgtttcaattttctccttactcaattcttctggta 2829 3 GhLIPN_N CTTCTCCAGGGTTCTGGGAACACAACAGgtatcaattttttctttacacaattcttctggta 2850 GhLIPN_I ggttcgcaaggtttgtgttctgcttaaattttagcctaaaaatctgtgtatctttaacagGT 2891 4 GhLIPN_N ggttcgcaaggtttgtgttctgcttaaattttagcctaaaaatctgtgtatctttaacagGT 2912 GhLIPN_I TGAAGCTCATCTTTACTTGTGGAAATGGAACGCAAAGATTGTAATTTCAGATGTGGATGGAA 2953 GhLIPN_N TGAAGCTCATCTTTACTTGTGGAAATGGAACGCAAAGATTGTAATTTCAGATGTGGATGGAA 2974 GhLIPN_I CTATTACCAAgtagggatttatctcctgatattgttgctctgctgtatttaaaattttcccg 3015 4 GhLIPN_N CTATTACCAAgtaaggatttatctcctgatattgttgctctgcagtatttaaaattttcctg 3036 GhLIPN_I tttcacccatttatggacctgtctaaactagctctgaatcttattatgatattattatggaa 3077 GhLIPN_N tttcacccatttatggacctgtctaaactagctctgaatcttattatgatattattatggaa 3098

Homoeolog analysis: GhLIPN N & I

GhLIPN_I acagagaaattaaaccatatattcgcatttgatgctgaaattataacttccattatgatctt 3139 GhLIPN_N acagagaaattaaaccatatattcgcatttgatgctgaaattataacttccattatggtctt 3160 GhLIPN_I tagtattagagcagcagtgagcattctagaagcagtcataccctaattgatttggtgtgaaa 3201 GhLIPN_N tagtattagagcagcagtgagcattctagaagcagtcataccctaattgatttggtgtgaaa 3222 GhLIPN_I tctttctttcgaatatgttgtagttatgtatagtgtaatacaagatgcatgtttcaataacc 3263 GhLIPN_N tctttctttcgaatatgttttagtcatgtatagtgtaatacaagatgcatgtttcaaaaacc 3284 GhLIPN_I ttaacatatttctttcagGTCTGATGTCTTAGGCCAGTTTATGCCTTTAGTTGGAAGGGATT 3325 5 GhLIPN_N ttaacatatttctttcagGTCTGATGTCTTAGGCCAGTTTATGCCTTTAGTTGGAAGGGATT 3346 GhLIPN_I GGACACAATCTGGTGTAGCTAATCTTTTCTCAGCTATTAAGgtatgcttttctcaattgttt 3387 5 GhLIPN_N GGACACAATCTGGTGTAGCTAATCTTTTCTCAGCTATTAAGgtatgcttttctcaattgttt 3408 GhLIPN_I ctttgttcagcttgacttgactgagaagttgttcttttctggactatcatgcaaaactagat 3449 GhLIPN_N ctttgttcaacttgacttgactgagaagtttttcttttctggactatcatgcaaaactagat 3470 GhLIPN_I ctaggaaattgctccaccgttttttagtttcaccacaactgaaatcaaaacttgtctcatgt 3511 GhLIPN_N ctagtaaattgctccaccattttttagtttcaccacaacagaaatcaaaacttgtctcatgt 3532 GhLIPN_I ttcccttgcgctttacttttactagttctttataaaattccccactgatgtaagtgatgata 3573 GhLIPN_N ttcccttgcgctttacttttactagttctttataaaattccccactgatgtaagtgatgata 3594 GhLIPN_I aattttttaatgcaagtttatagagtaacttttgataaccttagtatcaagattatttgagc 3635 GhLIPN_N aattttttaatgcaagtttatagagtaacttttgataaccttagtatcaagattatgtgagc 3656 GhLIPN_I attatcctgtcaccattatgccccttaacatttatttttctgttgcgaaatgtatttttgca 3697 GhLIPN_N attatcctgtcaccattatgcccctcaacatttatttttctgttgtgaaatgtatttttgca 3718 GhLIPN_I ctccttagtttatatatgggtagtgatgaaattcttctcataagttgcatttatgtcatctt 3759 GhLIPN_N cttcttagtttatatatgggtagtgatgaaattcttctcataagttgcatttgtgtcatctt 3780 GhLIPN_I tatccatccctcctgtaacatgtgtgcatgcacaaacacaatattgtttgtagggaacaata 3821 GhLIPN_N tattcatccctcctgtaacatgtgtgcatgcacaaacacaatattgtttgtagagaacaata 3842 GhLIPN_I tcaaactgtgtggctcttctatttattccattaaccaacatattttggcatgcatattaacc 3883 GhLIPN_N tcaaacagtgtggctcttctatttattccattaaccaacatattttggcatgcatattaacc 3904 GhLIPN_I aaatgatttccttgggtgtct-atgtccaacactaactgtatgttcacaacagGAGAATGGG 3944 6 GhLIPN_N aatt-attttcttgggtgtccgatgtccaacactaactgtatgttcacaacagGAGAACGGA 3965 GhLIPN_I TATCAGCTTCTATTTCTCAGTGCACGTGCAATTGTTCAGGCATATCTAACCAGAAGTTTCTT 4006 GhLIPN_N TATCAGCTTCTATTTCTCAGTGCACGTGCAATTGTTCAGGCATATCTAACCAGAAGTTTCTT 4027 GHLIPN_I ACTTAACCTGAAACAGgtaatggaatagcctgaatacaaatttttaaatttcccttatggaa 4068 6 GhLIPN_N ACTTAACCTGAAACAGgtaatggaatagcctgaatacacttttttacatttccctaatggaa 4089 GHLIPN_I gcaaaaaggcattagtgaaagtgttaaccaaagagaaaagctaacatttggaaaccatcttg 4130 GhLIPN_N gcaaaaaggt-------------ttaaccaaagagaaaagctaacatttggaaaccatcttg 4138 GhLIPN_I ttaaaagactttgaaatcacga-tttgtaaccgattttaactgatatagctttgtttatgta 4191 GhLIPN_N ttaaaagactttaaaatcacaaatttgtaaccgattttaactgatatagctttgtttatgta 4200 GhLIPN_I gGATGGAAAAGCTTTACCCTCTGGACCTGTTGTTATTTCTCCTGACGGTTTGTTTCCCTCAT 4253 7 GhLIPN_N gGATGGAAAAGCTTTACCCTCTGGACCTGTTGTGATTTCTCCTGACGGTTTGTTTCCCTCAT 4262 GhLIPN_I TGTACCGTGAGGgtgagatctttcattctttattttctcttacattgtttgtctattgaagt 4315 7 GhLIPN_N TGTACCGTGAGGgtgagatctttcattctttattttctctgacattgtttgtctattgaagt 4324 GhLIPN_I aatctagctgtttttttagtttgcttcacaattttgaggcttcagctacatgagaggaaaaa 4377 GhLIPN_N aatctagctgtttttttagtttgcttcacaattttgaggctccagctacatgagaggaaaaa 4386 GhLIPN_I gaattcttattgcattccatcacaaaacctttaaatgataaacgtagcaatatttgttgtct 4439 GhLIPN_N gaattcttattgtattccatcacaaaacctttaaatgataaacgtggcattatttgttgtct 4448 GhLIPN_I tggccttttctttatatcagaatcattaggctttattatcaagttacattgcttttcttctt 4501 GhLIPN_N tggccttttctttatatcagaatcattaggctttatcatcaagttacattgcttttcttctt 4510 GhLIPN_I ttacttcctttttaattatttataggttccattctaataaactatacttggcagTGATAAGA 4563 8 GhLIPN_N ttacttcctttttaattatttatgggttccattctaataaactatacttggcagTGATAAGA 4572 GhLIPN_I AGAACACCTCATGAATTCAAAATAGCTTGTTTAAAGgtgaattctgaaatattcaaccttgt 4625 8 GhLIPN_N AGAACACCTCATGAATTCAAAATAGCTTGTTTAAAGgtgaattctgaaatattcaaccttgt 4634

GhLIPN_I ttctaaatgtacaacggaactattactgcatccggatcagtcttttcacctacgatttcttc 4687 GhLIPN_N ttctaaatatacaacggaactattactgcatccagatcagtcttttcacctatgatttcttt 4696 GhLIPN_I cttatacgtgcgttgcagAATATCAGGAAGCTTTTCCCTTCGGACTACAATCCATTTTATGC 4749 9 GhLIPN_N cttatacatgaattgcagAATATCAGGAAACTTTTCCCTTCCGACTACAATCCATTTTATGC 4758 GhLIPN_I AGGCTTTGGAAACAGAGACACAGATGAACTCAGTTATAAAGAAATTGGCATCCCAAAGGGAA 4811 GhLIPN_N AGGCTTTGGAAACAGAGACACAGATGAACTCAGTTATAAAGAAATTGGCATCCCAAAGGGAA 4820 GhLIPN_I AAAGGTTTATTATTAATCCAAAGgtacatgtttttcatgtcctatttatgagaaatctatat 4873 9 GhLIPN_N AAATATTTATTATTAATCCAAAGgtacatgtttttcatgtcctacttatgagaaatctatat 4882 GhLIPN_I tgcaattaccatatcaaatttataacaatcggactcaaaactggcagGGTAAGGTGGCTGTA 4935 10 GhLIPN_N tgcaattaccatatcaaatttataacaatcgggctcaaaactggcagGGTGAGGTGGCTGTA 4944 GhLIPN_I AGTTATTGTATGAACTCGAGGTCATACACATCATTGCATACTCTTGTAGAAGAGATGTTTCC 4997 GhLIPN_N AGTTATTGTATGAACACGAGGTCATACACATCATTGCATACTCTTGTAGAAGAGATGTTTCC 5006 GhLIPN_I ACCAACCTCATCCATTGAACCGgtaaggcatccatttctcgcctctttatatatgatgcaaa 5059 10 GhLIPN_N ACCAACCTCATCCATTGAACTGgtaaggcatcaatttctcgcctgtttatatatgatgcaaa 5068 GhLIPN_I aaaacaattgaaaatttatctacaactgtctcttatttgttttggagccttcttgatatcct 5121 GhLIPN_N agaacaattgaaaatttatctacaactgtctcttatttgttttggagccttcttgatatcct 5130 GhLIPN_I tgatattttaatacgtttgcagGAAGATTATAACGAATGGAATTTTTGGAAAGTGCCATTGC 5183 11 GhLIPN_N tgatattttaatacgtttgcagGAAGATTATAACGAATGGAATTTTTGGAAAGTGCCATTGC 5192 GhLIPN_I AAGATATTGAGTAAcatagttgattgcctaaagaatacggtttagttataccaaggagatga 5245 GhLIPN_N AAGATATTGAGTAAcatagttgattgcctaaagaatacggtttagttataccaaggagatga 5254 GhLIPN_I tcagtagcttaatttgtcccataggtaagaaattgttctgagattcttcaagttcttaaact 5307 GhLIPN_N tcagtagcgtaatttgtcccataggtaagaaattgttatgatattcttcaagttcttaaact 5316 GhLIPN_I cgcttcctcactgacattcatattctttggtgaccttgaaatggtctgttggggttcatctt 5369 GhLIPN_N cgcttcctcactgacattcatattctttggtgaccttgaaatggtctgttggggttcatctt 5378 GhLIPN_I gtaaggaataaataattcaagtggataaaatcaaagccattgccta 5377 GhLIPN_N gtaaggaataaataattcaagtggataaaatcaaagccattgccta 5386

Sequence divergence: 2.5% (ungapped); between introns: 2.97%, exons: 2.27%

GhLIPN_N MNVVGKVGSLISQGVYSVATPFHPFGGAVDIIVVRQADGTFRSSPWYVRFGKFQGVLKGA 60 GhLIPN_I MNVVGKVGSLISQGVYSVATPFHPFGGAVDIIVVRQPDGTFRSTPWYVRFGKFQGVLKGA 60 ************************************.******:**************** GhLIPN_N EKVVRITVNGIEADFHMYLDNSGQAYFLKEVESGKGFQTNGDLKDSDGEVVSDSSVAQLR 120 GhLIPN_I EKVVRITVNGIEVDFHMYLDNSGQAYFLKEVESGKGFETNGDLKDSDGEVVSDSWVAQLR 120 ************.************************:**************** ***** GhLIPN_N DECDATNHKQLERAESDTRFYDFQDDQFYQEGLVNFSEYGSDRYGGLDSECFGEAQGLDS 180 GhLIPN_I DECDATNQKQLERAESDTRFYDFQDDQFSQEGLVNFSEYGSNRYEGLDSECFGEAKGLDS 180 *******:******************** ************:** **********:**** GhLIPN_N VVFFSKDGHILTAPVLASDRSAENVQLSTPLFHIGAGEGPDFCDGNGEFSPGGNESDADY 240 GhLIPN_I VVFFSEDGHILTAPVLASDRSAENVQLSTPLFHIGAGEGPDSCDGNGEFSPGGNESDADY 240 *****:*********************************** ****************** GhLIPN_N IGKLNAAAPKNSSDIVCSLDNDSTALRHHLEVCERGGEHACHTEETRNLFKHENEFIRQS 300 GhLIPN_I IGKLNAAAPKNSSDIVCSLDNDSTALRHHLEVCEKGGEHACQTEETRNLFKHENEFIRQS 300 **********************************:******:****************** GhLIPN_N DIEDASMHIKDDVFKSSLELSELGRHEENTNSEGIDSPLQAQISQDKRSCSPPEVGETED 360 GhLIPN_I YSEDASVHIKDDVFKSCLELSELGRHDENTNSEEIDSPLQAQISQDKPSCSPPEVGETEN 360 ****:*********.*********:****** ************* ***********: GhLIPN_N GAIGGSRNKDVLSSSCIPNYSNEIGSPDLPVEKTMLATDNKGSNNASVDLVVNDPELRDE 420 GhLIPN_I GAIGGSRNKDVLSSSCSPNYSNENGSPDLPVEKTMLATDNMGSNNASVDLVVNDPELRDE 420 **************** ****** **************** ******************* GhLIPN_N QFDTSAATEGMNSSLQSPPPEDKSSISETVETETSCAKEIDVSASLGFEISLCGNELYVG 480 GhLIPN_I QFNTSAATEGMNSSLQSPPPEDKSSISETVETETSCAKEIDVRASLGFEISLCGNELYVG 480 **:*************************************** ***************** GhLIPN_N MGSDAAAEVFEAHRISMEEYKNNAISIIKNTNLIIRFGEMYFTWEKAAPVVLGMAAFGLE 540 GhLIPN_I MGSDAAAEVFESHRISMEEYKNNAMSIIKNTNLIIRFGEMYFTWEKAAPVVLGMAAFGLE 540 ***********:************:*********************************** GhLIPN_N LAIEPQDAIPVEKDESPKPKGAASGVTSAPSGCRWRLRSIPLKRVKTLEKTGSNLSSEEV 600 GhLIPN_I LAIEPQDAIPVEKDESSKPKGGASGVTSAPSGRRWRLWSIPLKRVKTLEKTGSNLSSEEV 600 ****************.****.********** **** ********************** GhLIPN_N FLDTESSLQNSPADLIPTSSGRIESPGKQFVRTNIPTNEQIASLNLKNGQNMITFSFFSR 660 GhLIPN_I FLDTESSLQNSPEDLIPTSSGRIESPGKQFVRTNIPTNEQIASLNLKNGQNMITFSFSSR 660 ************ ******************************************** ** GhLIPN_N VLGTQQVEAHLYLWKWNAKIVISDVDGTITKSDVLGQFMPLVGRDWTQSGVANLFSAIKE 720 GhLIPN_I VLGIQQVEAHLYLWKWNAKIVISDVDGTITKSDVLGQFMPLVGRDWTQSGVANLFSAIKE 720 *** ******************************************************** GhLIPN_N NGYQLLFLSARAIVQAYLTRSFLLNLKQDGKALPSGPVVISPDGLFPSLYREVIRRTPHE 780 GhLIPN_I NGYQLLFLSARAIVQAYLTRSFLLNLKQDGKALPSGPVVISPDGLFPSLYREVIRRTPHE 780 ************************************************************ GhLIPN_N FKIACLKNIRKLFPSDYNPFYAGFGNRDTDELSYKEIGIPKGKIFIINPKGEVAVSYCMN 840 GhLIPN_I FKIACLKNIRKLFPSDYNPFYAGFGNRDTDELSYKEIGIPKGKRFIINPKGKVAVSYCMN 840 ******************************************* *******:******** GhLIPN_N TRSYTSLHTLVEEMFPPTSSIELEDYNEWNFWKVPLQDIE 880 GhLIPN_I SRSYTSLHTLVEEMFPPTSSIEPEDYNEWNFWKVPLQDIE 880 :********************* *****************

Homoeolog analysis: GhLIPN N & I

* Identical aa: Conserved aa (16). Semi-conserved aa (5)

40 mismatches (4.5%)

GhLIPN_I CGTGCAATTGTTCAGGCATATCTAACCAGAAGTTTCTTACTTAACCTGAAACAGgta 4025 GhLIPN_N CGTGCAATTGTTCAGGCATATCTAACCAGAAGTTTCTTACTTAACCTGAAACAGgta 4046 GR_Ea05119 ------------------------------------------------------... 209 GhLIPN_I atggaatagcctgaatacaaatttttaaatttcccttatggaagcaaaaaggcatta 4082 GhLIPN_N atggaatagcctgaatacacttttttacatttccctaatggaagcaaaaaggt.... 4099 GR_Ea05119 ......................................................... 209 GhLIPN_I gtgaaagtgttaaccaaagagaaaagctaacatttggaaaccatcttgttaaaagac 4139 GhLIPN_N .........ttaaccaaagagaaaagctaacatttggaaaccatcttgttaaaagac 4147 GR_Ea05119 ......................................................... 209 GhLIPN_I tttgaaatcacga.tttgtaaccgattttaactgatatagctttgtttatgtagGAT 4195 GhLIPN_N tttaaaatcacaaatttgtaaccgattttaactgatatagctttgtttatgtagGAT 4204 GR_Ea05119 ......................................................--- 212 GhLIPN_I GGAAAAGCTTTACCCTCTGGACCTGTTGTTATTTCTCCTGACGGTTTGTTTCCCTCA 4252 GhLIPN_N GGAAAAGCTTTACCCTCTGGACCTGTTGTGATTTCTCCTGACGGTTTGTTTCCCTCA 4261 GR_Ea05119 -----------------------------T--------------------------- 269 GhLIPN_I TTGTACCGTGAGGgtgagatctttcattctttattttctcttacattgtttgtctat 4309 GhLIPN_N TTGTACCGTGAGGgtgagatctttcattctttattttctctgacattgtttgtctat 4318 GR_Ea05119 -----------------------------------------t--------------- 326 GhLIPN_I tgaagtaatctagctgtttttttagtttgcttcacaattttgaggcttcagctacat 4366 GhLIPN_N tgaagtaatctagctgtttttttagtttgcttcacaattttgaggctccagctacat 4375 GR_Ea05119 -----------------------------------------------t--------- 383 GhLIPN_I gagaggaaaaagaattcttattgcattccatcacaaaacctttaaatgataaacgta 4423 GhLIPN_N gagaggaaaaagaattcttattgtattccatcacaaaacctttaaatgataaacgtg 4432 GR_Ea05119 ---------------------.-c--------------------------------a 439 GhLIPN_I gcaatatttgttgtcttggccttttctttatatcagaatcattaggctttattatca 4480 GhLIPN_N gcattatttgttgtcttggccttttctttatatcagaatcattaggctttatcatca 4489 GR_Ea05119 --gt------------------------------------------------t---- 496 GhLIPN_I agttacattgcttttcttcttttacttcctttttaattatttataggttccattcta 4537 GhLIPN_N agttacattgcttttcttcttttacttcctttttaattatttatgggttccattcta 4546 GR_Ea05119 --------------------------------------------a------------ 553 GhLIPN_I ataaactatacttggcagTGATAAGAAGAACACCTCATGAATTCAAAATAGCTTGTT 4594 GhLIPN_N ataaactatacttggcagTGATAAGAAGAACACCTCATGAATTCAAAATAGCTTGTT 4603 GR_Ea05119 --------------------------------------------------------- 610 GhLIPN_I TAAAGgtgaattctgaaatattcaaccttgtttctaaatgtacaacggaactattac 4651 GhLIPN_N TAAAGgtgaattctgaaatattcaaccttgtttctaaatatacaacggaactattac 4660 GR_Ea05119 ---------------------------------------g----------------- 667 GhLIPN_I tgcatccggatcagtcttttcacctacgatttcttccttatacgtgcgttgcagAAT 4708 GhLIPN_N tgcatccagatcagtcttttcacctatgatttctttcttatacatgaattgcagAAT 4717 GR_Ea05119 -------g------------------t--------t-------g--cg--------- 724 GhLIPN_I ATCAGGAAGCTTTTCCCTTCGGACTACAATCCATTTTATGCAGGCTTTGGAAACAGA 4765 GhLIPN_N ATCAGGAAACTTTTCCCTTCCGACTACAATCCATTTTATGCAGGCTTTGGAAACAGA 4774 GR_Ea05119 --------G-----------G------------------------------------ 781 GhLIPN_I GACACAGATGAACTCAGTTATAAAGAAATTGGCATCCCAAAGGGAAAAAGGTTTATT 4822 GhLIPN_N GACACAGATGAACTCAGTTATAAAGAAATTGGCATCCCAAAGGGAAAAATATTTATT 4831 GR_Ea05119 -------------------------------------------------GA------ 838 GhLIPN_I ATTAATCCAAAGgtacatgtttttcatgtcctatttatgagaaatctatattgcaat 4879 GhLIPN_N ATTAATCCAAAGgtacatgtttttcatgtcctacttatgagaaatctatattgcaat 4888 GR_Ea05119 ------------............................................. 850 GhLIPN_I taccatatcaaatttataacaatcggactcaaaactggcagGGTAAGGTGGCTGTAA 4936 GhLIPN_N taccatatcaaatttataacaatcgggctcaaaactggcagGGTGAGGTGGCTGTAA 4945 GR_Ea05119 .........................................---G------------ 866 GhLIPN_I GTTATTGTATGAACTCGAGGTCATACACATCATTGCATACTCTTGTAGAAGAGATGT 4993 GhLIPN_N GTTATTGTATGAACACGAGGTCATACACATCATTGCATACTCTTGTAGAAGAGATGT 5002 GR_Ea05119 --------------A------------------------------------------ 923 GhLIPN_I TTCCACCAACCTCATCCATTGAACCGgtaaggcatccatttctcgcctctttatata 5050 GhLIPN_N TTCCACCAACCTCATCCATTGAACTGgtaaggcatcaatttctcgcctgtttatata 5059 GR_Ea05119 ------------------------C-............................... 949

7

7

8

8

9

10

10

6

9

GhLIPN_I tgatgcaaaaaaacaattgaaaatttatctacaactgtctcttatttgttttggagc 5107 GhLIPN_N tgatgcaaaagaacaattgaaaatttatctacaactgtctcttatttgttttggagc 5116 GR_Ea05119 ......................................................... 949 GhLIPN_I cttcttgatatccttgatattttaatacgtttgcagGAAGATTATAACGAATGGAAT 5164 GhLIPN_N cttcttgatatccttgatattttaatacgtttgcagGAAGATTATAACGAATGGAAT 5173 GR_Ea05119 ....................................--------------------- 970 GhLIPN_I_ TTTTGGAAAGTGCCATTGCAAGATATTGAGTAAcatagttgattgcctaaagaatac 5221 GhLIPN_N_ TTTTGGAAAGTGCCATTGCAAGATATTGAGTAAcatagttgattgcctaaagaatac 5230 GR_Ea05119 --------------------------G------------------------------ 1027 GhLIPN_I ggtttagttataccaaggagatgatcagtagcttaatttgtcccataggtaagaaat 5278 GhLIPN_N ggtttagttataccaaggagatgatcagtagcgtaatttgtcccataggtaagaaat 5287 GR_Ea05119 --------------------------------t------------------------ 1084 GhLIPN_I tgttctgagattcttcaagttcttaaactcgcttcctcactgacattcatattcttt 5335 GhLIPN_N_ tgttatgatattcttcaagttcttaaactcgcttcctcactgacattcatattcttt 5344 GR_Ea05119 ----c---t-—c--------------------------------------------- 1141 GhLIPN_I ggtgaccttgaaatggtctgttggggttcatcttgtaaggaataaataattcaagtg 5392 GhLIPN_N ggtgaccttgaaatggtctgttggggttcatcttgtaaggaataaataattcaagtg 5401 GR_Ea05119 ----c---------------------------------------------------- 1198 GhLIPN_I gataaaatcaaagccattgccta 5415 GhLIPN_N gataaaatcaaagccattgccta 5458 GR_Ea05119 ----------------------c 1222

11

GhLIPN_I acagGTTGAAGCTCATCTTTACTTGTGGAAATGGAACGCAAAGATTGTAATTTCAGA 2942 GhLIPN_N acagGTTGAAGCTCATCTTTACTTGTGGAAATGGAACGCAAAGATTGTAATTTCAGA 2963 GR_Ea05119 .........................................---------------- 16 GhLIPN_I TGTGGATGGAACTATTACCAAgtagggatttatctcctgatattgttgctctgctgt 2999 GhLIPN_N TGTGGATGGAACTATTACCAAgtaaggatttatctcctgatattgttgctctgcagt 3020 GR_Ea05119 ---------C-T---------.................................... 37 GhLIPN_I atttaaaattttcccgtttcacccatttatggacctgtctaaactagctctgaatct 3056 GhLIPN_N atttaaaattttcctgtttcacccatttatggacctgtctaaactagctctgaatct 3077 GR_Ea05119 ......................................................... 37 GhLIPN_I tattatgatattattatggaaacagagaaattaaaccatatattcgcatttgatgct 3113 GhLIPN_N tattatgatattattatggaaacagagaaattaaaccatatattcgcatttgatgct 3134 GR_Ea05119 ......................................................... 37 GhLIPN_I gaaattataacttccattatgatctttagtattagagcagcagtgagcattctagaa 3170 GhLIPN_N gaaattataacttccattatgatctttagtattagagcagcagtgagcattctagaa 3191 GR_Ea05119 ......................................................... 37 GhLIPN_I gcagtcataccctaattgatttggtgtgaaatctttctttcgaatatgttgtagtta 3227 GhLIPN_N gcagtcataccctaattgatttggtgtgaaatctttctttcgaatatgttttagtca 3248 GR_Ea05119 ......................................................... 37 GhLIPN_I tgtatagtgtaatacaagatgcatgtttcaataaccttaacatatttctttcagGTC 3284 GhLIPN_N tgtatagtgtaatacaagatgcatgtttcaaaaaccttaacatatttctttcagGTC 3305 GR_Ea05119 ......................................................--- 40 GhLIPN_I TGATGTCTTAGGCCAGTTTATGCCTTTAGTTGGAAGGGATTGGACACAATCTGGTGT 3341 GhLIPN_N TGATGTCTTAGGCCAGTTTATGCCTTTAGTTGGAAGGGATTGGACACAATCTGGTGT 3362 GR_Ea05119 ----------------------- C-------------------------------- 97 GhLIPN_I ------------------------.gtatgcttttctcaattgtttctttgttcagc 3398 GhLIPN_N AGCTAATCTTTTCTCAGCTATTAA.gtatgcttttctcaattgtttctttgttcaac 3419 GR_Ea05119 ------------------------G................................ 121 GhLIPN_I ttgacttgactgagaagttgttcttttctggactatcatgcaaaactagatctagga 3455 GhLIPN_N ttgacttgactgagaagtttttcttttctggactatcatgcaaaactagatctagta 3476 GR_Ea05119 ......................................................... 121 GhLIPN_I aattgctccaccgttttttagtttcaccacaactgaaatcaaaacttgtctcatgtt 3512 GhLIPN_N aattgctccaccattttttagtttcaccacaacagaaatcaaaacttgtctcatgtt 3533 GR_Ea05119 ......................................................... 121 GhLIPN_I tcccttgcgctttacttttactagttctttataaaattccccactgatgtaagtgat 3569 GhLIPN_N tcccttgcgctttacttttactagttctttataaaattccccactgatgtaagtgat 3590 GR_Ea05119 ........................................................ 121 GhLIPN_I gataaattttttaatgcaagtttatagagtaacttttgataaccttagtatcaagat 3626 GhLIPN_N gataaattttttaatgcaagtttatagagtaacttttgataaccttagtatcaagat 3647 GR_Ea05119 ......................................................... 121 GhLIPN_I tatttgagcattatcctgtcaccattatgccccttaacatttatttttctgttgcga 3683 GhLIPN_N tatgtgagcattatcctgtcaccattatgcccctcaacatttatttttctgttgtga 3704 GR_Ea05119 ......................................................... 121 GhLIPN_I aatgtatttttgcactccttagtttatatatgggtagtgatgaaattcttctcataa 3740 GhLIPN_N aatgtatttttgcacttcttagtttatatatgggtagtgatgaaattcttctcataa 3761 GR_Ea05119 ......................................................... 121 GhLIPN_I gttgcatttatgtcatctttatccatccctcctgtaacatgtgtgcatgcacaaaca 3797 GhLIPN_N gttgcatttgtgtcatctttattcatccctcctgtaacatgtgtgcatgcacaaaca 3818 GR_Ea05119 ......................................................... 121 GhLIPN_I caatattgtttgtagggaacaatatcaaactgtgtggctcttctatttattccatta 3854 GhLIPN_N caatattgtttgtagagaacaatatcaaacagtgtggctcttctatttattccatta 3875 GR_Ea05119 ......................................................... 121 GhLIPN_I accaacatattttggcatgcatattaaccaaatgatttccttgggtgtctatgtcca 3911 GhLIPN_N accaacatattttggcatgcatattaaccaattattttcttgggtgtccgatgtcca 3932 GR_Ea05119 ......................................................... 121 GhLIPN_I acactaactgtatgttcacaacagGAGAATGGGTATCAGCTTCTATTTCTCAGTGCA 3968 GhLIPN_N acactaactgtatgttcacaacagGAGAACGGATATCAGCTTCTATTTCTCAGTGCA 3989 GR_Ea05119 ........................-G---C--G------------------------ 155

5

4

5

4

6

Genomic assignment : A- or D-genome

Agreements between GhLIPN I and D-genome = 18

Agreements between GhLIPN N and D-genome = 8

Agreements between both homoeologs = 9

Therefore GhLIPN I likely derives from the ancestral D-genome and GhLIPN N from the ancestral A-genome

GhLIPN_I_prom TTTCTTAACGCTGATTATGAGATTGTGAGAGAACGAGTATATATAGAGTGAAAAAAGA -943 GHLIPN_N_prom TTGGTTCACGTTGATTATGTGATTGTGAGAAAAAGAGAATATATAGAGTGAAAAAAGG -1383 GhLIPN_I_prom GAGATGATGATGTAACGTAATTAGGCAACGAGTGGATAGGCTAACGAGGTTTATTTTA -885 GHLIPN_N_pro1 GAGATGATGATGTAACGTAATTAGGCAAAGAGTGGATAGGCTAATGAGGTTTATTTTA -1325 GhLIPN_I_prom TTTTGACCAGAAGGGGCCGCCTTTAAAAGTAGATTGGAGTATGATATAAGTAGGAGTT -827 GHLIPN_N_prom TTTTGACCAGAATGGGCCGCCTTTAAAAGTAGAATGGAGTATGATATAAGTAGGAGTT -1267 GhLIPN_I_prom GCGATGTGAAGCAGGGGATGAATGTGTAATAAAAAAGAAAGTAAAATTCCTGCCTTCT -769 GHLIPN_N_prom GCGATATGAAGCAGGGGATGAATGTGTAATAAGAAAGAAAATAAAATTCATGGCTTCT -1209 GhLIPN_I_prom CCTCATCTCA-----ACCAGTCGGGTAAAACATAAAATC----TAAATTAT--TTAG- -723 GHLIPN_N_prom CCTCATCTCAACTCAACCCGTAAGGTAAAACATAAAAACAAACTAAAATATAGTTAAG -1151 GhLIPN_I_prom -TATTTATTTAAATGAGATTTGGTTGGTGATCATGGTA-TTATAGTTTTGGTAGAAGA -667 GHLIPN_N_prom TTAATAATCTAAATGA-ATTAG--TGGTGATCATGGTTGTTGTAGTTTTGGTAGAAGA -1096 GhLIPN_I_prom AGATGGGTC---TT--ACAAAAAACAAACATCCCAAACTATAATTTAGATTTTAATTT -614 GHLIPN_N_prom AGATGGGTGAGTTTCAACAAAAACAAAACATCCCAAACTATAATTTAGAATTTA---- -1042 GhLIPN_I_prom CTATTAATTACGTGTTAAATGCCATCTTAGCCACGACACAGAGCATGATATCTTAAAC -556 GHLIPN_N_prom ---------------TAAATG---TTTAATTCAATGCTCACCG-------TCTTAAAC -1009 GHLIPN_I_prom ACGGGAATTAAATC----------TTTACTTCTAA-----ATTCACTTATTTGGTCGC -513 GHLIPN_N_prom ACCGGAATTAAATCTAAATTCACTTTTGTTTCTTATTATTATTACAAAATTTTGTCGC -951 GHLIPN_I_prom ATTTAATTTTAATCATTTCTTTTCTTTTCCAAAACAAGCTCAATATTTGTTATGTTTG -455 GHLIPN_N_prom ATTTAATTTTAATCATTTCTTTTCTTTTCCAAAACAAGCTCAATATTTGGTATGTTTG -893 GHLIPN_I_prom AATTAAGCT------------------------------------------------- -446 GHLIPN_N_prom AATTAAGCTTTTAGTTGTTTCAAACACTTACAAAAATAACAGAATTTTATGTTGGTTC -835 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom ACTGGAAAGTGTATCACTAACACTCATTTTTTATTTTTGTTATTTTGTCACATGTAAG -777 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom ATTTGAAATATACAAGTTTCAAGGTTTTTATTATTCAATCAGAATAAATATGAATAAT -719 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom TTTCACTAATATGATAGAAAATACTCCTGAATAATGTCTTTTATTTCGATCTCATTAT -661 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom AGAATGATTAGATTTTTGAATTTCAATTTTAGATATGTAGGAAAATATAAAATGTAAG -603 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom GGAGGAATTTTTGAGGAAAACTAACAAATTAGATGTTAGACTTTGATGTTTGAGATAG -545 GHLIPN_I_prom ---------------------------------------------------------- -445 GHLIPN_N_prom ATTATAATCTACATTTAAAAAAAAAAAAGAAAAAGAAAAAGAATTTCGACTTCTTTTT -487 GHLIPN_I_prom ----------------------------------------CAGGAGGTTTCTTTTC-A -429 GHLIPN_N_prom AAACAAAAGTTGATTAGTGTTAGTGATACACTGTCATTCTCAGGAGGTTACTTTTCTA -429 GHLIPN_I_prom CGTGTTTGAAACAACTAACAGCTTAATTCAAACCTAACAAATATTGAGCTTGTTTTGG -371 GHLIPN_N_prom AGTGTTTGAAACAACTGAAAGCTTAATTCAAACATAACAAATATTGAGCTTGTTTTGG -371 GHLIPN_I_prom AA---------------TTATATGCGACCAAATTTTCTAATAATAGACAAGAAACAAA -328 GHLIPN_N_prom AAAAGAAATGATTTAAATTATATGCGACAAAATTTTCTAATAATATATAAGAAACAAA -313 GHLIPN_I_prom TGTGAAATGAAGGGTAATTTAGTCAATTTAGATGTAAAGATT-------TTTTTTTAG -277 GHLIPN_N_prom AGTGAAATGAAGGGTAATTTAGTCAATTTAGAAGTTAATAATTTTTTTCTTTTTTTCG -255 GHLIPN_I_prom CATTACTGTAGAAGTTTCAATAATTATAATGTTTGTTCATTCCGGCGGCTCTACCATC -219 GHLIPN_N_prom GAATAAAACTCAAGTTTCAATAATTATAATGTTTCTTCATCCCGGCGGCTCTACCATC -197 GHLIPN_I_prom CTTAAG-CCTGAAAAAGCGGGACAATATTTCAAATATCCTAAATTACCCTTCCATTAA -162 GHLIPN_N_prom CTTAAGACCGGAAAAAGCGGGACAATATTTCAAATATCCAAAATTACCCTTCCATTAA -139 GHLIPN_I_prom TCACAAATTTACACCTCATATTTCTCGCTTCGTAGAGTCTTTCTAAGCAAAACTAAGG -104 GHLIPN_N_prom TCACAAATTTACACGTCATATTTCTCG----------TATTTTTAAGCAAAACTAAGG -91 GHLIPN_I_prom GACCTATTTATGTCTTTTCACAAGGTTACTTGTTTATTGCATGGCGGTTGCGGTTAG- -47 GHLIPN_N_prom GACCTATTTATGTCTTTTCGCA--------------TTGCATGGCGGTTGCGGTTAGG -47 GHLIPN_I_prom TCGCCGTTGCCAACTTCTGACTCGTTTTCAAAATTTCCATTTTTTT -1 GHLIPN_N_prom TCGCCGTTGCCAACTTCTGACTCGTTTTCTAAATTTCCATTTTTCC -1

Elicitor/pathogen-responsive

Promoter analysis: GhLIPN

Dof TFBS

Myb/Myc TFBS

Hormone-responsive

CAAT-box

Protein analysis: GhLIPN 1 105 672

829

630 858 N-terminal lipin C-LIP HAD-like

domain

BNLS775-791

Predicted protein structure: GhLIPN

• Homology to phosphatase domain of a polynucleotide kinase (HAD superfamily)• 18% ID; E-score = 3.7e-07,

estimated precision = 100%

Haloacid dehalogenase (HAD) domain

RT-qPCR: GhLIPN

0 2 4 6 8 100.000

0.500

1.000

1.500

2.000

2.500

3.000

3.500

4.000

4.500

5.000

Time (h)

Rela

tive

conc

entr

ation

* p-value > 0.05

*

*

*

*

Hypothetical Gene Interaction Model

top related