narrow-leafed lupin (lupinus angustifolius l.) functional … · 2017. 1. 21. ·...

10
Narrow-leafed lupin (Lupinus angustifolius L.) functional identication and characterization of the aldehyde dehydrogenase (ALDH) gene superfamily Jose C. Jimenez-Lopez Plant Reproductive Biology Laboratory, Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Spanish National Research Council (CSIC), 1 Profesor Albareda, Granada E-18008, Spain The UWA Institute of Agriculture, The University of Western Australia, 35 Stirling Highway, Crawley, Perth, WA 6009, Australia abstract article info Article history: Received 1 January 2016 Received in revised form 25 February 2016 Accepted 23 March 2016 Available online 28 March 2016 Aldehyde dehydrogenases (ALDHs) constitute a NAD(P) + -dependent superfamily of proteins, which catalyze the irreversible oxidation of aldehydes molecules to non-toxic carboxylic acids. This superfamily in legume seeds has a primary role in oxidative stress response under biotic and abiotic stresses as desiccation, cold, salinity, and heat shock factors that largely affect seed germination and seedling growth. Up to date only a study has been dedicated to identify and characterize the complete ALDH superfamily in a le- gume crop, Glycine max, and other few partial studies focusing on their sequence cloning in pea, and fenugreek, despite the importance of legume crops for food sustainability. The consecution of the narrow-leafed lupin (NLL) (Lupinus angustifolius L.) genome sequencing project have resulted in the identication of ALDH genes that still need to be comparatively analyzed for systematic identication and ALDH gene families functional characterization. In this study, nineteen unique ALDH sequences that encode for 10 ALDH families were identied in L. angustifolius. 5 additional ALDH families (ALDH5, ALDH6, ALDH12, ALDH18, and ALDH22) are present in this specie in comparison to Glycine max. Phylogenetic analysis showed that ALDHs were split into 11 clades contain- ing at least one ALDH gene of NLL. The relationship among families is quite well conserved between legume species and among monocot and dicot plants. Structurefunctional analysis of ALDH5 and ALDH6 showed a catalytic mechanism which involved C330E427 couple, and C320/R143-R319, respectively in a specic molec- ular (catalytic) environment. The structurefunctional analysis identied potential neo-functionalities only in families ALDH2C4 and ALDH7B4. Overall, this study reveals new insights into the functionalstructure features and evolution of ALDH gene fam- ilies, particularly concerning NLL, a legume crop. These ALDH functional identication and characterization are valuable in molecular breeding strategies applied to legume for improvement of stress tolerance and yield. © 2016 The Author. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Keywords: Aldehyde molecules Genomics L. albus L. luteus Legume crops Metabolism Narrow-leafed lupin Oxidative stress Sweets lupins 1. Introduction ALDH superfamily is integrated of 23 families of diverse enzymes displaying key roles in a broad number of aldehyde molecules, as well as intermediates or by-products of several fundamental metabolic path- ways in both, catabolic and biosynthetic pathways (Ishitani et al., 1995). Aldehyde dehydrogenases (ALDHs) have been found in prokaryotic and eukaryotic organisms, where many ALDH families are highly conserved between plants and animals (Brocker et al., 2013), an indicative of com- mon use of substrates among these enzymes. ALDH enzymes integrate as cofactor either NAD + or NADP + to transform aldehyde molecules to carboxylic acids, indicating that this reaction is a major source of reducing equivalents (NADH or NADPH) (Hou and Bartels, 2015), helping to cellular redox balance maintenance. Plants are sessile organisms that are continuously being exposed to different types of biotic (i.e. pests, insects, fungi), and abiotic stresses (i.e. salinity, desiccation, cold and heat shock). This induces the genera- tion of reactive oxygen species (ROS), leading to an imbalanced cellular aldehyde molecule level, and interfering with steady-state metabolic reactions in cells (Bartels, 2001). In order to protect themselves and coping with these stresses, plants have to produce a broad range of Plant Gene 6 (2016) 6776 Corresponding author at: Plant Reproductive Biology Laboratory, Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Spanish National Research Council (CSIC), 1 Profesor Albareda, Granada E-18008, Spain. E-mail addresses: [email protected], [email protected]. http://dx.doi.org/10.1016/j.plgene.2016.03.007 2352-4073/© 2016 The Author. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Contents lists available at ScienceDirect Plant Gene journal homepage: www.elsevier.com/locate/plantgene

Upload: others

Post on 02-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

Plant Gene 6 (2016) 67–76

Contents lists available at ScienceDirect

Plant Gene

j ourna l homepage: www.e lsev ie r .com/ locate /p lantgene

Narrow-leafed lupin (Lupinus angustifolius L.) functional identification andcharacterization of the aldehyde dehydrogenase (ALDH)gene superfamily

Jose C. Jimenez-Lopez ⁎Plant Reproductive Biology Laboratory, Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Spanish National Research Council (CSIC), 1 ProfesorAlbareda, Granada E-18008, SpainThe UWA Institute of Agriculture, The University of Western Australia, 35 Stirling Highway, Crawley, Perth, WA 6009, Australia

⁎ Corresponding author at: Plant Reproductive BioloBiochemistry, Cell and Molecular Biology of Plants, EstaSpanish National Research Council (CSIC), 1 Profesor Alba

E-mail addresses: [email protected], jcjim

http://dx.doi.org/10.1016/j.plgene.2016.03.0072352-4073/© 2016 The Author. Published by Elsevier B.V

a b s t r a c t

a r t i c l e i n f o

Article history:Received 1 January 2016Received in revised form 25 February 2016Accepted 23 March 2016Available online 28 March 2016

Aldehyde dehydrogenases (ALDHs) constitute aNAD(P)+-dependent superfamily of proteins, which catalyze theirreversible oxidation of aldehydesmolecules to non-toxic carboxylic acids. This superfamily in legume seeds hasa primary role in oxidative stress response under biotic and abiotic stresses as desiccation, cold, salinity, and heatshock factors that largely affect seed germination and seedling growth.Up to date only a study has been dedicated to identify and characterize the complete ALDH superfamily in a le-gume crop, Glycine max, and other few partial studies focusing on their sequence cloning in pea, and fenugreek,despite the importance of legume crops for food sustainability. The consecution of the narrow-leafed lupin (NLL)(Lupinus angustifolius L.) genome sequencing project have resulted in the identification of ALDH genes thatstill need to be comparatively analyzed for systematic identification and ALDH gene families functionalcharacterization.In this study, nineteen unique ALDH sequences that encode for 10 ALDH families were identified inL. angustifolius. 5 additional ALDH families (ALDH5, ALDH6, ALDH12, ALDH18, and ALDH22) are present in thisspecie in comparison to Glycine max. Phylogenetic analysis showed that ALDHswere split into 11 clades contain-ing at least one ALDH gene of NLL. The relationship among families is quite well conserved between legumespecies and among monocot and dicot plants. Structure–functional analysis of ALDH5 and ALDH6 showed acatalytic mechanism which involved C330–E427 couple, and C320/R143-R319, respectively in a specific molec-ular (catalytic) environment. The structure–functional analysis identified potential neo-functionalities only infamilies ALDH2C4 and ALDH7B4.Overall, this study reveals new insights into the functional–structure features and evolution of ALDH gene fam-ilies, particularly concerning NLL, a legume crop. These ALDH functional identification and characterization arevaluable in molecular breeding strategies applied to legume for improvement of stress tolerance and yield.

© 2016 The Author. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).

Keywords:Aldehyde moleculesGenomicsL. albusL. luteusLegume cropsMetabolismNarrow-leafed lupinOxidative stressSweets lupins

1. Introduction

ALDH superfamily is integrated of 23 families of diverse enzymesdisplaying key roles in a broad number of aldehyde molecules, as wellas intermediates or by-products of several fundamentalmetabolic path-ways in both, catabolic and biosynthetic pathways (Ishitani et al., 1995).Aldehyde dehydrogenases (ALDHs) have been found in prokaryotic andeukaryotic organisms, where many ALDH families are highly conserved

gy Laboratory, Department ofción Experimental del Zaidín,reda, Granada E-18008, [email protected].

. This is an open access article under

between plants and animals (Brocker et al., 2013), an indicative of com-mon use of substrates among these enzymes.

ALDH enzymes integrate as cofactor either NAD+ or NADP+ totransform aldehyde molecules to carboxylic acids, indicating thatthis reaction is a major source of reducing equivalents (NADH orNADPH) (Hou and Bartels, 2015), helping to cellular redox balancemaintenance.

Plants are sessile organisms that are continuously being exposed todifferent types of biotic (i.e. pests, insects, fungi), and abiotic stresses(i.e. salinity, desiccation, cold and heat shock). This induces the genera-tion of reactive oxygen species (ROS), leading to an imbalanced cellularaldehyde molecule level, and interfering with steady-state metabolicreactions in cells (Bartels, 2001). In order to protect themselves andcoping with these stresses, plants have to produce a broad range of

the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Page 2: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

68 J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

stress-responsive proteins, which might play crucial roles in stresstolerance and survival (Ni et al., 2009). Among these genes are found al-dehyde dehydrogenase (ALDH) enzymes that contribute to aldehydehomeostasis as ‘scavengers’ to eliminate toxic aldehydes (Rodrigueset al., 2006). Thus, when the cell oxidative damage protecting the sys-tem is overcome, the action of ALDH superfamily enzymes representsanother major functional line of defense.

Up until now, and due to the increased number of sequenced plantgenomes, 13 different families of plant ALDH superfamily, ALDH2,ALDH3, ALDH5 to ALDH7, ALDH10 to ALDH12, ALDH18, ALDH21 toALDH24 have been identified (Yoshida et al., 1998). There are specificfamilies to plants (ALDH10, ALDH12, ALDH21 to ALDH24) (Brockeret al., 2013), and others with mammalian orthologs (ALDH2, ALDH3,ALDH5 to ALDH7 and ALDH18) (Yoshida et al., 1998).

The expression of plant ALDH genes is developmentally regulated,variable in different tissues, and particularly differentially expressed inresponse to environmental stresses, oxidative stress or heavy metals(Kirch et al., 2005; Gao and Han, 2009), thus having a fundamentalrole in adaptation along evolution. Furthermore, ALDH proteins havebeen found in numerous subcellular compartments including cytosol,mitochondria, plastids, peroxisomes and microsomes.

Herein it has been carried out various genome-wide identification ofnew ALDH genes mediating different physiological processes in a num-ber of higher plant species leading to a comprehensive evolutionary andfunctional comparison of the plant ALDH superfamily, includingArabidopsis thaliana (Kirch et al., 2004), Zea mays (Jimenez-Lopezet al., 2010), Oryza sativa (Kotchoni et al., 2010b), Chlamydomonasreinhardtii, O. tauri, the moss Physcomitrella patens (Wood and Duff,2009), Vitis vinifera (Zhang et al., 2012), Populus tricocarpa (Tian et al.,2015), Gossypium raimondii (He et al., 2014), Selaginella moellindorffii(Brocker et al., 2013), and the only legume crop specie Glycine max(Kotchoni et al., 2012), among others.

Narrow-leafed lupin is an important legume crop for human con-sumptions, with demonstrated nutraceutical properties (Villarinoet al., 2015), and having major agricultural benefits as diseases breakin crop rotation (Lucas et al., 2015), and able tofix nitrogen, substantial-ly reducing the need for expensive fertilizers, and enrich the soil forsubsequent crops. The recent publication of NLL genome sequences(http://www.lupinexpress.org/) makes it possible for genome-wideidentification of ALDH superfamily in blue lupin, all of which still needto be functionally characterized.

In the current study we systematically identified 19 ALDH genes be-longing to ten different families in the NLL genome, providing a unifiedand comprehensive functional comparison of the lupin ALDH gene su-perfamily to other sequenced plant species, with the aim of studyingtheir evolutionary relationship, structure–functional features of thenewly identified sequences, as well as the structure-based modelingcatalytic mechanism analysis of ALDH5 and ALDH6 families, which areinvolved in environmental stress responses. The results obtained inour study provided sufficient information that enables us to furthercomparatively study the evolution and functional characterization ofALDH gene families in NLL and other plant species.

2. Material and methods

2.1. Database searches for identification, annotation and characterizationof ALDH genes in L. angustifolius

In order to identify the NLL ALDH gene and protein families, previ-ously identified Arabidopsis-, rice-, maize-, grape-, soybean, cotton, Sor-ghum bicolor, -Selaginella moellendorffii-, poplar-, moss-, algae-, andO. tauri-ALDH sequences (nucleotides and proteins) were retrievedfrom NCBI (http://www.ncbi.nlm.nih.gov/), and used to investigateALDH and ALDH-like DNA sequences of L. angustifolius with BLASTN,TBLASTN, and BLASTX in BLAST (from NCBI, low complexity filter;based on a Blosum62 substitution matrix). Annotation details of the

L. angustifolius genome are available from International lupin GenomeSequencing Project (http://www.lupinexpress.org/), which wasscanned with BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) to checksequences from last release annotations.

Protein motifs of the identified NLL-ALDHs were queried using thePROSITE release 20.121 (http://prosite.expasy.org/scanprosite/), Pfam29.0 (http://pfam.xfam.org/). The retrieved sequences were then dou-ble checked using Pfam 00171 (ALDH family), PS00070 (ALDH cysteineactive site), PS00687 (ALDH glutamic acid active site), KOG2450 (alde-hyde dehydrogenase), KOG2451 (aldehyde dehydrogenase), KOG 2453(aldehyde dehydrogenase) and KOG2456 (aldehyde dehydrogenase) inorder to identify domains of NLL ALDH protein superfamily. Putativefunctionswere thereafter assigned to retrieved proteins based upon sig-nificant similarity to functionally characterized proteins as previouslydescribed (Kotchoni et al., 2010a). The NLL ALDH deduced polypeptideswere then annotated using the established annotation criteria by theALDH Gene Nomenclature Committee (AGNC) (Kotchoni et al., 2010a;Jimenez-Lopez et al., 2010).

2.2. Comparative phylogenetic analysis of L. angustifolius L. ALDH genesand other plant species

In order to develop a comparative phylogenetic analysis of NLLALDHproteins, multiple protein alignments of ALDH protein sequences fromNLL and A. thalianaweremade using ClustalWmultiple sequence align-ment tools (http://www.ebi.ac.uk/Tools/clustalw/) using a Blosum32protein weight matrix, as well as multiple alignment gap opening/extension penalties of 10/0.5 and pairwise gap opening/extensionpenalties of 10/0.1.

Alignments were analyzed by using BioEdit V 7.1.3.0 and portions ofsequences that did not align with high confidence were eliminated.Phylogenetic trees were built using neighbor-joining (NJ) method.Branches of the tree were tested with 1000 bootstrap replicates. Thetree was visualized by using Treedyn (www.treedyn.org). Phylogenetictree was generated by the neighbor-joining method (NJ), and the treebranches were tested with 1000 bootstrap replicates. ALDH sequencesfrom NLL together with these from maize, Arabidopsis, and Glycinemaxwere used to generate and visualize the complete tree as describedpreviously (Kirch et al., 2004; Jimenez-Lopez et al., 2010; Kotchoni et al.,2010b, 2012).

2.3. Narrow-leafed lupin ALDH proteins modeling and structural featuresstudy

The ALDH protein sequences (ALDH5F1, ALDH6B2, ALDH12A1,ALDH18B1/18B2, and ALDH22) were searched for homology in theProtein Data Bank (PDB). The homologous templates suitable for thesesequences were selected by BLAST server (http://ncbi.nlm.nih.gov/).The fold recognition homology BioInfoBank Metaserver (http://meta.bioinfo.pl/) was also used for templates selection. Furthermore, Swiss-model server (swissmodel.expasy.org) was also used for templateidentification, retrieving the following templates3u4j, 3ifg, 4f9i, 3ed6,1t90, 1a4s, 2h5q, 1vlu, 4ujbe, 3iwk, and 4qgk from the PDB database(Jimenez-Lopez et al., 2013a).

ALDH protein models were built by homology modeling usingSWISS-MODEL via the ExPASy web server (swissmodel.expasy.org).An initial structural model was generated and checked for recognitionof errors in 3D structure by using ProSA (prosa.services.came.sbg.ac.at/prosa.php), and also for a first overall quality estimation of the modelwith QMEAN (swissmodel.expasy.org/qmean/cgi/index.cgi). Finalstructures of ALDH proteins were subjected to energy minimizationwith GROMOS96 force field energy implemented in DeepView/Swiss-PDBViewer v3.7 (spdbv.vital-it.ch) to improve the van der Waalscontacts and correct the stereochemistry of themodel. For each sequenceanalyzed, thequality of themodelwas assessed byQMEAN, checking pro-teins stereologywith PROCHECK (www.ebi.ac.uk/thornton-srv/software/

Page 3: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

69J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

PROCHECK), and ProSA (prosa.services.came.sbg.ac.at/prosa.php) pro-grams, as well as the protein energy with ANOLEA (protein.bio.puc.cl/cardex/servers/anolea). The Ramachandran plot statistics for the modelswere also calculated to show the number of protein residues in thefavored regions (Jimenez-Lopez et al., 2012).

The electrostatic Poisson–Boltzmann (PB) potentials for all thestructures were analyzed using APBS (DeLano Scientific LLC) molecularmodeling software implemented in PyMol 0.99 (www.pymol.org) withAMBER99 to assign the charges and radii to all of the atoms (including

Table 1Narrow-leafed lupin ALDH genes and families.

GeneName

ALDH family member Subcellularlocation

Molecula

Family 2 ALDH2B4 ALDH family 2 member B4 Mitochondrial ALDH (N

ALDH2B7 ALDH family 2 member B7 Mitochondrial ALDH (N

ALDH2C4(variant 1)

ALDH family 2 member C4a Cytosol ALDH (Ndehydrog

ALDH2C4(variant 2)

ALDH family 2 member C4b Cytosol ALDH (Ndehydrog

Family 3 ALDH3F1 ALDH family 3 member F1 RE; membrane ALDH [N3-Chlorodehydro

ALDH3H1 ALDH family 3 member H1 RE; Golgi apparatus;membrane; plastids;vacuole

ALDH [N3-Chlorodehydro

ALDH3I1 ALDH family 3 member I1 Chloroplast; plastids ALDH [N3-Chlorodehydro

Family 5 ALDH5F1 Succinate-semialdehydedehydrogenase

Mitochondrial ALDH(Nion bindsemialde[NAD+/N

Family 6 ALDH6B2 Methylmalonate-semialdehydedehydrogenase [acylating]

Mitochondrial ALDH (NCopper isemialde(acetylatMethylmdehydro

Family 7 ALDH7B4a(variant 1)

ALDH family 7 member B4a Cytosol ALDH (N

ALDH7B4b(variant 2)

ALDH family 7 member B4b Cytosol ALDH (N

Family 10 ALDH10A8 Betaine aldehydedehydrogenase 1

Peroxisomal Betaine-

ALDH10A9 Betaine aldehydedehydrogenase 2

Peroxisomal Betaine-

Family 11 ALDH11A3 NADP-dependentglyceraldehyde-3-phosphatedehydrogenase

Cytosol Glyceralddehydro(non-ph

ALDH11A4 NADP-dependentglyceraldehyde-3-phosphatedehydrogenase

Cytosol Glyceralddehydro(non-phion bind

Family 12 ALDH12A1 Delta-1-pyrroline-5-carboxylatedehydrogenase 12A1

Mitochondrial 1-Pyrrolidehydrodehydrozinc ionbinding

Family 18 ALDH18B1 Delta-1-pyrroline-5-carboxylate synthase A

Chloroplast; cytosol;membrane

ATP bind5-carboxglutamatglutamatdehydro

ALDH18B2 Delta-1-pyrroline-5-carboxylate synthase B

Chloroplast; cytosol;membrane

ATP bind5-carboxglutamatglutamatdehydro

Family 22 ALDH22A1 ALDH family 22 member A1 Chloroplast, cytosoland membrane

ALDH (N

hydrogens), whichwere added andoptimizedwith the Python softwarepackage PDB2PQR. Fine grid spaces of 0.35 Å were used to solve the lin-earized PB equation in sequential focusing multigrid calculations in amesh of 130 points per dimension at 310.00 K. The dielectric constantswere 2.0 for the protein and 80.00 forwater. The outputmeshwas proc-essed in the scalar OpenDX format to render isocontours andmaps ontothe surfaces with PyMOL 0.99 (Jimenez-Lopez et al., 2013b). Potentialvalues are given in units of kT per unit charge (k Boltzmann's constant;T temperature).

r function Signature NLL Genome sequencingproject number

Reference

AD+); ATP-binding E306-C339 comp32384_c0_seq2 Currentstudy

AD+) E302-C336 comp49635_c0_seq3 Currentstudy

AD+); Coniferyl-aldehydeenase

E270-C304 comp36766_c0_seq4 Currentstudy

AD+); Coniferyl-aldehydeenase

E268-C300 comp65564_c0_seq1 Currentstudy

AD+/NAD(P)+];allyl aldehydegenase

– comp23074_c0_seq2 Currentstudy

AD+/NAD(P)+];allyl aldehydegenase

E222 comp59788_c0_seq3 Currentstudy

AD+/NAD(P)+];allyl aldehydegenase

– comp32715_c0_seq5 Currentstudy

AD) activity; Coppering; Succinate-hyde dehydrogenaseAD(P)+]

E296-C330 comp69150_c1_seq2 Currentstudy

AD+);on binding; Malonate-hyde dehydrogenaseing) activity;alonate-semialdehydegenase (acylating)

C320 comp68392_c0_seq1 Currentstudy

AD+) E267 comp193_c0_seq1 Currentstudy

AD+) E267 comp70985_c0_seq1 Currentstudy

aldehyde dehydrogenase E260-C294 comp38531_c0_seq2 Currentstudy

aldehyde dehydrogenase E260-C294 comp69633_c0_seq2 Currentstudy

ehyde-3-phosphategenase (NADP+)osphorylating)

E266-C300 comp4265_c0_seq1 Currentstudy

ehyde-3-phosphategenase (NADP+)osphorylating); cobalting

– comp63738_c0_seq3 Currentstudy

ne-5-carboxylategenase activity; aldehydegenase (NAD) activity;binding; Cobalt ion

C333 comp69369_c0_seq3 Currentstudy

ing; Delta1-pyrroline-ylate synthetase activity;e 5-kinase activity;e-5-semialdehydegenase activity

PS00902–PS01223 comp57033_c0_seq3 Currentstudy

ing; Delta1-pyrroline-ylate synthetase activity;e 5-kinase activity;e-5-semialdehydegenase activity

PS00902–PS01223 comp57033_c0_seq2 Currentstudy

AD+) E16-C50 comp24706_c0_seq1 Currentstudy

Page 4: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

70 J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

3. Results and discussion

3.1. Characteristics of ALDH gene families in L. angustifolius

In the present study, it has been identified 19 ALDH gene sequencesfromNLL (Tables 1), that code for members of 10 ALDH protein families(ALDH2, ALDH3, ALDH5, ALDH6, ALDH7, ALDH10, ALDH11, ALDH12,ALDH18, and ALDH22) which were previously identified in otherplant species (Table 1), with the exception of ALDH19. The classificationand naming of protein families were made according to AGNC(Kotchoni et al., 2010a), where protein root symbols (ALDH) werefollowed by a family designation number (2, 3, 5, 6, 7, 10, 11, 12, 18,and 22), a subfamily designator (A, B, C, D etc.) the individual genenum-ber (based on chromosomal positionwithin each subfamily), and a low-

Table 2ALDH family members identified in plants. Each ALDH gene family in corresponding plant spe

ALDH fam

SpecieCommon

name2 3 4 5 6 7 8 9 10 11 12 13

Lupinusangustifolius

NLL or Bluelupin

4 3 – 1 1 2 – – 2 2 1 –

Glycine max Soybean 5 1 – – – 4 – – 6 2 – –

Arabidopsisthaliana

Thale cress 3 3 – 1 1 1 – – 2 1 1 –

Chlamydomonasreinhardtii

Unicellulargreen algae

1 – – 1 1 – – – 1 1 1 –

Eutremaparvulum

algae 3 3 – 1 1 1 – – 2 1 1 –

Eutremasalsugineum

algae 3 4 – 1 1 1 – – 2 1 1 –

Gossypiumraimondii

New WorldCotton

8 6 – 1 3 1 – – 2 3 1 –

Malusdomestica

Apple 13 7 – 2 2 2 – – 2 3 2 –

Oryza sativa Asian rice 5 5 – 1 1 1 – – 2 1 1 –

Ostreococcustauri

unicellulargreen alga

– 1 – 1 – – – – 1 1 1 –

Physcomitrellapatens

Moss 2 5 – 2 1 1 – – 1 5 1 –

Populustrichocarpa

Blackcottonwood

4 6 – 1 4 2 – – 2 3 1 –

Selaginellamoellendorffii

GemmiferousSpikemoss

6 2 – 1 1 1 – – 1 6 1 –

Setaria italicaFoxtailMillet

6 4 – 1 1 1 – – 2 1 1 –

Sorghumbicolor

Sorghum 5 4 – 1 1 1 – – 2 1 1 –

Vitis viniferaCommon

grape vine5 4 – 3 3 2 – – 2 2 1 –

Volvox carteriColonial

green algae1 – – – 1 – – – 1 1 1 –

Zea mays maize 6 5 – 2 1 1 – – 3 1 1 –

“-” no member of the ALDH gene family was identified in the corresponding species. NLL: Narr

case letter to designate the number of variant (a, b, c, d) as illustrated inTable 1.

The ALDHs identified in L. angustifolius encode proteins that rangedfrom 165 to 726 amino acids (aa) in length, with predicted isoelectricpoints (pIs) from 5.14 to 7.03. Four families (5, 6, 12, and 22) were rep-resented by one gene,whilewe observed that families 2, 3, 7, 10, 11, and18, were represented by more than a gene member in NLL, similar tomany other plant species as Arabidopsis, sorghum, rice, and differentalgae (Table 2), probably because these four families were house-keepingALDHs, involved in key fundamental plant roles, i.e.metabolismand maintenance balance of nontoxic molecule concentrations. Twofamilies (2, and 3) were comparatively abundant in NLL. In comparisonto soybean, the other legume species, NLL showed ALDH gene se-quences for ALDH classes 5, 6, 12, 18 and 22, which are missed in

cies is shadowed and including the number of members.

ilies

14 15 16 17 18 19 20 21 22 23 24ALDH

numberReference

– – – – 2 – – – 1 – – 19Currentstudy

– – – – – – – – – – – 18Kotchoni

et al. (2012)

– – – – 2 – – – 1 – – 16Kirch et

al. (2004)

– – – – 1 – – – 1 – 1 9Wood and

Duff (2009)

– – – – 2 – – – 1 – – 16Hou andBartels(2014)

– – – – 2 – – – 1 – – 16Hou andBartels(2014)

– – – – 4 – – – 1 – – 30He et al.(2014)

– – – – 4 – – – 2 – – 39Li et al.(2013)

– – – – 2 – – – 1 – – 20Gao and

Han (2009)

– – – – – – – – 1 – – 6Wood and

Duff (2009)

– – – – 1 – – 1 1 – 21Wood and

Duff (2009)

– – – – 2 – – – 1 – – 26Tian et al.

(2015)

– – – – 1 – – 1 1 2 – 24Brocker etal. (2013)

– – – – 2 – – – 1 – – 20Chen et al.

(2014)

– – – – 2 – – – 1 – – 19Paterson

et al. (2009)

– – – – 2 – – – 1 – – 25Zhang etal. (2012)

– – – – 1 – – – 1 – – 7Prochnik

et al. (2010)

– – – – 2 – – – 1 – – 23Jimenez–Lopez et

al. (2010)

ow-leafed lupin.

Page 5: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

Fig. 1. Comparative phylogenetic analysis of Lupinus angustifolius L. ALDHs. Neighbor-joining(NJ) method was used to perform a phylogenetic analysis of 75 ALDH proteins fromL. angustifolius L. (La), Arabidopsis thaliana (At), Zea mays (Zm), and Glycine max (Gm).

71J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

soybean, but in absolute numbers, NLL only has 1 member more, sincesoybean hasmore abundance in the number of members for ALDH clas-ses 2, 3, 7 and 10. Family 2 was represented by 4 gene members in NLL(the most abundant in number of members), followed by family 3 with3members. Families 7, 10, 11 and 18 are represented by 2 genes, whichis an average number for many of the other species analyzed (Table 2).Understanding biological implications of the presence of one ALDHmember in some families (5, 12 and 22) in NLL, and the multiple mem-ber duplications in others (family 2) is of significant functional value.

ScanProsite analysis indicated that characteristic PS00687 andPS00070 domains were absent in some of the ALDH sequences. 8 out19 genes contained PS00070 domains (ALDH cysteine active site) andPS00687 (ALDH glutamic acid active site), which are usually found inthe ALDH protein superfamily; three sequences contained only one ofthese domains (PS00687 domain was absent in ALDH6 and ALDH12;PS00070 domain was absent in ALDH3H and ALDH7). Although someproteins did not contain a PS00687 or a PS00070 domain (ALDH3F,ALDH3I, ALDH11A4), conserved domain searches in NCBI http://www.ncbi.nlm.nih.gov/ within these proteins indicated that they belongedto the ALDH superfamily. PS00902 domain (Glutamate 5 kinase signa-ture) and PS01223 domain (γ-glutamyl phosphate reductase signature)were found in two sequences ALDH18B1 and ALDH18B2.

All ALDH gene families identified in higher plants such asArabidopsis were presented in NLL (Table 2), but many of these families(5, 6, 12, 18, and 22) missed in other close species as Glycine max. Com-pared to other well characterized plant ALDHs, NLL ALDH families are inthe average number of abundance with 19 genes, together with 19 inSorghum, 18 in soybean, 20 in rice, 21 in P. patens, 20 in foxtail millet,but one of the species having lower abundance in these genes togetherwith algae (Table 2). This lack of ALDH genesmay be due to the domes-tication process as happening with G. max (Kotchoni et al., 2012). Be-sides that, NLLS seem to have the necessary stress–response proteinsamong ALDHs, enabling it to tolerate environmental stress such asdrought, i.e. gene numbers in particular ALDH families as ALDH2,ALDH3, ALDH5, ALDH7 and ALDH11. ALDH5, ALDH6, ALDH12 andALDH22 were the smallest families, with only one member amongmost species, including NLL.

3.2. Phylogenetic analysis of NLL ALDH genes

In order to examine the phylogenetic relationship among NLL ALDHgenes including a comparative analysis with a close legume specieG. max, and the model plants Arabidopsis, and maize. 19, 18, 16, and23 full-length ALDH protein sequences, respectively, identified inthese species were aligned to further generate a phylogenetic tree.Fig. 1 shows 11 groupswith ALDHs from the same families clustered to-gether. The largest clades were clade 9 (made of family 10), clade 1(made of family 2), and clade 10 integrated also by family 2; and thesmaller clade includes family 22 (clade 2), clade 3 (family 12), andclade 7 (family 6). ALDH members of the same family did not alwayscluster together, i.e. clade 4 (families 18 frommaize and NLL, and family11 of NLL).

To understand the functional relevance ofmembers of theNLL ALDHprotein families, we established phylogenetic relationships amongALDHs from these four species. The phylogenetic tree (Fig. 1) indicatesthat NLL ALDHs share two common plant ALDH family cores, one di-verged to members of the family 2B (with the exception of maize),and another core that included the rest of the members of the ALDHfamilies for the four species analyzed. Among these families, ALDH3was the most distantly related family in the phylogeny for these fourspecies, followed by ALDH22, ALDH12 and ALDH18. This finding is inagreement with previous research in grape (Zhang et al., 2012), andrice (Kotchoni et al., 2010b), where ALDH3 and ALDH12 proteins arehighly divergent in sequence from the other ALDH families and didnot contain the conserved ALDH active sites (Tian et al., 2015). ALDH3is one of the most expanded and diverse group among plant ALDH

gene superfamily, where its members are differentially expressed in re-sponse to environmental stresses such as salinity, oxidants, pesticides,heavy metals, and dehydration (Kirch et al., 2001; Stiti et al., 2011;Sunkar et al., 2003), while the ALDH12 family may play a key role in thedegradation of proline (and also arginine) into glutamate, and it is widelydistributed in bacteria, fungi, plants and animals (Brocker et al., 2013).

Higher plants like cotton, apple, grape andmaize, seem to havemoreALDH genes than animals and fungi. Unlike mammals, plants cannotmove and are therefore more susceptible to environmental stress fac-tors and as a result theymay require additional stress-response proteinssuch as ALDHs to protect them when exposed to abiotic and bioticstresses (Rejeb et al., 2014). However, for lupin and soybean the domes-tication process may be responsible for the reduced number of ALDHgenes in comparison with other species included in Table 2, evenmore patent for soybean. This process seems to be affecting in a differ-ent manner the other domesticated plants as maize or apple that stillconserves a large number of ALDH genes. Thus, other factors as stressresponse adaptation may have more influence in the number of ALDHgene conservation (Boersma et al., 2005; Nelson et al., 2006).

These genetic events of gene loss and/or abundance would alsooccur in the ALDH superfamily. Among the 24 ALDH families, plantsALDH are present in 14 families, and seven (ALDH11, ALDH12,ALDH19, ALDH21, ALDH22, ALDH23 and ALDH24) are unique to plants.So far, a single gene of the ALDH19 family has only been identified in to-mato and it is also unique to plants (García-Ríos et al., 1997). ADH21and ALDH23 are unique to P. patens and S. moellendorffii, whileALDH24 is unique to C. reinhardtii (Table 2). Narrow-leafed lupin andother vascular plants share ten common core ALDH families (ALDH2,ALDH3, ALDH5, ALDH6, ALDH7, ALDH10, ALDH11, ALDH12, ALDH18,and ALDH22), suggesting that these ten families evolved prior to themonocot/eudicot divergence. Eight of the ten core families (ALDH2,ALDH3, ALDH5, ALDH6, ALDH10, ALDH11, ALDH12 and ALDH22) arealso shared by land plants and algae, suggesting that these familieshave ancient origins preceding the transition of aquatic plants ontoland (Table 2).

Page 6: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

72 J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

3.3. Structure–functional analysis of NLL ALDH with a focus on ALDH5 andALDH6

The ALDH gene superfamily has been systematically characterized invarious organisms (Hou and Bartels, 2015). The crystallographic struc-tures of selected ALDHs deposited in the Protein Database (PDB) havemade it possible to study theALDH families structure–functional relatedfeatures (Jimenez-Lopez et al., 2010; Kotchoni et al., 2010a). To ourknowledge, structural modeling and conformational feature compari-sons of ALDH protein members of this superfamily have only been per-formed in few organisms as rice (Kotchoni et al., 2010b) or maize(Jimenez-Lopez et al., 2010). Using computational modeling, we deter-mined the structural features in 3D structures of the active sites and theNAD(P)+-ring binding clefts of the members of all the NLL ALDH fami-lies (Fig. 2), with a focus on ALDH5 family (Fig. 3) catalytic cleft andmechanism. Each sequence was modeled based on the best structuraltemplates, where the first models obtained were refined using thestructural parameters summarized in Material and methods.

Fig. 2, Figs. S1 and S2 show that the greatest structural differencesbetween families were located in the oligomerization region (length,

Fig. 2. Structural analysis of Lupinus angustifolius L. ALDH2 and ALDH3 families. Three-dimensioand variant 2, 3F1, 3H1, and 3I1. Structures were depicted as a cartoon diagram. α-helices, βelectrostatic potential representation rotated 180 around the x-axis are provided for all memb

number, curvature angle and folding of secondary structures) wherethe global topologywas quite similar amongmembers of the same fam-ily, with a big cofactor domain for the ALDH18. However, we found thelargest differences in the angles of curvature of the coil structures in thecatalytic domain for all members of the ALDH6 and ALDH12, which areprojected outward from the general structure. Protein residues that arelinked to biological processes such as protein–protein and protein–ligand interactions are most likely solvent accessible, whereas theresidues implicated in protein structure and folding stability are locatedin the core of the protein.

The Adaptive Poisson–Boltzmann Solver (APBS) package (http://www.poissonboltzmann.org/) was used to generate the electrostaticsurface potentials for the most representative members of the NLLALDH families, as shown in Figs. 2 and 3, Figs. S1 and S2. We examinedthe charge distribution and patches that differentiate the family's mem-bers. The color in the models depicts the different surface properties,with red representing negative charge, blue positive and white neutral.Each protein is represented by two surface plots/views, which corre-spond to 180° rotations around the vertical (Z) axis. Although the over-all topologies of these proteins are quite similar (except for ALDH18 and

nal structure of NLL ALDH2 and ALDH3 corresponding to the members 2B4, 2C4 variant 1-sheets and coils are depicted in red, yellow and green, respectively. Two views of theers of both ALDH families. The surface colors are clamped at red (−10) or blue (+10).

Page 7: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

Fig. 3. Lupinus angustifolius L. ALDH5F1 protein structure, coenzyme and ligand-binding domain analysis. A) Three-dimensional structure of NLL ALDH5F1. Structures were depicted as acartoon diagram. α-helices, β-sheets and coils are depicted in red, yellow and green, respectively. Two views rotated 180 around the x-axis are provided for ALDH5F1 superfamily of theelectrostatic potential representation on the ALDH5F1 protein surfaces. The surface colors are clamped at red (−10) or blue (+10). B) Detailed view of the ALDH5F1 chain and the active(catalytic) site, and the spatial distribution of the coenzyme and amino acids involved in holding NAD(P)+, where critical catalytic (C330 and E427) residues are depicted as stick andcolored according with atoms. C) Hydrogen-bonding interactions in the ALDH5F1 coenzyme domain and its interaction with NAD(P)+. Hydrogen bonds are shown in broken lines.D) Detailed view of the cofactor-binding domain showing residues configuring this domain, highlighted in purple color in proper position related to the cofactor. E) Proposed catalytic(succinic semialdehyde – SSA – deshydrogenase) mechanism for NLL ALDH5F1 (adapted from Zhang and Liu, 2015).

73J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

ALDH22), we have observed several differences. A specific positive elec-trostatic potential distribution dominates the oligomerization domainsurface, particularly in ALDH2, 3, 5, 6, and 12 which is the largest posi-tively charged surface, included the polymerization region, whichspanned the cofactor binding domain. These charge distribution pat-terns might correlate well with their different functional properties,and large contacts for dimerization of subunits. In addition, the distribu-tion of these charges denotes differences in the mechanism of actionand/or interaction with other proteins and intracellular localization.Furthermore, the surfaces of the catalytic domain and the cofactor-binding domain contained themost profound differences in charge dis-tributions, being clearly positively charged in the family ALDH3.

ALDH crystal structures have exhibited conformational flexibility foraccommodation of NAD+ cofactor, reflecting a functional dynamic pref-erence for different forms of NADH/NAD+ cofactor (Jimenez-Lopez

et al., 2010; Kotchoni et al., 2010a; Pemberton et al., 2014). In thisstudy, using computational modeling for different members of NLLALDH superfamily provided structural insight into the shape of the var-ious ALDH catalytic clefts and enables us to study the important struc-tural features that dictate cofactor specificity – the NAD+ bindingpocket – for ALDH5F1 (Fig. 3A), a succinic semialdehyde dehydrogenasewhich catalyzes the conversion of succinic semialdehyde (SSA) to succi-nate during the last step of γ-aminobutyrate (GABA) catabolism. Thevariability of the binding pockets reflects the functional differences be-tween ALDH families as i) potential cellular localization, where ALDH5in Arabidopsis was found to be localized to the mitochondria and isalso predicted to be a mitochondrial protein in other plant species(Bouche et al., 2003; Gao and Han, 2009) as was predicted for NLL, ii)potential activities, i.e. GABA in plants is associated with pollen–pistilinteractions, oxidative stress and hypoxia (Fait et al., 2008; Palanivelu

Page 8: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

74 J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

et al., 2003); ALDH5 also respond (up-regulation) to water–stress (Gaoand Han, 2009; Kirch et al., 2005; Zhang et al., 2012). The ALDHs areknown to have variable conformations when compared to non-homologous proteins, type of ligand molcules, as well as the shapes ofbinding pockets that could hold similar ligands (Kahraman et al., 2007).

Similarly, the residue conservation of the binding site and structuralcomparisons of NADP+-dependent ALDHs with known NADP+-dependent conformation is crucial for predicting the cofactor specificityand the enzymatic mechanism (Fig. 3B). There is a conserved E residuein different positions of the primary sequence that is located on the op-posite side of the NAD ring from another conserved C residue; in thecase of ALDH5F1 is the pair E427–C330. These residues have been impli-cated in the nucleophilic attack and proton abstraction from the Cresidue during the ALDH reaction (Fig. 3C), and crucial feature thatinfluences the thiol extraction step during catalysis by the differentALDHs.

Fig. 4. Lupinus angustifolius L. ALDH6B2 protein structure, coenzyme and ligand-binding domaicartoon diagram.α-helices, β-sheets and coils are depicted in red, yellow and green, respectiveelectrostatic potential representation on the ALDH6B2 protein surfaces. The surface colors are c(catalytic) site, and the spatial distribution of the coenzyme and amino acids involved in holaccording with atoms. C) Hydrogen-bonding interactions in the ALDH6B2 coenzyme domain aview of the cofactor- and catalytic binding domain showing residues configuring this domE) Proposed catalytic (methylmalonyl semialdehyde – MMSA – dehydrogenase) mechanism fo

The study of the ALDH5F1 family member revealed that itssubstrate-binding site is formed by an aromatic cluster mainly com-posed of W197, F199, F427, a polar residue S275, and R206 and Q328residues. These different residues included in the catalytic domains(Fig. 3B) arewell conserved,waswell as the environment of the domainholding the co-factor (Fig. 3C and D), integrated by T195, P196, W197,K221, E224, A254, G274, S275, V278, K380, E427, and F429.

This ALDH variant is able to accommodate the NAD+ coenzyme, andconservesmost of the residues integrating the catalytic domain as E427,the driving catalytic reaction C330, where the proposed mechanism forthe ALDH5F1 (SSADH) involves two processes (Fig. 3E), i) acylation andii) deacylation. In the acylation process, the deprotonated C330 residueattacks on the carbonyl carbon of succinic semi-aldehyde to form athiohemiacetal intermediate. The hydride ion of the tetrahedric inter-mediate transfers to the C4 atom of the nicotinamide ring of NAD(P)+,generating the thioester intermediate. In the following deacylation

n analysis. A) Three-dimensional structure of NLL ALDH6B2. Structures were depicted as aly. Two views rotated 180 around the x-axis are provided for ALDH6B2 superfamily of thelamped at red (−10) or blue (+10). B) Detailed view of the ALDH6B2 chain and the activeding NAD(P)+, where critical catalytic (C320) residues are depicted as stick and colorednd its interaction with NAD(P)+. Hydrogen bonds are shown in broken lines. D) Detailedain, highlighted in light and deep blue color in proper position related to the cofactor.r NLL ALDH6B2 (adapted from Talfournier et al., 2011).

Page 9: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

75J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

process, the hydrolytic water molecule is activated by a nearby gluta-mate residue (E427) and the generated hydroxyl attacks the carbonylcarbon of succinic semi-aldehyde to form the second tetrahedral inter-mediate. The S–C bond of the intermediate cleaves to produce thefinal product semi-aldehyde and regenerate the C330 side chain(Marchal et al., 2001; Tsybovsky and Krupenko, 2011).

Therefore, we have identified a member of the ALDH6 family, alsoknown as methylmalonyl semialdehyde (MMSA) dehydrogenases,which has been named as ALDH6B2 (Table 1). These enzymes are in-volved in the catabolism of both valine and pyrimidine. ALDH6 familyis the only ALDH superfamily members to use NAD+, and coenzyme A(CoA) as cofactors, to catalyze the conversion of MMS to propionyl-CoA (Marchitti et al., 2008). In addition, it has been demonstrated thatALDH6 enzymes are capable of metabolizing malonate semialdehydeto acetyl-CoA (Marchitti et al., 2008; Oguchi et al., 2004). However,the exact functions of theALDH6 family in plants are yet to be thorough-ly revealed, but its physiological function is supported since ALDH6B7and ALDH6B3 are up-regulated under salinity and dehydration treat-ments (Zhang et al., 2012).

In the present study, the structure–functional features accompany-ing ALDH6B2 identified in NLL has been analyzed (Fig. 4). Fig. 4Ashows the structural shape of the ALDH6B2 catalytic clefts, allowing usto analyze the structural and electrostatic nature in the enzymerequired by cofactor specificity – the NAD+ binding pocket – forALDH6B2. Sequence alignment of NLL ALDH6B2 together with otherMSDH members showed a conserved catalytic C320, as well as R143and R319 residues (Fig. 4B), which are only conserved in MSDH se-quences, and in NLL as well. These two R residues are essential in thebinding of MMSA via stabilizing interactions through electrostatic con-tact between the carboxylate and the guanidinium groups (Talfournieret al., 2011). Therefore, R143 and R319 residues participate not only inMMSA binding via electrostatic interaction with the carboxylate butalso likely in positioning MMSA efficiently relative to C320 within theMSDH–NAD–MMSA ternary complex (Talfournier et al., 2011). Thisstabilization is not unique within the ALDH superfamily, since ALDH5family succinate semialdehyde dehydrogenase, has a similar situation(Sohling andGottschalk, 1993;Witkowski et al., 2002),where the stabi-lization of the carboxylate group of the substrate is mainly achievedthrough hydrogen bonds with at least R206 residue (Fig. 3B).

Beside the C–R–R residues, we have uncovered the key residues andtheir 3D distribution in the catalytic domain, that are involved in the en-vironment of the domain holding the co-factor NAD+, and its interac-tion and stabilization such as C186, K212, E215, S265, M287, and E420(Fig. 4C, and D). The proposed mechanism for the ALDH6B2 (MSDH)involves two processes (Fig. 4E), i) acylation and ii) deacylation. Theacylation process includes the deprotonated C320 residue does anucleophilic attack on the carbonyl carbon of MMSA to form athiohemiacetal (oxyanion) intermediate. The hydride ion intermediatetransfers to the C4 atom of the nicotinamide ring of NAD(P)+, generat-ing the thioacyl- intermediate, following the release of reduced co-factor NAD(P)H, and a bicarbonate ion (decarboxylation). These re-leases made another thioacyl- intermediate. From this step, deacylationprocess, a transthioesterification (–SCoA) or the hydrolytic activatedwater molecule (hydroxyl) could attack the carbonyl carbon of MSAintermediate, to the S–C bond of the intermediate cleaves to producethe final product semi-aldehyde (SA), SA–CoA, or a carbonic acid.

4. Conclusion

The aldehyde dehydrogenases represent a large gene superfamilyencoding NAD(P)+-dependent enzymes involved in a variety of metab-olism reactions, and particularly to detoxification of molecules generat-ed under different environmental stresses. The ALDH gene superfamilyhas been identified and reviewed in different organisms includingplants, but no systematic and structure–functional analyses have beenmade to date for NLL, an important legume crop.

We have performed a comprehensive study including NLL genomeanalysis, ALDH gene identification and naming, comparative phylogeny,structure–functional analysis of ALDH gene families with a focus inALDH5F1 and ALDH6B2. A total of 19 NLL ALDH genes have been iden-tified in its genome. These geneswere grouped into 10 families and pro-viding a unifiednomenclature for the deducedALDHpolypeptides usingthe criteria established by the ALDH Gene Nomenclature Committee(AGNC). Phylogenetic analysis indicates that ALDHs of NLL were splitinto two big clades, but these are divided in a total number of 11 cladeswhere ALDH families were well grouped in relationship to Arabidopsis,maize, and soybean ALDH families. Organization of ALDH families, sub-cellular localization, and structure–functional features of ALDHgenes are fairly conserved in NLL, with the exception of ALDH18 andALDH22 showing a more differential structure and possibly functionalfeatures.

The catalytic mechanism of both ALDH5F1 and ALDH6B2 genefamilies conserve a C-E pair and C/R-R residues, respectively responsibleto initiate the nucleophilic attack and the course of the reaction, buteach one in a specific molecular environment.

Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.plgene.2016.03.007.

Competing interests

The author has declared that no competing interests exist.

Acknowledgments

JCJ-L thanks the European Research Program MARIE CURIE (FP7-PEOPLE-2011-IOF) for the grant ref. number PIOF-GA-2011-301550,and the Spanish Ministry of Economy and Competitiveness for thegrant ref. number RYC-2014-16536 (Ramon y Cajal Research Program).

References

Bartels, D., 2001. Targeting detoxification pathways: an efficient approach to obtain plantswith multiple stress tolerance. Trends Plant Sci. 6, 284–286.

Boersma, J.G., Pallotta, M., Li, C., Buirchell, B.J., Sivasithamparam, K., Yang, H., 2005. Con-struction of a genetic linkage map using MFLP and identification of molecularmarkers linked to domestication genes in narrow-leafed lupin (Lupinus angustifoliusL.). Cell. Mol. Biol. Lett. 10, 331–344.

Bouche, N., Fait, A., Bouchez, D., Moller, S.G., Fromm, H., 2003. Mitochondrial succinic-semialdehyde dehydrogenase of the gamma-aminobutyrate shunt is required to re-strict levels of reactive oxygen intermediates in plants. Proc. Natl. Acad. Sci. U. S. A.100, 6843–6848.

Brocker, C., Vasiliou, M., Carpenter, S., et al., 2013. Aldehyde dehydrogenase (ALDH) su-perfamily in plants: gene nomenclature and comparative genomics. Planta 237,189–210.

Chen, Z., Chen, M., Xu, Z.S., Li, L.C., Chen, X.P., Ma, Y.Z., 2014. Characteristics and expres-sion patterns of the aldehyde dehydrogenase (ALDH) gene superfamily of foxtail mil-let (Setaria italica L.). PLoS One 9, e101136.

Fait, A., Fromm, H., Walter, D., Galili, G., Fernie, A.R., 2008. Highway or byway: the meta-bolic role of the GABA shunt in plants. Trends Plant Sci. 13, 14–19.

Gao, C., Han, B., 2009. Evolutionary and expression study of the aldehyde dehydrogenase(ALDH) gene superfamily in rice (Oryza sativa). Gene 431, 86–94.

García-Ríos, M., Fujita, T., LaRosa, P.C., et al., 1997. Cloning of a polycistronic cDNA fromtomato encoding γ-glutamyl kinase and γ-glutamyl phosphate reductase. Proc.Natl. Acad. Sci. U. S. A. 94, 8249–8254.

He, D., Lei, Z., Xing, H., Tang, B., 2014. Genome-wide identification and analysis of the al-dehyde dehydrogenase (ALDH) gene superfamily of Gossypium raimondii. Gene 549,123–133.

Hou, Q., Bartels, D., 2015. Comparative study of the aldehyde dehydrogenase (ALDH) genesuperfamily in the glycophyte Arabidopsis thaliana and Eutrema halophytes. Ann. Bot.115, 465–479.

Ishitani, M., Nakamura, T., Hart, S.Y., Takabe, T., 1995. Expression of the betaine aldehydedehydrogenase gene in barley in response to osmotic stress and abscisic acid. PlantMol. Biol. 27, 307–315.

Jimenez-Lopez, J.C., Gachomo, E.W., Seufferheld, M.J., Kotchoni, S.O., 2010. The maizeALDH protein superfamily: linking structural features to functional specificities.BMC Struct. Biol. 10, 43.

Jimenez-Lopez, J.C., Kotchoni, S.O., Hernandez-Soriano, M.C., Gachomo, E.W., Alché, J.D.,2013a. Structural functionality, catalytic mechanism modelling and molecular aller-genicity of phenylcoumaran benzylic ether reductase, an olive pollen (Ole e 12) aller-gen. J. Comput. Aided Mol. Des. 27, 873–895.

Page 10: Narrow-leafed lupin (Lupinus angustifolius L.) functional … · 2017. 1. 21. · Narrow-leafedlupin(Lupinusangustifolius L.)functionalidentificationand characterization of the aldehyde

76 J.C. Jimenez-Lopez / Plant Gene 6 (2016) 67–76

Jimenez-Lopez, J.C., Kotchoni, S.O., Rodríguez-García, M.I., Alché, J.D., 2012. Structure andfunctional features of olive pollen pectin methylesterase using homology modelingand molecular docking methods. J. Mol. Model. 18, 4965–4984.

Jimenez-Lopez, J.C., Rodríguez-García, M.I., Alché, J.D., 2013b. Analysis of the effects ofpolymorphism on pollen profilin structural functionality and the generation of con-formational, T- and B-cell epitopes. PLoS One 8, e76066.

Kahraman, A., Morris, R.J., Laskowski, R.A., Thornton, J.M., 2007. Shape variation in proteinbinding pockets and their ligands. J. Mol. Biol. 368, 283–301.

Kirch, H.H., Nair, A., Bartels, D., 2001. Novel ABA- and dehydration-inducible aldehyde de-hydrogenase genes isolated from the resurrection plant Craterostigma plantagineumand Arabidopsis thaliana. Plant J. Cell Mol. Biol. 28, 555–567.

Kirch, H.-H., Bartels, D., Wei, Y., Schnable, P.S., Wood, A.J., 2004. The ALDH gene superfam-ily of Arabidopsis. Trends Plant Sci. 9, 371–377.

Kirch, H.-H., Schlingensiepen, S., Kotchoni, S., Sunkar, R., Bartels, D., 2005. Detailed expres-sion analysis of selected genes of the aldehyde dehydrogenase (ALDH) gene super-family in Arabidopsis thaliana. Plant Mol. Biol. 57, 315–332.

Kotchoni, S.O., Jimenez-Lopez, J.C., Gachomo, E.W., Seufferheld, M.J., 2010a. A new andunified nomenclature for male fertility restorer (RF) proteins in higher plants. PLoSOne 5, e15906.

Kotchoni, S.O., Jimenez-Lopez, J.C., Gao, D., Edwards, V., Gachomo, E.W., Margam, V.M.,Seufferheld, M.J., 2010b. Modeling-dependent protein characterization of the rice al-dehyde dehydrogenase (ALDH) superfamily reveals distinct functional and structuralfeatures. PLoS One 5, e11516.

Kotchoni, S.O., Jimenez-Lopez, J.C., Kayodé, A.P.P., Gachomo, E.W., Baba-Moussa, L., 2012. Thesoybean aldehyde dehydrogenase (ALDH) protein superfamily. Gene 495, 128–133.

Li, X., Guo, R., Li, J., Singer, S.D., Zhang, Y., Yin, X., Zheng, Y., Fan, C., Wang, X., 2013.Genome-wide identification and analysis of the aldehyde dehydrogenase (ALDH)gene superfamily in apple (Malus × domestica Borkh.). Plant Physiol. Biochem. 71,268–282.

Lucas, M.M., Stoddard, F.L., Annicchiarico, P., Frías, J., Martínez-Villaluenga, C., et al., 2015.The future of lupin as a protein crop in Europe. Front. Plant Sci. 6, 705.

Marchal, S., Cobessi, D., Rahuel-Clermont, S., Tête-Favier, F., Aubry, A., Branlant, G., 2001.Chemical mechanism and substrate binding sites of NADP-dependent aldehyde de-hydrogenase from Streptococcus mutans. Chem. Biol. Interact. 130-132, 15–28.

Marchitti, S.A., Brocker, C., Stagos, D., Vasiliou, V., 2008. Non-P450 aldehyde oxidizing en-zymes: the aldehyde dehydrogenase superfamily. Expert Opin. Drug Metab. Toxicol.4, 697–720.

Nelson, M.N., Phan, H.T.T., Ellwood, S.R., Moolhuijzen, P.M., Hane, J., Williams, A., et al.,2006. The first gene-based map of Lupinus angustifolius L.—location of domesticationgenes and conserved synteny with Medicago truncatula. Theor. Appl. Genet. 113,225–238.

Ni, F.-T., Chu, L.-Y., Shao, H.-B., Liu, Z.-H., 2009. Gene expression and regulation of higherplants under soil water stress. Curr. Genomics 10, 269–280.

Oguchi, K., Tanaka, N., Komatsu, S., Akao, S., 2004. Methylmalonate-semialdehyde dehy-drogenase is induced in auxin-stimulated and zinc-stimulated root formation inrice. Plant Cell Rep. 22, 848–858.

Palanivelu, R., Brass, L., Edlund, A.F., Preuss, D., 2003. Pollen tube growth and guidance isregulated by POP2, an Arabidopsis gene that controls GABA levels. Cell 114, 47–59.

Paterson, A.H., Bowers, J.E., Bruggmann, R., Dubchak, I., Grimwood, J., et al., 2009. The Sor-ghum bicolor genome and the diversification of grasses. Nature 457, 551–556.

Pemberton, T.A., Srivastava, D., Sanyal, N., Henzl, M.Y., Becker, D.F., Tanner, J.J., 2014.Structural studies of yeast Δ1-pyrroline-5-carboxylate dehydrogenase (ALDH4A1):active site flexibility and oligomeric state. Biochemistry 53, 1350–1359.

Prochnik, S.E., Umen, J., Nedelcu, A.M., Hallmann, A., Miller, S.M., et al., 2010. Genomicanalysis of organismal complexity in the multicellular green alga Volvox carteri. Sci-ence 329, 223–226.

Rejeb, I.B., Pastor, V., Mauch-Mani, B., 2014. Plant responses to simultaneous biotic andabiotic stress: molecular mechanisms. Plants 3, 458–475.

Rodrigues, S.M., Andrade, M.O., Gomes, A.P.S., DaMatta, F.M., Baracat-Pereira, M.C., Fontes,E.P.B., 2006. Arabidopsis and tobacco plants ectopically expressing the soybeanantiquitin-like ALDH7 gene display enhanced tolerance to drought, salinity, and oxi-dative stress. J. Exp. Bot. 57, 1909–1918.

Sohling, B., Gottschalk, G., 1993. Purification and characterization of a coenzyme-A-dependent succinate-semialdehyde dehydrogenase from Clostridium kluyveri. Eur.J. Biochem. 212, 121–127.

Stiti, N., Missihoun, T.D., Kotchoni, S.O., Kirch, H.H., Bartels, D., 2011. Aldehyde dehydroge-nases in Arabidopsis thaliana: biochemical requirements, metabolic pathways, andfunctional analysis. Front. Plant Sci. 2, 65.

Sunkar, R., Bartels, D., Kirch, H.H., 2003. Overexpression of a stress-inducible aldehyde de-hydrogenase gene from Arabidopsis thaliana in transgenic plants improves stress tol-erance. Plant J. Cell Mol. Biol. 35, 452–464.

Talfournier, F., Stines-Chaumeil, C., Branlant, G., 2011. Methylmalonate-semialdehyde de-hydrogenase from Bacillus subtilis: substrate specificity and coenzyme A binding.J. Biol. Chem. 286, 21971–21981.

Tian, F.-X., Zang, J.-L., Wang, T., Xie, Y.-L., Zhang, J., Hu, J.-J., 2015. Aldehyde dehydrogenasegene superfamily in populus: organization and expression divergence betweenparalogous gene pairs. PLoS One 10, e0124669.

Tsybovsky, Y., Krupenko, S.A., 2011. Conserved catalytic residues of the ALDH1L1 alde-hyde dehydrogenase domain control binding and discharging of the coenzyme.J. Biol. Chem. 286, 23357–233367.

Villarino, C.B.J., Jayasena, V., Coorey, R., Bell, S., Johnson, S.K., 2015. Nutritional, health andtechnological functionality of lupin flour addition to bread and other baked products:benefits and challenges. Crit. Rev. Food Sci. Nutr. http://dx.doi.org/10.1080/10408398.2013.814044.

Witkowski, A., Joshi, A.K., Smith, S., 2002. Mechanism of the β-ketoacyl synthase reactioncatalyzed by the animal fatty acid synthase. Biochemistry 41, 10877–10887.

Wood, A.J., Duff, R.J., 2009. The aldehyde dehydrogenase (ALDH) gene superfamily of themoss Physcomitrella patens and the algae Chlamydomonas reinhardtii and Ostreococcustauri. Bryologist 112, 1–11.

Yoshida, A., Rzhetsky, A., Hsu, L.C., Chang, C., 1998. Human aldehyde dehydrogenase genefamily. Eur. J. Biochem. 251, 549–557.

Zhang, J., Liu, Y., 2015. A QM/MM study of the catalytic mechanism of succinic semialde-hyde dehydrogenase from Synechococcus sp. PCC 7002 and Salmonella typhimurium.RSC Adv. 5, 101672–101682.

Zhang, Y., Mao, L., Wang, H., et al., 2012. Genome-wide identification and analysis of grapealdehyde dehydrogenase (ALDH) gene superfamily. PLoS One 7, e32153.