Updated March 10, 2006 by JW.

 

CESA proteins are at the end.

TIGR Rice Community Annotation spreadsheet is at the very end

 

General references: Hazen et al. (2002) Plant Physiol. 128:336 ; Keegstra and Walton (2006) Science, in press.

This web site address: /CSL_updates.htm

 

Please send corrections to walton@msu.edu

 

 

CSLA summary

 

 

Gene

GenBank BAC/WGS

cDNA/EST (incomplete)

introns

protein

Notes

OsCslA1

AP000366

(Xsome 2)   

 

8

521 aa

Richmond also calls this OsCslA1 but several Genbank entries call it OsCslA9 or OsCslA9-like.

OsCslA2

AC021893

(Xsome 10)

D22177

AF435640

8

580

 

OsCslA3

AP003509

AACV01013359

(Xsome 6)

 

8

551

 

OsCslA4

AC073556

AAL84294

(Xsome 3)

 

8

549

 

OsCslA5

AC084766

XM_470723

(Xsome 3)

 

 

10

574

 

OsCslA6

XM_467756*
AP005297* 

(Xsome 2)

AA749881

AF435648

CB645479

CB644141

CB684314

8

574

 

OsCslA7

AP004260

(Xsome 7)

C71923

AF435643

CB672774
CX118263

CB651046

9

585

 

OsCslA9

AP008212
AP004737

(Xsome 6)

CR280231
CX104183
 

8

527

AF432499

OsCslA11

AP004666

AP005757

XM_482559

(Xsome 8)

CF964708 
BX929094
CK061630 

 

8

570

 

The positions of eight introns are conserved in all of them. In addition, OsCslA5 is predicted to have two more introns near its 5’ end, and A7 to have one more intron in its 5’ end. The additional intron in A7 is confirmed by at least EST.

 

OsCslA4 has an atypical intron start of GC instead of GT near the 3’ end.

 

*OsCslA6: The predicted aa sequences of XM_470723 and AP005297 have 3 additional amino acids at first intron junction, which I think is wrong based on alignment with other OsCslA’s. Examination of available ESTs confirms that “TMQ” don’t belong in there.

 

 

>OsCslA1  521 aa

MEVNGGGAAGLPEAWSQVRAPVIVPLLRLAVAVCLTMSVLLFLERMYMAVVISGVKILRRRPDRRYRCDPIPDDDPELGTSAFPVVLIQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPVIKEMVRIECERWAHKGVNITYQIRENRKGYKAGALKEGMKHGYVRECEYVAIFDADFQPDPDFLRRTIPFLVHNSDIALVQARWRFVNADECLMTRMQEMSLDYHFTVEQEVSSSVCAFFGFNGTAGVWRVSAVNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFRFQQHRWSCGPANLFRKMLMEIVRNKKVTIWKKIHVIYNFFLIRKIIAHIVTFAFYCLIIPATIFVPEVRIPKWGCVYIPTIITLLNSVGTPRSFHLLFFWILFENVMSLHRTKATLIGLLEAGRANEWVVTEKLGNALKMKSSSKSSAKKSFMRVWDRLNVTELGVAAFLFSCGWYDLAFGKDHFFIYLFFQGAAFFIVGIGYVGTIVPQS

 

>OsCslA2  580 aa [this one was difficult]

MSTNGGAPSQKRSWLPSRPLLTTTTQTYPPPLLPFKKLHAPPTAARRSLPPAASKPMASSSSSSLPAAWAAAVRAWAVAPALRAAVWACLAMSAMLVAEAAWMGLASLAAAAARRLRGYGYRWEPMAAPPDVEAPAPAPAEFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIIIQVLDDSTDPFVKELVELECKEWASKKINIKYEVRNNRKGYKAGALRKGMEHTYAQLCDFVAIFDADFEPESDFLLKTMPYLLHNPKIALVQTRWEFVNYNVCLMTRIQKMSLDYHFKVEQESGSFMHAFFGFNGTAGVWRVSAINQSGGWKDRTTVEDMDLAVRASLKGWEFLYVGDIRVKSELPSTFQAYRHQQHRWTCGAANLFRKMAWEIITNKEVSMWKKYHLLYSFFFVRRAIAPILTFLFYCIVIPLSAMVPEVTIPVWGLVYIPTAITIMNAIRNPGSVHLMPFWILFENVMAMHRMRAALSGLLETARANDWVVTEKVGDQVKDELDVPLLEPLKPTECAERIYIPELLLALYLLICASYDFVLGNHKYYIYIYLQAVAFTVMGFGFVGTRTPCS

 

>OsCslA3  551 aa

MAMAGADGPTAGAAAAVRWRGGESLLLLLLRWPSSAELVAAWGAARASAVAPALAAASAACLALSAMLLADAVLMAAACFARRRPDRRYRATPLGAGAGADDDDDDEEAGRVAYPMVLVQIPMYNEREVYKLSIGAACGLSWPSDRLIVQVLDDSTDPTVKGLVELECKSWGNKGKNVKYEVRNTRKGYKAGALKEGLLRDYVQQCNYVAIFDADFQPEPDFLLRTIPYLVRNPQIGLVQAHWEFVNTSECLMTRIQKMTLHYHFKVEQEGGSSTFAFFGFNGTAGVWRISALEEAGGWKDRTTVEDMDLAVRAGLKGWKFVYLADVKVKSELPSNLKTYRHQQHRWTCGAANLFRKVGAEILFTKEVPFWWKFYLLYSFFFVRKVVAHVVPFMLYCVVIPFSVLIPEVTVPVWGVVYVPTTITLLHAIRNTSSIHFIPFWILFENVMSFHRTKAMFIGLLELGGVNEWVVTEKLGNGSNTKPASQILERPPCRFWDRWTMSEILFSIFLFFCATYNLAYGGDYYFVYIYLQAIAFLVVGIGFCGTISSNS

 

>OsCslA4. 549 aa.

MEGQWGRWRLAAAAAASSSGDQIAAAWAVVRARAVAPVLQFAVWACMAMSVMLVLEVAYMSLVSLVAVKLLRRVPERRYKWEPITTGSGGVGGGDGEDEEAATGGREAAAFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIIIQVLDDSTDPAIKDLVELECKDWARKEINIKYEIRDNRKGYKAGALKKGMEHIYTQQCDFVAIFDADFQPESDFLLKTIPFLVHNPKIGLVQTRWEFVNYDVCLMTRIQKMSLDYHFKVEQESGSSMHSFFGFNGTAGVWRVSAINEAGGWKDRTTVEDMDLAVRASLKGWQFLYVGDIRVKSELPSTFKAYRHQQHRWTCGAANLFRKMATEIAKNKGVSVWKKLHLLYSFFFVRRVVAPILTFLFYCVVIPLSVMVPEVSIPVWGMVYIPTAITIMNAIRNPGSIHLMPFWILFENVMAMHRMRAALTGLLETMNVNQWVVTEKVGDHVKDKLEVPLLEPLKPTDCVERIYIPELMVAFYLLVCASYDLVLGAKHYYLYIYLQAFAFIALGFGFAGTSTPCS

 

>OsCslA5   574 aa.

MEAGEAAGAVLFLLAAAVSLLAAVSTGALDFTYLVTVVGEGSSTSPGSGGGAWWREAWVGARSRAVAPALQVGVWACMVMSVMLVVEATYNSAVSVAARLVGWRPERWFKWEPLGGGAGAGDEEKGEAAAAAYPMVMVQIPMYNELEVYKLSIGAVCGLKWPKERLIIQVLDDSTDAFIKNLVELECEDWASKGLNIKYATRSGRKGFKAGALKKGMEWDYAKQCEYVAIFDADFQPEPDFLLRTVPFLMHNQNVALVQARWVFVNDRVSLLTRIQKTFLDYHFKAEQEAGSATFAFFSFNGTAGVWRTEAINDAGGWKDRTTVEDMDLAVRATLKGWKFIYLGDLRVKSELPSTYKAYCRQQFRWSCGGANLFRKMIWDVLVAKKVSSLKKIYILYSFFLVRRVVAPAVAFILYNVIIPVSVMIPELFLPIWGVAYIPTALLIVTAIRNPENLHTVPLWILFESVMSMHRLRAAVAGLLQLQEFNQWIVTKKVGNNAFDENNETPLLQKSRKRLINRVNLPEIGLSVFLIFCASYNLVFHGKNSFYINLYLQGLAFFLLGLNCVGTLPDHCCF

 

>OsCslA6   574 aa.

MQGSSTSILHFVPSDPTSTSVLDFLSPTPRGTSPVHDRRLHAGDLALRAGGDRLLVADTVAAVVESLVQAWRQVRMELLVPLLRGAVVACMVMSVIVLAEKVFLGVVSAVVKLLRRRPARLYRCDPVVVEDDDEAGRASFPMVLVQIPMYNEKEVYQLSIGAACRLTWPADRLIVQVLDDSTDAIVKELVRKECERWGKKGINVKYETRKDRAGYKAGNLREGMRRGYVQGCEFVAMLDADFQPPPDFLLKTVPFLVHNPRLALVQTRWEFVNANDCLLTRMQEMSMDYHFKVEQEAGSSLCNFFGYNGTAGVWRRQVIDESGGWEDRTTAEDMDLALRAGLLGWEFVYVGSIKVKSELPSTLKAYRSQQHRWSCGPALLFKKMFWEILAAKKVSFWKKLYMTYDFFIARRIISTFFTFFFFSVLLPMKVFFPEVQIPLWELILIPTAIILLHSVGTPRSIHLIILWFLFENVMALHRLKATLIGFFEAGRANEWIVTQKLGNIQKLKSIVRVTKNCRFKDRFHCLELFIGGFLLTSACYDYLYRDDIFYIFLLSQSIIYFAIGFEFMGVSVSS

 

>OsCslA7   585

MVEAGEIGGAAVFALAAAAALSAASSLGAVDFRRPLAAVGGGGAFEWDGVVPWLIGVLGGGDEAAAGGVSVGVAAWYEVWVRVRGGVIAPTLQVAVWVCMVMSVMLVVEATFNSAVSLGVKAIGWRPEWRFKWEPLAGADEEKGRGEYPMVMVQIPMYNELEVYKLSIGAACELKWPKDKLIVQVLDDSTDPFIKNLVELECESWASKGVNIKYVTRSSRKGFKAGALKKGMECDYTKQCEYIAIFDADFQPEPNFLLRTVPFLMHNPNVALVQARWAFVNDTTSLLTRVQKMFFDYHFKVEQEAGSATFAFFSFNGTAGVWRTTAINEAGGWKDRTTVEDMDLAVRASLNGWKFIYVGDIRVKSELPSTYGAYCRQQFRWACGGANLFRKIAMDVLVAKDISLLKKFYMLYSFFLVRRVVAPMVACVLYNIIVPLSVMIPELFIPIWGVAYIPMALLIITTIRNPRNLHIMPFWILFESVMTVLRMRAALTGLMELSGFNKWTVTKKIGSSVEDTQVPLLPKTRKRLRDRINLPEIGFSVFLIFCASYNLIFHGKTSYYFNLYLQGLAFLLLGFNFTGNFACCQ

 

>OsCslA9   527

MAAAGAVLPEQIAAMWEQVKAPVVVPLLRLSVAACLAMSVMLFVEKVYMSVVLVGVHLFGRRPDRRYRCDPIVAAGADNDDPELADANAAFPMVLIQIPMYNEREVYKLSIGAACGLSWPSDRVIVQVLDDSTDPVIKEMVQVECKRWESKGVRIKYEIRDNRVGYKAGALREGMKHGYVRDCDYVAIFDADFQPDPDFLARTIPFLVHNPDIALVQARWKFVNANECLMTRMQEMSLDYHFKVEQEVGSSTHAFFGFNGTAGVWRISAMNEAGGWKDRTTVEDMDLAVRAGLKGWKFVYLGDLMVKSELPSTFKAFRYQQHRWSCGPANLFRKMLVEIATNKKVTLWKKIYVIYNFFLVRKIIGHIVTFVFYCLVVPATVLIPEVEIPRWGYVYLPSIVTILNSIGTPRSLHLLIFWVLFENVMSLHRTKATLIGLLETGRVNEWVVTEKLGDALKLKLPGKAFRRPRMRIGDRVNALELGFSAYLSFCGCYDIAYGKGYYSLFLFLQSITFFIIGVGYVGTIVPH

 
>CslA11. BAC AP004666, 8 introns. 570 aa.

MSSSGGGGVAEEVARLWGELPVRVVWAAVAAQWAAAAAAARAAVVVPPVRALVAVSLAMTVMILAEKLFVAAVCLAVRAFRLRPDRRYKWLPIGAAAAAASSEDDEESGLVAAAAAFPMVLVQIPMFNEREVYKLSIGAACSLDWPSDRVVIQVLDDSTDLVVKDLVEKECQKWQGKGVNIKYEVRGNRKGYKAGALKEGLKHDYVKECEYIAMFDADFQPESDFLLRTVPFLVHNSEIALVQTRWKFVNANECLLTRFQEMSLDYHFKYEQEAGSSVYSFFGFNGTAGVWRIAAIDDAGGWKDRTTVEDMDLAVRATLQGWKFVYVGDVKVKSELPSTFKAYRFQQHRWSCGPANLFKKMMVEILENKKVSFWNKIHLWYDFFFVGKIAAHTVTFIYYCFVIPVSVWLPEIEIPLWGVVYVPTVITLCKAVGTPSSFHLVILWVLFENVMSLHRIKAAVTGILEAGRVNEWVVTEKLGDANKTKPDTNGSDAVKVIDVELTTPLIPKLKKRRTRFWDKYHYSEIFVGICIILSGFYDVLYAKKGYYIFLFIQGLAFLIVGFDYIGVCPP

 
 
 
 

OsCslC summary

 

gene

GenBank

ESTcDNA

introns

protein

Notes

OsCslC1

AP003377

AP008207

(Xsome 1)

                

 

4

690

BK000086

OsCslC2

AP005886
AP005568

(Xsome 9)

AI978402

AF435650
(1.8 kb)

4

698

 

OsCslC3

AP004013

(Xsome 8)

 

4

745

 

OsCslC4

AC122144

AC108884

 pseudogene

 

xxxxx

 

OsCslC5

AC034258

 pseudogene

 

xxxxx

 

OsCslC6

AC098694

AC078891

(Xsome 10)

 

pseudogene

 

xxxxx

 

OsCslC7

AC108873

AP008211

(Xsome 5)

C74862

AF435642

(1.1)

AAL38527
XM_475689

 

4

688

 

OsCslC8

AC083751

(Xsome 2)

 pseudogene

 

xxxxx

 

OsCslC9

AC133450

(Xsome 3)

  
  
  
  
AAAA02011089 (aligns perfectly
 with 8085)
DP000009 – no extra Ala
AP008209 
no extra Ala
 

 

AU068180
CK051466

AF435641
(2.0)(lacks the Ala)

CK008085 (has Ala) 
CK056494 (lacks Ala)
AK121805
(has the Ala) 
AF435652S1
 (lacks the Ala)
 
 

 

4

595 or 596

 

AAT85054 (has AAA); AF435641 (has AA)

 

Our genome sequence
 has one less codon
(an Ala near the N
 terminus) than some 
entries in GenBank: 
MAPWSGLWGGKLAAGESP vs.
 MAPWSGLWGGKLAAAGESP 
There are ESTs for 
both forms. The extra 
Ala is due to 
insertion of 
nucleotides CCG. Based on the fact that all genomic sequences lack the extra Ala, and that 2 other CslC’s (1  and 10) have only two 
Ala in this location 
(both have VAAG vs. 
LAAG/LAAAG for C9), I 
think that 595 is the 
correct length, not 
596. Or else it’s a 
natural variant. 

 

OsCsl10

AP005309

AP008213

(Xsome 7)

 

4

686

yes

 

Intron borders are conserved.

 

>OsCslC1
MARWWGGEGRGGSGTPVVVKMESPEWAISEVEAGAAAPGSPAAGGKAGRGKNARQITWVLLLKAHRAAGKLTGAASAALSVAAAARRRVAAGRTDSDDAAAAPPGESPALRARFHGFLRAFLLLSVLLLAVDVAAHAQGWHAVVPDLLAVEGLFAAAYASWLRVRLEYLAPGLQFLANACVVLFLIQSADRLILCLGCLWIKLKGIKPVPKASGGGGGGKGSDDVEAGADEFPMVLVQIPMCNEKEVYQQSIGAVCNLDWPRSNFLVQVLDDSDDAATSALIKEEVEKWQREGVRILYRHRVIRDGYKAGNLKSAMNCSYVKDYEFVVIFDADFQPQADFLKRTVPHFKGNEDVGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYINDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSKIGVWKKFNLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPAWVVCYIPATMSLLNILPAPKSFPFIVPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVSLVEKQPKQQRVGSAPNLDSLAKESHPKKDSKKKKHNRIYQKELALSFLLLTAAARSLLSVQGIHFYFLLFQGVSFLVVGLDLIGEQVE

 

>OsCslC2
MAPPGVGVGVAYLWGKGRGGRKGTPVVVTMESPNYSVVEVDGPDAEAELRTAAVAMDKGGGRGRSRSRTARQLTWVLLLRARRAAGRLASFAAAAARRFRRSPADAADELGRGRGRLMYGFIRGFLALSLLALAVELAAYWNGWRLRRPELHVPEAVEIEGWAHSAYISWMSFRADYIRRPIEFLSKACILLFVIQSMDRLVLCLGCFWIKLRKIKPRIEGDPFREGSGYQHPMVLVQIPMCNEKEVYEQSISAACQLDWPREKFLIQVLDDSSDESIQLLIKAEVSKWSHQGVNIVYRHRVLRTGYKAGNLKSAMSCDYVKDYEFVAIFDADFQPTPDFLKKTIPHFEGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAGVWRIQALEESGGWLERTTVEDMDIAVRAHLNGWKFIFLNDVKVLCELPESYEAYRKQQHRWHSGPMHLFWLCLPDILTAKISSWKKANLILLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPVWVICYVPVCMSFLNILPSPRSFPFIVPYLLFENTMSVTKFNAMVSGLFKLGSSYEWIVTKKSGRSSESDLSTAVERDTKDLTLPRLQKQISESELIDLKMQKERQEKAPLGAKKANKIYKKELALSLLLLTAATRSLLSAQGIHFYFLLFQGVSFLFVGLDLIGEQID

 

>OsCslC3
MAPPPNTYSESWWGGKEERGTPVVVKMDNPYSLVEIDGPGMAAPSEKARGKNAKQLTWVLLLRAHRAVGCVAWLAAGFWAVLGAVNRRVRRSRDADAEPDAEASGRGRAMLRFLRGFLLLSLAMLAFETVAHLKGWHFPRSAAGLPEKYLRRLPEHLQHLPEHLRRHLPEHLRMPEKEEIEGWLHRAYVAWLAFRIDYIAWAIQKLSGFCIALFMVQSVDRLVLCLGCFWIKLRGIKPVADTSISNDDIEATAGDGGGYFPMVLIQMPMCNEKEVYETSISHVCQIDWPRERMLVQVLDDSDDETCQMLIKAEVTKWSQRGVNIIYRHRLNRTGYKAGNLKSAMSCDYVRDYEFVAIFDADFQPNPDFLKLTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVYLSFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLNGWKFIFLNDVKVLCELPESYQAYRKQQHRWHSGPMQLFRLCLPAVFKSKISTWKKANLVMLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPIWVICYVPVIMSVLNILPAPKSFPFVIPYLLFENTMSVTKFNAMVSGLFQLGSSYEWVVTKKAGRTSSESDILALAEAADADARPPPAKLHRGVSEGGLKEWAKLHKEQEDATAAAAAAAAPGTPVKKSKAAKAPNRIFKKELALAFLLLTAATRSLLSAQGLHFYFLLFQGVTFLAVGLDLIGEQvs

 

>OsCslC7
MAPSWWGMDGRKRHRPLTGGEAVDKKESATGVPTRWWCSREVGTGFAAGDGKTGPAKRRQIKWVLMLKAHRAAGRLTGAASAALAVASAARRRVASGRTDADAAPGESTALRARSYGCIRVSLVLSLLLLAVEVAAYLQGWHLEEVASLLAVDGLFAASYAGWMRLRLDYLAPPLQFLTNACVALFMVQSIDRLVLCLGCFWIRFKGIKPVPQAAAAGKPDVEAGAGDYPMVLVQMPMCNEREVYQQSIGAVCNLDWPKSNFLVQVLDDSDDATTSALIKEEVEKWQREGVRIIYRHRVIRDGYKAGNLKSAMNCSYVKDYEFVVIFDADFQPQADFLKRTVPHFKGKDDVGLVQARWSFVNKDENLLTRLQNVNLCFHFEVEQQVNGAFLNFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFVFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSKIGFWKKFNLIFLFFLLRKLILPFYSFTLFCVILPMTMFVPEAELPAWVVCYIPATMSILNILPAPKSFPFIVPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVGLVEKHSKQQRVGSAPNLDALTKEESNPKKDSKKKKHNRIYRKELALSFLLLTAAARSLLSAQGIHFYFLLFQGVSFLVVGLDLIGEQVE

 

>OsCslC9. 596 aa or 595 aa.
MAPWSGLWGGKLAAAGESPVLRSRFYAFIRAFVVLSVLLLIVELGAYINGWDDLAASALALPVIGVESLYASWLRFRATYVAPFIQFLTDACVVLFLIQSADRLIQCLGCFYIHLKRIKPNPKSPALPDAEDPDAAYYPMVLVQIPMCNEKEVYQQSIAAVCNLDWPRSNFLVQVLDDSDDPTTQTLIREEVLKWQQNGARIVYRHRVLRDGYKAGNLKSAMSCSYVKDYEFVAIFDADFQPNPDFLKRTVPHFKDNDELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIFLNFFGFNGTAGVWRIKALDDSGGWMERTTVEDMDIAVRAHLRGWKFIFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCLPDIIKCKIVFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPALMSLLNILPSPKSFPFIIPYLLFENTMSVTKFNAMISGLFQLGNAYEWVVTKKSGRSSEGDLISLAPKELKHQKTESAPNLDAIAKEQSAPRKDVKKKLNRIYKKELALSLLLLTAAARSLLSKQGIHFYFLLFQGISFLLVGLDLIGEQIE

 

source of the variation between 595 and 596 aa forms of CslC9: 
...GGGCGGGAAGTTGGCCG---CCGGCGAGAGCCCCGTGCTCCGCTCCCGCTTCTACGCGT TCATCAGGGCATTCGTCGCCCT...

 

>OsCslC10. BAC AP005309, chromosome 7.
MAPWSGFWAASRPALAAAAAGGTPVVVKMDNPNWSISEIDADGGEFLAGGRRRGRGKNAKQITWVLLLKAHRAAGCLAWLASAAVALGAAARRRVAAGRTDDADAETPAPRSRLYAFIRASLLLSVFLLAVELAAHANGRGRVLAASVDSFHSSWVRFRAAYVAPPLQLLADACVVLFLVQSADRLVQCLGCLYIHLNRIKPKPISSPAAAAAALPDLEDPDAGDYYPMVLVQIPMCNEKEVYQQSIAAVCNLDWPRSNILVQVLDDSDDPITQSLIKEEVEKWRQNGARIVYRHRVLREGYKAGNLKSAMSCSYVKDYEYVAIFDADFQPYPDFLKRTVPHFKDNEELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIFINFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLNGWKFVFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCLPDIIRCKIAFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFIPEAELPDWVVCYIPALMSFLNILPAPKSFPFIIPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLIALAPKELKQQKILDLTAIKEQSMLKQSSPRNEAKKKYNRIYKKELALSLLLLTAAARSLLSKQGIHFYFLMFQGLSFLLVGLDLIGEDVK

 

 

 

OsCSLD summary

 

 gene

GenBank #

EST #

cDNA

introns

protein

Notes

OsCslD1

AC027037

(Xsome 10)

CA754914

 

1

1127 aa

OSJNBa0035H01
AAL58185

 

OsCslD2

AP001552

(Xsome 6)

JSC EST14

AA753598

(about 18 at TIGR)

AK105393 (4089 bp) AK102134 (3968 bp) AK102695 (2757 bp)

2

1170

Called D4 at TIGR. Os06g02180 

OsCslD3

AP004459

AP008214
(Xsome 8)

AA735599

 

1

1115

BK000093

 

OsCslD4

AL845342
(Xsome 12)

JSC EST9

AU078363

=AU082165

AU082190

=AU082189

AF435644

(1196 bp)

1

1215 

 

OsCslD5

AP005449

AP008212
(Xsome 6)

none?

none?

0

1012

Called D4 at TIGR. Os06g22980980 Os06g22980

 

 

>OsCslD1. 1 intron. 1127 aa
MASKGILKNGGKPPTAPSSAAPTVVFGRRTDSGRFISYSRDDLDSEISSVDFQDYHVHIPMTPDNQPMDPAAGDEQQYVSSSLFTGGFNSVTRAHVMEKQASSARATVSACMVQGCGSKIMRNGRGADILPCECDFKICVDCFTDAVKGGGGVCPGCKEPYKHAEWEEVVSASNHDAINRALSLPHGHGHGPKMERRLSLVKQNGGAPGEFDHNRWLFETKGTYGYGNAIWPEDDGVAGHPKELMSKPWRPLTRKLRIQAAVISPYRLLVLIRLVALGLFLMWRIKHQNEDAIWLWGMSIVCELWFALSWVLDQLPKLCPINRATDLSVLKDKFETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAADYPVDKLACYVSDDGGALLTFEAMAEAASFANLWVPFCRKHEIEPRNPDSYFNLKRDPFKNKVKGDFVKDRRRVKREYDEFKVRVNGLPDAIRRRSDAYHAREEIQAMNLQREKMKAGGDEQQLEPIKIPKATWMADGTHWPGTWLQASPEHARGDHAGIIQVMLKPPSPSPSSSGGDMEKRVDLSGVDTRLPMLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRIALYGFDPPRSKDHTTPWSCCLPRRRRTRSQPQPQEEEEETMALRMDMDGAMNMASFPKKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPRETLDASIVAEAISVVSCWYEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTHRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALFASSKMKVLQRIAYLNVGIYPFTSVFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLIITITLCLLAMLEIKWSGIALEEWWRNEQFWLIGGTSAHLAAVLQGLLKVIAGIEISFTLTSKQLGDDVDDEFAELYAVKWTSLMIPPLTIIMINLVAIAVGFSRTIYSTIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWIAIKPPSAQANSQLGGSFSFP


>OsCslD2. 2 introns. 1170 aa.
MASSGGGGLRHSNSSRLSRMSYSGEDGRAQAPGGGGDRPMVTFARRTHSGRYVSYSRDDLDSELGNSGDMSPESGQEFLNYHVTIPATPDNQPMDPAISARVEEQYVSNSLFTGGFNSVTRAHLMDKVIESEASHPQMAGAKGSSCAINGCDAKVMSDERGDDILPCECDFKICADCFADAVKNGGACPGCKDPYKATELDDVVGARPTLSLPPPPGGLPASRMERRLSIMRSQKAMTRSQTGDWDHNRWLFETKGTYGYGNAIWPKENEVDNGGGGGGGGGLGGGDGQPAEFTSKPWRPLTRKLKIPAGVLSPYRLLILIRMAVLGLFLAWRIKHKNEDAMWLWGMSVVCELWFGLSWLLDQLPKLCPVNRATDLAVLKDKFETPTPSNPNGRSDLPGLDIFVSTADPEKEPPLVTANTILSILAADYPVEKLSCYVSDDGGALLTFEAMAEAASFANMWVPFCRKHDIEPRNPESYFNLKRDPYKNKVRSDFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKRQREAALDDVVEAVKIPKATWMADGTHWPGTWIQPSAEHARGDHAGIIQVMLKPPSDDPLYGTSGEEGRPLDFTEVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRSSAVMSNGPFILNLDCDHYVYNSQAFREGMCFMMDRGGDRIGYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGIMGPVYVGTGCLFRRIALYGFDPPRSKEHSGCCSCCFPQRRKVKTSTVASEERQALRMADFDDEEMNMSQFPKKFGNSNFLINSIPIAEFQGRPLADHPGVKNGRPPGALTVPRDLLDASTVAEAISVISCWYEDKTEWGQRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRKMKFLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVRTLNVTFLTYLLVITLTMCMLAVLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVIAGIEISFTLTSKSGGDEADDEFADLYIVKWTSLMIPPIVIMMVNLIAIAVGFSRTIYSEIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLLAITISLLWVAINPPSQNSQIGGSFTFP


>OsCslD3. 1 intron. 1115 aa. MSTGPGKKAIRNAGGVGGGAGPSAGGPRGPAGQAVKFARRTSSGRYVSLSREDIDMEGELAADYTNYTVQIPPTPDNQPMLNGAEPASVAMKAEEQYVSNSLFTGGFNSATRAHLMDKVIESSVSHPQMAGAKGSRCAMPACDGSAMRNERGEDVDPCECHFKICRDCYLDAQKDGCICPGCKEHYKIGEYADDDPHDGKLHLPGPGGGGNKSLLARNQNGEFDHNRWLFESSGTYGYGNAFWPKGGMYDDDLDDDVDKLGGDGGGGGGGGPLPEQKPFKPLTRKIPMPTSVISPYRIFIVIRMFVLLFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPVNRSTDLAVLKEKFETPSPSNPHGRSDLPGLDVFVSTADPEKEPVLTTATTILSILAVDYPVEKLACYVSDDGGALLTFEAMAEAASFANVWVPFCKKHDIEPRNPDSYFSVKGDPTKGKRRNDFVKDRRRVKREFDEFKVRINGLPDSIRRRSDAFNAREDMKMLKHLRETGADPSEQPKVKKATWMADGSHWPGTWAASAPDHAKGNHAGILQVMLKPPSPDPLYGMHDDDQMIDFSDVDIRLPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNGPFMLNFDCDHYINNAQAVREAMCFFMDRGGERIAYIQFPQRFEGIDPSDRYANNNTVFFDGNMRALDGLQGPMYVGTGCMFRRFAVYGFDPPRTAEYTGWLFTKKKVTTFKDPESDTQTLKAEDFDAELTSHLVPRRFGNSSPFMASIPVAEFQARPLADHPAVLHGRPSGALTVPRPPLDPPTVAEAVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDAFLGTAPINLTDRLHQVLRWATGSVEIFFSRNNAFLASRKLMLLQRISYLNVGIYPFTSIFLLVYCFIPALSLFSGFFIVQKLDIAFLCYLLTMTITLVALGILEGLLKVMAGIEISFTLTAKAAADDNEDIYADLYIVKWSSLLIPPITIGMVNIIAIAFAFARTIYSDNPRWGKFIGGGFFSFWVLAHLNPFAKGLMGRRGKTPTIVFVWSGLLSITVSLLWVAISPPEANSNGGARGGGFQFP


>OsCslD4. 1 intron. 1215 aa. MSRRLSLPAGAPVTVAVSPVRSPGGDAVVRRGSGLTSPVPRHSLGSSTATLQVSPVRRSGGSRYLGASRDGGADESAEFVHYTVHIPPTPDRATASVASEAEAAAEAEEVHRPQRSYISGTIFTGGLNCATRGHVLNFSGEGGATAASRAAASGNMSCKMRGCDMPAFLNGGRPPCDCGFMICKECYAECAAGNCPGCKEAFSAGSDTDESDSVTDDDDDEAVSSSEERDQLPLTSMARKFSVVHSMKVPGAAANGNGKPAEFDHARWLFETKGTYGYGNALWPKDGHAHSGAGFVAADEPPNFGARCRRPLTRKTSVSQAILSPYRLLIAIRLVALGFFLAWRIRHPNPEAVWLWAMSVACEVWFAFSWLLDSLPKLCPVHRAADLAVLAERFESPTARNPKGRSDLPGIDVFVTSADPEKEPPLVTANTILSILAADYPVEKLACYLSDDGGALLSFEALAETASFARTWVPFCRKHGVEPRCPEAYFGQKRDFLKNKVRVDFVRERRKVKREYDEFKVRVNSLPEAIRRRSDAYNAGEELRARRRQQEEAAAAAAAGNGELGAAAVETAAVKATWMSDGSHWPGTWTCPAADHARGDHAGIIQAMLAPPTSEPVMGGEAAECGGLIDTTGVDVRLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYVHNSSALREGMCFMLDRGGDRVCFVQFPQRFEGVDPSDRYANHNLVFFDVSMRAMDGLQGPMYVGTGCVFRRTALYGFSPPRATEHHGWLGRRKIKLFLTKKKSMGKKTDRAEDDTEMMLPPIEDDDGGADIEASAMLPKRFGGSATFVASIPVAEYQGRLLQDTPGCHHGRPAGALAVPREPLDAATVAEAIGVISCFYEEKTEWGRRIGWIYGSVTEDVVTGYRMHNRGWRSVYCVTPRRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALFASPRMKLLQRVAYFNAGMYPFTSVFLLAYCLLPAVSLFSGKFIVQRLSATFLAFLLVITLTLCLLALLEIKWSGITLHEWWRNEQFWVIGGTSAHPAAVLQGLLKVIAGVDISFTLTSKPGNGGGDGGVGGEGNDDEAFAELYEVRWSYLMVPPVTIMMVNAVAIAVAAARTLYSEFPQWSKLLGGAFFSFWVLCHLYPFAKGLLGRRGRVPTIVFVWSGLISMIISLLWVYINPPAGARERIGGGGFSFP


>OsCslD5. 0 intron. 1012 aa
MSVDYANYTVLMPPTPDNQPSGGAPPAAPSAGGARPGDLPLPPYGSSSSSRLVNRRGGGDDGAKMDRRLSTARVPAPSSNKSLLVRSQTGDFDHNRWLFETKGTYGIGNAYWPQDNVYGDDGGGGAVKMEDLVEKPWKPLSRKVPIPPGILSPYRLLVLVRFVALFLFLVWRVTNPNMDALWLWGISIVCEFWFAFSWLLDQMPKLNPINRAADLAALKEKFESPSPTNPTGRSDLPGLDVFISTADPYKEPTLVTANTLLSILATEYPVEKLFVYISDDGGALLTFESMAEACAFAKVWVPFCRKHSIEPRNPDSYFTQKGDPTKGKKRPDFVKDRRWIKREYDEFKIRVNSLPDLIRRRANALNARERKLARDKQAAGDADALASVKAATWMADGTHWPGTWLDPSPDHAKGDHASIVQVMIKNPHHDVVYGEAGDHPYLDMTDVDMRIPMFAYLSREKRAGYDHNKKAGAMNAMVRASAILSNGPFMLNFDCDHYIYNCQAIREAMCYMLDRGGDRICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCLFRRYAIYGFNPPRAIEYRGTYGQTKVPIDPRQGSEAMPGAGGGRSGGGSVGGDHELQALSTAHPDHEAPQKFGKSKMFIESIAVAEYQGRPLQDHPSVLNGRPPGALLMPRPPLDAATVAESVSVISCWYEDNTEWGQRLGWIYGSVTEDVVTGYRMHNRGWRSVYCITRRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSKNNAVLASRRLKFLQRMAYLNVGIYPFTSLFLIMYCLLPALSLFSGQFIVATLDPTFLSYLLLITITLMLLCLLEVKWSGIGLEEWWRNEQFWVIGGTSAHLAAVLQGLLKVVAGIEISFTLTAKAAAEDDDDPFAELYLIKWTSLFIPPLAVIGINIIALVVGVSRTVYAEIPQYSKLLGGGFFSFWVLAHYYPFAKGLMGRRGRTPTIVYVWAGLISITVSLLWITISPPDDSVAQGGIDV  

 

 

OsCslE summary

 

 

Of the original “E” genes (Hazen et al.), only OsCslE1, E2, and E6 appear at this point to be valid genes. The original E3, E4, or E5 are not valid.

 

Intron borders are conserved. OsCslE1 and E6 each have six introns whose boundaries are the same with each other and with six of the seven introns in E2. (E2 has one extra intron.)

 

>OsCslE1. 737 aa.   cDNA AK102766
METTAAATAAERRRPLFTTEELGGRAVYRVQAATVAAGILLVLYYRATRVPAAGEGRAAWLGMAAAELWFAVYWVIAQSVRWRPFRRRTFRDRLAERYEQNLPGVDIFVCTADPQSEPPSLVISTILSVMAYNYPSEKISVYLSDDGGSILTFYALWEASIFAKKWLPFCKRYNIEPRSPAAYFSESKVHHNLCIPKEWALIKNLYEEMRERIDTATMSGKIPEEMKLKHKGFDEWNSDFTLKNHQPIVQILIDGKNRNAIDDDRNVLPTMVYVAREKRPQYHHNFKAGALNALIRVSSVISDSPVILNVDCDMYSNNSDSIRDALCFFLDEEMGQKIGFVQYPQIFNNMTQNDIYGNSFNVSYHVEMCGLDSVGGCLYIGTGCFHRREILCGRIFSKDYKENWNRGIKERGKENINEIEEKATSLVTCTYEHRTQWGNDIGVKYGFPAEDIITGLAIHCRGWESAFINPKRAAFLGLAPSTLAQNILQHKRWSEGNLTIFLSKYCSFLFGHGKIKLQLQMGYCICGLWAANSLPTLYYVVIPSLGLVKGTPLFPQIMSPWATPFIYVFCVKTLYGLYEALLSGDTLKGWWNGQRMWMVKSITSYLYGFIDTIRKCVGMSKMSFEVTAKVSGHDEAKRYEQEILEFGSSSPEYVIIATVALLNFVCLVGGLSQIMAGVWNMPWNVFLPQAILCGMIVIINMPIYEAMFLRKDNGRIPTAVTLASIGFVMLAFLVPIV

>OsCslE2. AAL25130. AK101487
MAGSGGGVVSGGRQRGPPLFATEKPGRMAMAAYRVSAATVFAGVLLIWLYRATHLPPGGGDGVRRWAWLGMLAAELWFGFYWVLTLSVRWCPVYRRTFKDRLAQSYSEDELPSVDIFVCTADPTAEPPMLVISTVLSVMAYDYLPEKLNIYLSDDAGSVLTFYVLCEASEFAKHWIPFCKKYKVEPRSPAAYFAKVASPPDGCGPKEWFTMKELYKDMTDRVNSVVNSGRIPEVPRCHSRGFSQWNENFTSSDHPSIVQilidsnkqkavdidgnalptlvymarekkpqkqhhfkagslnalIRVSSVISNSPIIMNVDCDMYSNNSESIRDALCFFLDEEQGQDIGFVQYPQNFENVVHNDIYGHPINVVNELDHPCLDGWGGMCYYGTGCFHRREALCGRIYSQEYKEDWTRVAGRTEDANELEEMGRSLVTCTYEHNTIWGIEKGVRYGCPLEDVTTGLQIQCRGWRSVYYNPKRKGFLGMTPTSLGQILVLYKRWTEGFLQISLSRYSPFLLGHGKIKLGLQMGYSVCGFWAVNSFPTLYYVTIPSLCFLNGISLFPEKTSPWFIPFAYVMVAAYSCSLAESLQCGDSAVEWWNAQRMWLIRRITSYLLATIDTFRRILGISESGFNLTVKVTDLQALERYKKGMMEFGSFSAMFVILTTVALLNLACMVLGISRVLLQESPGGLETLFLQAVLCVLIVAINSPVYEALFLRRDKGSLPASVARVSICFVLPLCILSICK

 

>OsCslE6. 718 aa
METTTTERRRLFATEKVGGRAVYRLQAATVAAGILLVLYYRATRVPAAGEGRAAWLGMAAAELWFAVYWVITQSVRWCPVRRRTFKNRLAERYKENLPGVDVFVCTADPHAEPPSLVISTILSVMAYNYPSEKISVYLSDDGGSILTFYALWEASMFAKKWLPFCRRYNIEPRSPAAYFSESEGHHNLCSPKEWSFIKRIDSAVMSGKIPEEFKLKMKGFDEWNSEMTSKNHQPIVLIDGKSQNAVDDDGNVLPTLVYMAREKSPQYHHNFKAGALNALIRVSALISDSPVILNVDCDMYSNNSDSIRDALCFFLDEEMSHKIGFVQYPQNYNNMTKNNIYGNSLNVINHVEMRGLDSAGGCLYIGTGCFHRREILCGKKFSKDYKEDWGRGIKERGHENIDEIEEKAKSLATCTYELRTQWGNEIGVKYGCPVEDVITGLAIHCRGWESVYMEPQRAAFVGVAPATLAQTILQHKRWSEGNFTIFLSKHNTFLFGHGKISLQLQMGYCIYGLWAANSLPTIYYVMIPALGLVKGTPLFPEIMSPWATPFIYVFCVKTLYSLYEALLSGDTLKGWWNGQRMWMVKRITSYLYGFIDTIRKLLGLSKMSFEITAKVSDGDEAKRYEQEILEFGSSSPEFVIIATVALLNFVCLVAGLSKIMAGVWNVFLPQVILCGLIVITNIPIYEAMFVRKDKGRIPLPVTLASIGFVMLAFLLPIV